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TITLE OF INVENTION 



P. GINGIVALIS POLYPEPTIDES AND NUCLEOTIDES 



APPLICANT(S) FOR DO/EO/US 



ROSS et al 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 
S This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

□ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 



This is an express request to begin national examination procedures (35 U.S.C. 371(f) at any time rather than delay 
examination until the expiration of the applicable time limit set in 35 U.S.C. 371 (b) and PCT Articles 22 and 39(1 ). 



4. □ A proper Demand for International Preliminary Examination was made by the 1 9 th month 

from the earliest claimed priority date. 

5. A copy of the International Application as filed (35 U.S.C. 371 (c)(2)). 

f a. [X] is transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ has been transmitted by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 

6. □ A translation of the International Application into English (35 U.S.C. 371 (c)(2)). 

7. □ Amendments to the claims of the International Application under PCT Article 1 9 (35 U.S.C. 371 (c)(3)). 

a. □ are transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ have been transmitted by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired, 
d- □ have not been made and will not be made. 

8. □ A translation of the amendments to the claims under PCT Article 1 9 (U.S.C. 371 (c)(3)). 

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371 (c)(4)). 

1 0. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 

(35 U.S.C. 371(c)(5)). 

Items 11. To 16. Below concern document(s) or information included: 

^ 1 . □ An Information Disclosure Statement under 37 C.F.R. 1 .97 and 1 .98. 

12. □ An assignment document for recording. A separate cover sheet in compliance with 

37 C.F.R. 3.28 and 3.31 is included. 9 

13. ^ A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

1 5. □ A change of power of attorney and/or address letter. 
1*6. □ Other items or information. 
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INTERNATIONAL APPLICATION NO 

PCT/AU98/01023 



ATTORNEY'S DDCi 
47-138 



17. Eg] The following fees are submitted: 



CALCULATIONS PTO use only 



BASIC NATIONAL FEE (37 C.F.R. 1 .492(a)(1)-(5): 

-- Neither international preliminary examination fee (37 C.F.R. 1 .482) 
nor international search fee (37 C.F.R. 1 .445(a)(2)) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $970.00 

-- International preliminary examination fee (37 C.F.R. 1 .482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO $840.00 

-- International preliminary examination fee (37 C.F.R. 1.482) not paid to USPTO 

but international search fee (37 C.F.R. 1 .445(a)(2) paid to USPTO $690.00 

- International preliminary examination fee paid to USPTO (37 C.F.R. 1 .482) 

but all claims did not satisfy provisions of PCT Article 33(1 )-(4) $670.00 

- International preliminary examination fee paid to USPTO (37 C.F.R. 1 .482) 

and all claims satisfied provisions of PCT Article 33(1 )-(4) $96.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 

Surcharge of $130.00 for furnishing the oath or declaration later than □ 20 □ 30 
months from the earliest claimed priority date (37 C.F.R. 1 .492(e)). 



970.00 



0.00 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total Claims 



34 



-20 = 



14 



X 



$18.00 



252.00 



Independent Claims 



-3 = 



MULTIPLE DEPENDENT CLAIMS(S) (if applicable) 



$78.00 



0.00 



$260.00 



0.00 



TOTAL OF ABOVE CALCULATIONS = 



1222.00 



Reduction by 1 / 2 for filing by small entity, if applicable. A Small Entity Statement 
must also be filed (Note 37 C.F.R. 1 .9, 1 .27, 1 .28). 



0.00 



SUBTOTAL = 



1222.00 



Processing fee of $130.00, for furnishing the English Translation later than □ 20 □ 30 
months from the earliest claimed priority date (37 C.F.R. 1 .492(f)). + 



0.00 



TOTAL NATIONAL FEE = 



1222.00 



Fee for recording the enclosed assignment (37 C.F.R. 1 .21 (h)). The assignment must be 
accompanied by an appropriate cover sheet (37 C.F.R. 3.28, 3.31 ). $40.00 per property 



0.00 



Fee for Petition to Revive Unintentionally Abandoned Application ($1210.00 - Small Entity = $605.00) 



0.00 



TOTAL FEES ENCLOSED = 



1222.00 



Amount to be: 
refunded 



Charged 



d. 



E3 
□ 

□ 



A check in the amount of $1222.00 to cover the above fees is enclosed. 

Please charge my Deposit Account No. 14-11 40 in the amount of $ 

form is enclosed. 



to cover the above fees. A duplicate copy of this 



NOTE: 



The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment to 
Deposit Account No. 14-1140 . A duplicate copy of this form is enclosed. 

The entire content of the foreign application(s), referred to in this application is/are hereby incorporated by reference in this 
application. 

Where an appropriate time limit under 37 C.F.R. 1.494 or 1.495 has not been met, a petition to revive (37 C.F.R. 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO: 

NIXON & VANDERHYE P.C. 
11 00 North Glebe Road, 8 th Floor 
Arlington, Virginia 22201 
Telephone: (703) 816-4000 




Leonard C. Mitchard 



NAME 



29,009 



June 12, 2000 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re Patent Application of 

ROSS eta I Atty. Ref.: 47-138 

Serial No. Unassigned Group: 

Filed: June 12, 2000 Examiner: 

For: P. GINGIVALIS POLYPEPTIDES AND 
NUCLEOTIDES 

■a-****-***-*** 

t 

June 12, 2000 / 

Assistant Commissioner for Patents 
Washington, DC 20231 

PRELIMINARY AMENDMENT 

Sir: 

Please amend the above application as follows: 



IN THE CLAIMS 

Claim 11, lines 2 and 3, delete "any one of claims 1 to 10" and replace by -- 
claim 1 -. 

Claim 13, line 2, delete "or claim 12". 

Claim 16, line 3, delete "any one of claims 1 to 10" and replace by --claim 

1 -. 

Claim 17, lines 2 and 3, delete "claimed in claim 11 or claim 12" and 
replace by --defined above--. 
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ROSS et al- 

Serial No. Unassigned 

Claim 18, line 1 delete "or claim 17". 

Claim 19, lines 2 and 3 delete "any one of claims 16 or 18" and replace by - 
-claim 16--. 

Claim 22, line 3, delete "or claim 12". 
Claim 24, line 2, delete "or claim 23". 

Claim 27, lines 1 and 2, delete "any one of claims 1 to 10" and replace by -- 
claim 1--. 

Claim 30, lines 1 and 2, delete "any one of claims 27 to 29" and replace by 
--claim 27--. 

Claim 34, line 4, delete "or claim 33" 
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ROSS et al - 

Serial No. Unassigned 



REMARKS 



The above amendments have been made to place the application in a more 
traditional format. 



LCM:lks 

1100 North Glebe Road, 8th Floor 
Arlington, VA 22201-4714 
Telephone: (703) 816-4000 
Facsimile: (703)816-4100 



Respectfully submitted 



NIXON & VANDERHYE P.C. 
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IN THE UNITED STATES PATENT AND TRADEMARK OF06e/ 5812 86 

In re Patent Application of 4fg Rec'd PCT/PTO 1 2 JUN 2000 

ROSS et al Atty. Ref.: 47-138 

Serial No. Unassigned Group: 
Filed: June 12, 2000 Examiner: 
For: P. GINGIVALIS POLYPEPTIDES AND NUCLEOTIDES 

-X- * * -X- -X- -X- * -X- X ^ * 



June 12, 2000 



Assistant Commissioner for Patents 
Washington, DC 20231 



Sir: 

SUBMISSION OF SEQUENCE LISTING 

Attached is a printed sequence listing for this application and a copy of the 
computer readable CD-ROM disk in the attached envelope. 

In accordance with 37 C.F.R. 1.821-1.825, I hereby state that the content of the 
paper and computer-readable copies of the sequence listing submitted in accordance 
with 37 C.F.R. 1.821(c) and (e), respectively, are the same. I hereby state that the 
submission, filed in accordance with 37 C.F.R. 1.821(g), does not introduce new matter. 

Respectfully submitted, 
NIXON & VANDERHYE P.C. 



By: 



LCM:lks 

1100 North Glebe Road, 8th Floor 
Arlington, VA 22201-4714 
Telephone: (703) 816-4000 
Facsimile: (703)816-4100 
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1 



Porphorymonas gingivalis polypeptides and nucleotides 



10 



15 



20 



25 



30 



FIELD OF THE INVENTION 

The present invention relates to P. gingivalis nucleotide sequences, 
P. gingivalis polypeptides and probes for detection of P. gingivalis. The 
P. gingivalis polypeptides and nucleotides can be used in compositions for 
use in raising an immune response in a subject against P. gingivalis and 
treating or preventing or reducing the severity of the condition known as 
periodontitis. 

BACKGROUND OF THE INVENTION 

Periodontal diseases are bacterial-associated inflammatory diseases of 
the supporting tissues of the teeth and range from the relatively mild form of 
gingivitis, the non-specific, reversible inflammation of gingival tissue to the 
more aggressive forms of periodontitis which are characterised by the 
destruction of the tooth's supporting structures. Periodontitis is associated 
with a subgingival infection of a consortium of specific Gram-negative 
bacteria that leads to the destruction of the periodontium and is a major 
public health problem. One bacterium that has attracted considerable 
interest is P. gingivalis as the recovery of this microorganism from adult 
periodontitis lesions can be up to 50% of the subgingival anaerobically 
cultivable flora, whereas P. gingivalis is rarely recovered, and then in low 
numbers, from healthy sites. A proportional increase in the level of 
P. gingivalis in subgingival plaque has been associated with an increased 
severity of periodontitis and eradication of the microorganism from the 
cultivable subgingival microbial population is accompanied by resolution of 
the disease. The progression of periodontitis lesions in non-human primates 
has been demonstrated with the subgingival implantation of P. gingivalis. 
These findings in both animals and humans suggest a major role for 
P. gingivalis in the development of adult periodontitis, 

P. gingivalis is a black-pigmented, anaerobic, asaccharolytic, 
proteolytic Gram-negative rod that obtains energy from the metabolism of 
specific amino acids. The microorganism has an absolute growth 
requirement for iron, preferentially in the form of haeme or its Fe(III) 
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oxidation product haemin and when grown under conditions of excess 
haemin is highly virulent in experimental animals. A number of virulence 
factors have been implicated in the pathogenicity of P. gingivalis including 
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes. 
5 In order to develop an efficacious and safe vaccine to prevent, 

eliminate or reduce P. gingivalis colonisation it is necessary to identify and 
produce antigens that are involved in virulence that have utility as 
immunogens possibly through the generation of specific antibodies. Whilst it 
is possible to attempt to isolate antigens directly from cultures of P. gingivalis 

10 this is often difficult. For example as mentioned above, P. gingivalis is a 

strict anaerobe and can be difficult to isolate and grow. It is also known that, 
for a number of organisms, when cultured in vitro that many virulence genes 
are down regulated and the encoded proteins are no longer expressed. If 
conventional chemistry techniques were applied to purify vaccine candidates 

15 potentially important (protective) molecules may not be identified. With 

DNA sequencing, as the gene is present (but not transcribed) even when the 
organism is grown in vitro it can be identified, cloned and produced as a 
recombinant DNA protein. Similarly, a protective antigen or therapeutic 
target may be transiently expressed by the organism in vitro or produced in 

20 low levels making the identification of these molecules extremely difficult by 
conventional methods. 

With serological identification of therapeutic targets one is limited to 
those responses which are detectable using standard methods such as 
Western Blotting or ELISA. The limitation here is the both the level of 

25 response that is generated by the animal or human and determining whether 
this response is protective, damaging or irrelevant. No such limitation is 
present with a sequencing approach to the identification of potential 
therapeutic or prophylactic targets. 

It is also well known that P. gingivalis produces a range of broadly 

30 active proteases (University of Melbourne International Patent Application 
No PCT /AU 96/00673, US Patent Nos 5,475,097 and 5,523,390), which make 
the identification of intact proteins difficult because of their degradation by 
these proteases. 
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SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate P. gingivalis 
nucleotide sequences which can be used for recombinant production of 
5 P. gingivalis polypeptides and to develop nucleotide probes specific for 
P. gingivalis. The DNA sequences listed below have been selected from a 
large number of P. gingivalis sequences according to their indicative potential 
as vaccine candidates. This intuitive step involved comparison of the 
deduced protein sequence from the P. gingivalis DNA sequences to the 
10 known protein sequence databases. Some of the characteristics used to 

select useful vaccine candidates include; the expected cellular location, such 
as outer membrane proteins or secreted proteins, particular functional 
activities of similar proteins such as those with an enzymatic or proteolytic 
activity, proteins involved in essential metabolic pathways that when 
15 inactivated or blocked may be deleterious or lethal to the organism, proteins 
that might be expected to play a role in the pathogenesis of the organism 
eg. red cell lysis, cell agglutination or cell receptors and proteins which are 
paralogues to proteins with proven vaccine efficacy. 

In a first aspect the present invention consists an isolated antigenic 
20 Porphorymonas gingivalis polypeptide, the polypeptide comprising; 

an amino acid sequence selected from the group consisting of SEQ. 

ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 

532; or 

an amino acid sequence at least 85%, preferably at least 95%, 
25 identical to an amino acid sequence selected from the group 

consisting of SEQ. ED. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 
531 and SEQ. ID. NO. 532; or 

at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
30 from the group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, 

SEQ. ID. NO. 531 and SEQ. ID. NO. 532. 
In an embodiment of the present invention the polypeptide 
comprises; 

an amino acid sequence selected from the group consisting of SEQ. 
35 ID. NO, 386 to SEQ. ID. NO. 528 and SEQ. ED. NO. 532; or 
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an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 
532; or 

5 at least 40 amino acids having a contiguous sequence of at least 40 

amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 
and SEQ. ID. NO. 532. 
As used herein % identity for polypeptides is to be calculated using the 
10 alignment algorithm of Needleman and Munsch (9) using a standard protein 
scoring matrix (Blosum 50}. 

In a preferred embodiment of the present invention the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386, SEQ. ID. NO. 424, SEQ. ID. NO. 425, SEQ. ID. NO. 434, 
15 SEQ. ID. NO. 447, SEQ. ID. NO. 458, SEQ. ID. NO. 475, SEQ. ID. NO. 498, 
SEQ. ID. NO. 499, SEQ. ID. NO. 500, SEQ. ID. NO. 501, SEQ. ID. NO. 387, 
SEQ. ID. NO. 400, SEQ. ID. NO. 411, SEQ. ID. NO. 419, SEQ. ID. NO. 420, 
SEQ. ID. NO. 427, SEQ. ID. NO. 429, SEQ. ID. NO. 433, SEQ. ID. NO. 437, 
SEQ. ID. NO. 438, SEQ. ID. NO. 443, SEQ. ID. NO. 444, SEQ. ID. NO. 448, 
20 SEQ. ID. NO. 449, SEQ. ID. NO. 452, SEQ. ID. NO. 455, SEQ. ID. NO. 457, 
SEQ. ID. NO. 459, SEQ. ID. NO. 461, SEQ. ID. NO. 462, SEQ. ID. NO. 463, 
SEQ. ID. NO. 467, SEQ. ID. NO. 468, SEQ. ID. NO. 469, SEQ. ID. NO, 482, 
SEQ. ID. NO. 484, SEQ. ID. NO. 485, SEQ. ID. NO. 494, SEQ. ID. NO. 508, 
SEQ. ID. NO. 509, SEQ. ID. NO. 510, SEQ. ID. NO. 520, SEQ. ID. NO. 521, 
25 SEQ. ID. NO. 522, SEQ. ID. NO. 525, SEQ. ID. NO. 526, SEQ. ID. NO. 528, 
SEQ. ID. NO. 389, SEQ. ID. NO. 390 and SEQ. ID. NO. 391. 

In another preferred embodiment of the present invention the 
polypeptide comprises an amino acid sequence selected from the group 
consisting of residue 422 to residue 531 of SEQ. ID. NO. 303, residue 534 to 
30 residue 582 of SEQ. ID. NO. 303, residue 127 to residue 232 of SEQ. ID. NO, 
301, residue 240 to residue 259 of SEQ. ID. NO. 301, residue 139 to residue 
156 of SEQ. ID. NO. 295, residue 160 to residue 178 of SEQ. ID. NO. 295, 
residue 180 to residue 207 of SEQ. ID. NO. 295, residue 221 to residue 257 of 
SEQ. ID. NO. 295, residue 259 to residue 323 of SEQ. ID. NO. 295, residue 
35 885 to residue 985 of SEQ. ID. NO. 299, residue 147 to residue 259 of SEQ. 
ID. NO. 363, residue 140 to residue 252 of SEQ. ID. NO. 344, residue 247 to 
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residue 356 of SEQ. ID. NO. 353, residue 359 to residue 391 of SEQ. ID. NO. 
353, residue 120 to residue 254 of SEQ. ID. NO. 300, residue 287 to residue 
311 of SEQ. ID. NO. 286, residue 313 to residue 352 of SEQ. ID. NO. 288, 
residue 354 to residue 401 of SEQ. ID. NO. 286, residue 208 to residue 252 of 
5 SEQ. ID. NO. 287, residue 259 to residue 373 of SEQ. ID. NO. 287, residue 5 
to residue 120 of SEQ. ID. NO. 293, residue 123 to residue 139 of SEQ. ID. 
NO. 293, residue 233 to residue 339 of SEQ. ID. NO. 265, residue 67 to 
residue 228 of SEQ. ID. NO. 278, residue 130 to residue 172 of SEQ. ID. NO. 
274, residue 174 to residue 238 of SEQ. ID. NO. 274, residue 99 to residue 

10 112 of SEQ. ID. NO. 274, residue 114 to residue 128 of SEQ. ID. NO. 274, 
residue 26 to residue 69 of SEQ. ID. NO. 285, residue 71 to residue 128 of 
SEQ. ID. NO. 285, residue 130 to residue 146 of SEQ. ID. NO. 285, residue 
620 to residue 636 of SEQ. ID. NO. 327, residue 638 to residue 775 of SEQ. 
ID. NO. 327, residue 397 to residue 505 of SEQ. ID. NO. 301, residue 528 to 

15 residue 545 of SEQ. ID. NO. 301, residue 556 to residue 612 of SEQ. ID. NO, 

301, residue 614 to residue 631 of SEQ. ID. NO. 301, residue 633 to residue 
650 of SEQ. ID. NO. 301, residue 553 to residue 687 of SEQ. ED. NO. 299, 
residue 305 to residue 447 of SEQ. ID. NO. 289, residue 1 to residue 52 of 
SEQ. ID. NO. 364, residue 65 to residue 74 of SEQ. ID. NO. 364, residue 486 

20 to residue 604 of SEQ. ID. NO. 275, residue 158 to residue 267 of SEQ. ID. 
NO. 272, residue 270 to residue 282 of SEQ. ID. NO. 272, residue 163 to 
residue 237 of SEQ. ID. NO. 273, residue 240 to Tesidue 251 of SEQ. ID. NO. 
273, residue 213 to residue 344 of SEQ. ID. NO. 282, residue 183 to residue 
324 of SEQ. ID. NO. 292, residue 327 to residue 341 of SEQ, ID. NO. 292, 

25 residue 352 to residue 372 of SEQ, ID. NO. 292, residue 141 to residue 166 of 
SEQ. ID, NO. 271, residue 168 to residue 232 of SEQ. ID. NO. 271, residue 1 
to residue 13 of SEQ. ID, NO. 302, residue 15 to residue 28 of SEQ. ID. NO. 

302, residue 30 to residue 72 of SEQ. ID. NO. 302, residue 476 to residue 529 
of SEQ. ID. NO. 277, residue 41 to residue 146 of SEQ. ID. NO. 299, residue 

30 149 to residue 162 of SEQ. ID. NO. 299, residue 166 to residue 177 of SEQ. 
ID. NO. 299, residue 192 to residue 203 of SEQ. ID. NO. 299, residue 71 to 
residue 343 of SEQ. ID. NO. 290, residue 346 to residue 363 of SEQ. ID. NO. 
290, residue 36 to residue 240 of SEQ. ID. NO. 331, residue 242 to residue 
270 of SEQ. ID. NO. 331, residue 1 to residue 192 of SEQ. ID. NO. 375, 

35 residue 266 to residue 290 of SEQ. ID. NO. 375, residue 23 to residue 216 of 
SEQ. ED. NO. 279, residue 220 to residue 270 of SEQ. ID. NO. 279, residue 
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285 to residue 386 of SEQ. ID. NO. 279, residue 84 to residue 234 of SEQ. ID. 
NO. 297, residue 248 to residue 259 of SEQ. ID. NO. 297, residue 261 to 
residue 269 of SEQ. ID. NO. 297, residue 275 to residue 402 of SEQ. ED. NO. 
294, residue 1 to residue 171 of SEQ. ID. NO. 298, residue 403 to residue 417 
5 of SEQ. ID. NO. 307, residue 420 to residue 453 of SEQ. ID. NO. 307, residue 
456 to residue 464 of SEQ. ID. NO. 307, residue 468 to residue 690 of SEQ. 
ID. NO. 307, residue 1 to residue 285 of SEQ. ID. NO. 304, residue 287 to 
residue 315 of SEQ. ID. NO. 304, residue 318 to residue 336 of SEQ. ID. NO. 
304, residue 255 to residue 269 of SEQ. ID. NO. 342, residue 271 to residue 

10 337 of SEQ. ID. NO. 342, residue 347 to residue 467 of SEQ. ID. NO. 281, 

residue 116 to residue 136 of SEQ. ID. NO. 375, residue 138 to residue 357 of 
SEQ. ID. NO. 375, residue 133 to residue 423 of SEQ. ID. NO. 364, residue 
141 to residue 299 of SEQ. ID. NO. 305, residue 202 to residue 365 of SEQ. 
ID. NO. 296, residue 134 to residue 426 of SEQ. ID, NO. 288, residue 1 to 

15 residue 218 of SEQ. ID. NO. 276, residue 1 to residue 246 of SEQ. ID. NO. 
280, residue 444 to residue 608 of SEQ. ID. NO. 364, residue 10 to residue 
686 of SEQ. ID. NO. 283, residue 1 to residue 148 of SEQ. ID. NO. 296, 
residue 1 to residue 191 of SEQ. ID. NO. 287, residue 193 to residue 204 of 
SEQ. ID. NO. 287, residue 209 to residue 373 of SEQ. ID. NO. 287, residue 

20 211 to residue 470 of SEQ. ED. NO. 284, residue 472 to residue 482 of SEQ. 
ID. NO. 284, residue 133 to residue 144 of SEQ. ED. NO. 281, residue 146 to 
residue 336 of SEQ. ID. NO. 281, residue 1 to residue 264 of SEQ. ED. NO. 
303, residue 265 to residue 295 of SEQ. ID. NO. 303, residue 297 to residue 
326 of SEQ. ID. NO. 303. residue 328 to residue 338 of SEQ. ID. NO. 303, 

25 residue 247 to residue 356 of SEQ. ID. NO. 353, residue 358 to residue 391 of 
SEQ, ID. NO. 353, residue 257 to residue 288 of SEQ. ID. NO. 298, residue 
290 to residue 385 of SEQ, ID. NO. 298, residue 245 to residue 256 of SEQ. 
ID. NO. 298, residue 422 to residue 802 of SEQ. ID. NO. 303, residue 803 to 
residue 814 of SEQ. ID. NO. 303, residue 139 to residue 156 of SEQ. ID, NO. 

30 295, residue 160 to residue 340 of SEQ. ID. NO. 295, residue 145 to residue 
361 of SEQ, ID. NO. 282, residue 363 to residue 387 of SEQ. ID. NO. 282, 
residue 398 to residue 471 of SEQ. ID. NO. 282, residue 573 to residue 679 of 
SEQ. ID. NO. 320, residue 27 to residue 168 of SEQ. ID. NO. 291, residue 170 
to residue 183 of SEQ. ID. NO. 291, residue 185 to residue 415 of SEQ. ID. 

35 NO. 291, residue 1 to residue 301 of SEQ. ID. NO. 364, residue 114 to residue 
702 of SEQ. ID. NO. 337, residue 377 to residue 412 of SEQ. ID. NO, 321, 
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residue 413 to residue 772 of SEQ. CD. NO. 321, residue 14 to residue 454 of 

SEQ. ID. NO. 265, residue 129 to residue 614 of SEQ. ID. NO. 268, residue 1 

to residue 930 of SEQ. ID. NO. 300, residue 932 to residue 1046 of SEQ. ID. 

NO. 300, residue 1 to residue 301 of SEQ. ID. NO. 364, residue 1 to residue 
5 42 of SEQ. ID. NO. 381, residue 44 to residue 973 of SEQ. ED. NO. 381, 

residue 1 to residue 93 of SEQ. ED. NO. 358, residue 95 to residue 179 of SEQ. 

ID. NO. 358, residue 181 to residue 227 of SEQ. ID. NO. 358, residue 114 to 

residue 702 of SEQ. ED. NO. 337, residue 1 to residue 659 of SEQ. ID. NO. 

355, residue 661 to residue 907 of SEQ. ID. NO. 355, residue 1 to residue 131 
10 of SEQ. ID. NO. 370, residue 133 to residue 601 of SEQ. ID. NO. 370, residue 

1 to residue 813 of SEQ. ID. NO. 344, residue 377 to residue 412 of SEQ. ID. 

NO. 321, residue 413 to residue 772 of SEQ, ID. NO. 321, and residue 189 to 

residue 614 of SEQ. ED. NO, 364, 

In a second aspect the present invention consists in a n isolated 
15 antigenic Porphoiymonas gingivalis polypeptide, the polypeptide comprising 

an amino acid sequence selected from the group consisting of SEQ. ID. NO. 

386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532 less the leader sequence set 

out in Table 3. 

In a third aspect the present invention consists in an isolated DNA 
20 molecule, the DNA molecule comprising a nucleotide sequence which 
encodes the polypeptide of the first aspect the present invention or a 
sequence which hybridises thereto under stringent conditions. 

It is preferred that the isolated DNA molecule comprises a nucleotide 
sequence selected from the group consisting of SEQ. ID. NO. 1 to SEQ. ID. 
25 NO. 264, SEQ. ID. NO. 529 and SEQ. ED. NO. 530. 

In a fourth aspect the present invention consists in a recombinant 
expression vector comprising the DNA molecule of the second aspect of the 
present invention operably linked to a transcription regulatory element. 
The present invention also provides a cell comprising this 
30 recombinant expression vector. 

In a further aspect the present invention consists in a method for 
producing a P. gingivalis polypeptide comprising culturing the cell under 
conditions that permit expression of the polypeptide, 

In yet a further aspect the present invention provides a composition 
35 for use in raising an immune response directed against P. gingivalis in a 
subject, the composition comprising an effective amount of at least one 
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polypeptide of the first aspect of the present invention, or at least one DNA 
molecule of the second aspect of the present invention, or both,and a 
pharmaceutically acceptable carrier. It is preferred that the pharmaceuticaHy 
acceptable carrier is an adjuvant In other aspects the present invention 
5 provides methods of treating P. gingivalis infection in subject comprising the 
administration of the composition to the subject such that treatment of P. 
gingivalis infection occurs. The treatment may be prophylactic or 
therapeutic* 

In yet another aspect the present invention provides an antibody 
10 raised against a polypeptide of the first aspect the invention. The antibody 
may be polyclonal or monoclonal. The present invention also provides 
compositions including these antibodies. It is preferred that these 
compositions are adapted for oral use and may be, for example, dentrifices, 
mouthwashes, etc. 

15 In a still further aspect the present invention provides a nucleotide 

probe comprising at least 18 nucleotides and having a contiguous sequence 
of at least 18 nucleotides identical to a contiguous nucleotide sequence 
selected from the group consisting of SEQ. ID. NO. 1 to SEQ. ED. NO. 121, 
SEQ. ID. NO, 529, and sequences complementary thereto. It is preferred that 

20 the probe further comprises a detectable label. 

The present invention also provides a method for detecting the 
presence of P. gingivalis nucleic acid in a sample comprising: 

(a) contacting a sample with the nucleotide probe under conditions in 
which a hybrid can form between the probe and a P, gingivalis 

25 nucleic acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a P. gingivalis nucleic acid in the 
sample. 



30 DETAILED DESCRIPTION 



Definitions 

A purified or isolated polypeptide or a substantially pure preparation 
of a polypeptide are used interchangeably herein and, as used herein, mean a 
35 polypeptide that has been separated from other proteins, lipids, and nucleic 
acids with which it naturally occurs. Preferably, the polypeptide is also 
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separated from substances, e.g., antibodies or gel matrix, e.g., 
polyacrylamide, which are used to purify it Preferably, the polypeptide 
constitutes at least 10, 20, 50 70, 80 or 95% dry weight of the purified 
preparation. Preferably, the preparation contains: sufficient polypeptide to 

5 allow protein sequencing; at least 1, 10, or 100 mg of the polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal 
cells, an in vitro preparation of cells and not an entire intact plant or animal. 
In the case of cultured cells or microbial cells, it consists of a preparation of 
at least 10% and more preferably 50% of the subject cells. 

0 A purified or isolated or a substantially pure nucleic acid, e.g., a 

substantially pure DNA, (are terms used interchangeably herein) is a nucleic 
acid which is one or both of the following: not immediately contiguous with 
both of the coding sequences with which it is immediately contiguous (i.e., 
one at the 5 l end and one at the 3* end) in the naturally occurring genome of 

5 the organism from which the nucleic acid is derived; or which is 

substantially free of a nucleic acid with which it occurs in the organism from 
which the nucleic acid is derived. The term includes, for example, a 
recombinant DNA which is incorporated into a vector, e.g., into an 
autonomously replicating plasmid or virus, or into the genomic DNA of a 

0 prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA 
or a genomic DNA fragment produced by PCR or restriction endonuclease 
treatment) independent of other DNA sequences. Substantially pure DNA 
also includes a recombinant DNA which is part of a hybrid gene encoding 
additional P. gingivalis DNA sequence. 

5 A "contig" as used herein is a nucleic acid representing a continuous 

stretch of genomic sequence of an organism. 

An "open reading frame", also referred to herein as ORF, is a region of 
nucleic acid which encodes a polypeptide. This region may represent a 
portion of a coding sequence or a total sequence and can be determined from 

0 a stop to stop codon or from a start to stop codon. 

As used herein, a "coding sequence" is a nucleic acid which is 
transcribed into messenger RNA and/or translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. The boundaries 
of the coding sequence are determined* by a translation start codon at the five 

5 prime terminus and a translation stop code at the three prime terminus, A 
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coding sequence can include but is not limited to messenger RNA synthetic 
DNA, and recombinant nucleic acid sequences. 

A "complement" of a nucleic acid as used herein refers to an anti- 
parallel or antisense sequence that participates in Watson-Crick base-pairing 
5 with the original sequence. 

A "gene product" is a protein or structural RNA which is specifically 
encoded by a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or 
other chemical entity which specifically binds to a molecule of interest. 

10 Probes are often associated with or capable of associating with a label. A 
label is a chemical moiety capable of detection. Typical labels comprise 
dyes, radioisotopes, luminescent and chemilu Tumescent moieties, 
fluorophores, enzymes, precipitating agents, amplification sequences, and 
the like. Similarly, a nucleic acid, peptide or other chemical entity which 

15 specifically binds to a molecule of interest and immobilizes such molecule is 
referred herein as a "capture ligand". Capture ligands are typically associated 
with or capable of associating with a support such as nitro-cellulose, glass, 
nylon membranes, beads, particles and the like. The specificity of 
hybridization is dependent on conditions such as the base pair composition 

20 of the nucleotides, and the temperature and salt concentration of the 

reaction. These conditions are readily discernible to one of ordinary skill in 
the art using routine experimentation. 

Homologous refers to the sequence similarity or sequence identity 
between two polypeptides or between two nucleic acid molecules. When a 

25 position in both of the two compared sequences is occupied by the same base 
or amino acid monomer subunit, e.g., if a position in each of two DNA 
molecules is occupied by adenine, then the molecules are homologous at that 
position. The percent of homology between two sequences is a function of 
the number of matching or homologous positions shared by the two 

30 sequences divided by the number of positions compared x 100. 

The terms peptides, proteins, and polypeptides are used 
interchangeably herein. 

An "immunogenic component" as used herein is a moiety, such as an 
P. gingivalis polypeptide, analog or fragment thereof, that is capable of 

35 eliciting a humoral and/or cellular immune response in a host animal. 



WO 99/29870 



PCT/AU98/01023 



An "antigenic component" as used herein is a moiety, such as 
P. gingivalis polypeptide, analog or fragment thereof, that is capable of 
binding to a specific antibody with sufficiently high affinity to form a 
detectable antigen-antibody complex. 
5 As used herein, the term "cell-specific promoter* means a DNA 

sequence that serves as a promoter, i.e., regulates expression of a selected 
DNA sequence operably linked to the promoter, and which effects expression 
of the selected DNA sequence in specific cells of a tissue. The term also 
covers so-called "leaky" promoters, which regulate expression of a selected 

10 DNA primarily in one tissue, but cause expression in other tissues as well. 

As used herein, the term "control sequence" refers to a nucleic acid 
having a base sequence which is recognized by the host organism to effect 
the expression of encoded sequences to which they are ligated. The nature of 
such control sequences differs depending upon the host organism; in 

15 prokaryotes, such control sequences generally include a promoter, ribosomal 
binding site, terminators, and in some cases operators; in eukaryotes, 
generally such control sequences include promoters, terminators and in some 
instances, enhancers. The term control sequence is intended to include at a 
minimum, all components whose presence is necessary for expression, and 

20 may also include additional components whose presence is advantageous, for 
example, leader sequences. 

As used herein, the term "operably linked" refers to sequences joined 
or ligated to function in their intended manner. For example, a control 
sequence is operably linked to coding sequence by ligation in such a way 

25 that expression of the coding sequence is achieved under conditions 
compatible with the control sequence and host cell. 

A "sample" as used herein refers to a biological sample, such as, for 
example, tissue or fluid isolated from an individual (including without 
limitation plasma, serum, cerebrospinal fluid, lymph, tears, saliva and tissue 

30 sections) or from in vitro cell culture constituents, as well as samples from 
the environment. 

The practice of the invention will employ, unless otherwise 
indicated, conventional techniques of chemistry, molecular biology, 
microbiology, recombinant DNA, and immunology well known to those 

35 skilled in the art. Such techniques are described and explained throughout 
the literature in sources such as, J. Perbal, A Practical Guide to Molecular 
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Cloning, John Wiley and Sons (1984), J. Sambrook et al., Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbour Laboratory Press (1989), T.A. 
Brown (editor), Essential Molecular Biology: A Practical Approach, Volumes 
1 and 2, IRL Press (1991), D.M. Glover and B.D. Hames (editors), DNA 
5 Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996), and 
F.M. Ausubel et al. (Editors), Current Protocols in Molecular Biology, Greene 
Pub. Associates and Wiley-Interscience (1988, including all updates until 
present). The disclosure of these texts are incorporated herein by reference. 

10 Pharmaceutically Acceptable Carriers 

The antibodies, polypeptides and DNA of the present invention can 
be included in compositions which include a carrier or diluent These 
compositions include pharmaceutical compositions where the carrier or 

15 diluent will be pharmaceutically acceptable. Pharmaceutically acceptable 
carriers or diluents include those used in compositions suitable for oral, 
rectal, nasal, topical (including buccal and sublingual), vaginal, parenteral 
(including subcutaneous, intramuscular, intravenous, intradermal, 
intrathecal and epidural) administration. They are non-toxic to recipients at 

20 the dosages and concentrations employed. Representative examples of 

pharmaceutically acceptable carriers or diluents include, but are not limited 
to; water, isotonic solutions which are preferably buffered at a physiological 
pH (such as phosphate-buffered saline or Tris-buffered saline) and can also 
contain one or more of, mannitol, lactose, trehalose, dextrose, glycerol, 

25 ethanol or polypeptides (such as human serum albumin). The compositions 
may conveniently be presented in unit dosage form and may be prepared by 
any of the methods well known in the art of pharmacy. 

As will be well understood by those skilled in the art alterations may 
30 be made to the amino acid sequences set out in the Sequence Listings. These 
alterations may be deletions, insertions, or substitutions of amino acid 
residues. The altered polypeptides can be either naturally occurring (that is 
to say, purified or isolated from a natural source) or synthetic (for example, 
by performing site-directed metagenesis on the encoding DNA). It is 
35 intended that such altered polypeptides which have at least 85%, preferably 
at least 95% identity with the sequences set out in the Sequence Listing are 
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within the scope of the present invention. Antibodies raised against these 
altered polypeptides will also bind to the polypeptides having one of the 
sequences set out in the Sequence Listings. The level of % identity is to be 
calculated as set out above. 
5 Protein sequences are homologous if they are related by divergence 

from a common ancestor. Consequently, a species homologue of the protein 
will be the equivalent protein which occurs naturally in another species. 
Within any one species a homologue may exist as numerous allelic variants, 
and these will be considered homologues of the protein. Allelic variants and 
10 species homologues can be obtained by following standard techniques known 
to those skilled in the art. 

An allelic variant will be a valiant that is naturally occurring within 
an individual organism. 

15 Mutants, Variants and Homology - Nucleic Acids 

Mutant polynucleotides will possess one or more mutations which 
are deletions, insertions, or substitutions of nucleotide residues. Mutants 
can be either naturally occurring (that is to say, isolated from a natural 
source) or synthetic (for example, by performing site-directed metagenesis on 

20 the DNA). It is thus apparent that polynucleotides of the invention can be 
either naturally occurring or recombinant (that is to say prepared using 
recombinant DNA techniques). 

An allelic variant will be a variant that is naturally occurring within 
an individual organism. 

25 Nucleotide sequences are homologous if they are related by 

divergence from a common ancestor. Consequently, a species homologue of 
the polynucleotide will be the equivalent polynucleotide which occurs 
naturally in another species. Within any one species a homologue may exist 
as numerous allelic variants, and these will be considered homologues of the 

30 polynucleotide. Allelic variants and species homologues can be obtained by 
following standard techniques known to those skilled in the art. 

Antibody Production 

Antibodies, either polyclonal or monoclonal, which are specific for a 
35 polypeptide of the present invention can be produced by a person skilled in 
the art using standard techniques such as, but not limited to, those described 
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by Harlow et al. Antibodies: A Laboratory Manual, Cold Springs Harbor 
Laboratory Press (1988), and D. Catty (editor), Antibodies: A Practical 
Approach, IRL Press (1988). 

Various procedures known in the art may be used for the production 
5 of polyclonal antibodies to epitopes of a protein. For the production of 
polyclonal antibodies, a number of host animals are acceptable for the 
generation of antibodies by immunization with one or more injections of a 
polypeptide preparation, including but not limited to rabbits, mice, rats, etc. 
Various adjuvants may be used to increase the immunological response in 

10 the host animal, depending on the host species, including but not limited to 
Freund's (complete and incomplete), mineral gels such as aluminium 
hydroxide, surface active substances such as lysolecithin, pluronic polyols, 
polyanions, oil emulsions, keyhole lympet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) 

15 and Corynebacterium parvum. 

A monoclonal antibody to an epitope of a protein may be prepared by 
using any technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include but are not 
limited to the hybridoma technique originally described by Kohler and 

20 Milstein (1975, Nature 256, 493-497), and the more recent human B-ceil 

hybridoma technique (Kesber et al. 1983, Immunology Today 4:72) and EBV- 
hybridoma technique (Cole et al. 1985, Monoclonal Antibodies and Cancer 
Therapy, Alan R. liss, Inc. pp. 77-96). In addition, techniques developed for 
the production of "chimeric antibodies" by splicing the genes from antibody 

25 molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity may be used 
(Morrison et al. 1984, Proc. Natl. Acad. Sci„ 81:6851-6855; Neuberger et al. 
1984 Nature 312:604-608; Takeda et al. 1985 Nature 31:452-454). 
Alternatively, techniques described for the production of single chain 

30 antibodies (U.S. Patent 4,946,778) can be adapted to produce 4-specific single 
chain antibodies. 

Recombinant human or humanized versions of monoclonal 
antibodies are a preferred embodiment for human therapeutic applications. 
Humanized antibodies may be prepared according to procedures in the 

35 literature (e.g. Jones et al. 1986, Nature 321:522-25; Reichman et al. 1988 
Nature 332:323-27; Verhoeyen et al. 1988, Science 239:1534-36). The 
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recently described "gene conversion metagenesis" strategy for the production 
of humanized monoclonal antibody may also be employed in the production 
of humanized antibodies (Carter et al. 1992 Proc. Natl. Acad. Sci. U.S.A. 
89:4285-89). Alternatively, techniques for generating the recombinant phase 
5 library of random combinations of heavy and light regions may be used to 
prepare recombinant antibodies (e.g. Huse etal. 1989 Science 246:1275-81). 

Antibody fragments which contain the idiotype of the molecule such 
as Fu F(abl) and F(ab2) may be generated by known techniques. For 
example, such fragments include but are not limited to; the F(ab) E2 

10 fragment which can be produced by pepsin digestion of the intact antibody 
molecule; the Fab 1 fragments which can be generated by reducing the 
disulfide bridges of the F(ab*)2 fragment, and the two Fab fragments which 
can be generated by treating the antibody molecule with papain and a 
reducing agent. Alternatively, Fab expression libraries may be constructed 

15 (Huse et al. 1989, Science 246:1275-1281) to allow rapid and easy 

identification of monoclonal Fab fragment with the desired specificity to a 
protein. 

Adjuvants 

20 "Adjuvant" means a composition comprised of one or more 

substances that enhances the immunogenicity and efficacy of a vaccine 
composition. Non-limiting examples of suitable adjuvants include squalane 
and squalene (or other oils of animal origin); block copolymers; detergents 
such as Tween®-80; Quil® A, mineral oils such as Drakeol or Marcol, 

25 vegetable oils such as peanut oil; Corynebacterium-derived adjuvants such as 
Corynebacterium parvum; Propionibacterium-derived adjuvants such as 
Propionibacterium acne; Mycobacterium bovis (Bacillus Calmetic and 
Guerinn or BCG); interleukins such as interleukin 2 and interleukin-12; 
monokines such as interleukin 1; tumour necrosis factor; interferons such as 

30 gamma interferon; combinations such as saponin-aluminium hydroxide or 
Quil-A aluminium hydroxide; liposomes; ISCOM adjuvant; mycobacterial 
cell wall extract; synthetic glycopeptides such as muramyl dipeptides or 
other derivatives; Avridine; Lipid A; dextran sulfate; DEAE-Dextxan or DHAE- 
Dextran with aluminium phosphate; carboxypolymethylene such as 

35 Carbopol* EMA; acrylic copolymer emulsions such as Neocryl A640 (e.g. U.S. 
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Pat. No. 5,047,238); vaccinia or animal posvirus proteins; sub-viral particle 
adjuvants such as cholera toxin, or mixtures thereof. 

As used herein, stringent conditions are those that (1) employ low 
5 ionic strength and high temperature for washing, for example, 0.015 M 
NaCl/0.0015 M sodium citrate/0.1% NaDodS04 at 50°C; (2) employ during 
hybridisation a denaturing agent such as formamide, for example, 50% 
(vol/vol) formamide with 0.1% bovine serum albumin, 0,1% Ficoll, 0.1% 
polyvinylpyrrolidone, 50 mM sodiuriTphosphate buffer at pH 6.5 with 750 
10 mM NaCl, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 
5 x SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 mM sodium phosphate 
(pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated 
salmon sperm DNA (50 ng/ml), 0.1% SDS and 10% dextran sulfate at 42°G in 
0,2 x SSC and 0.1% SDS 

15 

As will be understood the present invention includes within its scope 
DNA vaccination. Further information regarding DNA vaccination may be 
found in Donnelly et al, Journal of Immunological Methods 176(1994) 145- 
152, the disclosure of which is incorporated herein by reference. 
20 Throughout this specification the word "comprise", or variations such 

as "comprises" or "comprising", will be understood to imply the inclusion of a 
stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer, or group of elements or integers. 

25 Preparation of the P. gingivalis library for sequencing. 

To determine the DNA sequence of P. gingivalis genomic DNA was 
isolated from P. gingivalis strain W50 (ATCC 53978) essentially by the 
method described by Mamur J. ( J. Mol. Biol. 3, 208-218, 1961). Cloning of 

30 DNA fragments was performed essentially as described by Fleischmann et a/., 
(Science; 269, 496-512, 1995)(2). Briefly, purified genomic DNA from 
P. gingivalis was nebulized to fragment the DNA and was treated with Bal31 
nuclease to create blunt ends then run twiGe through preparative 1% agarose 
gels. DNA fragments of 1.6-2.0 kb were excised from the gel and the DNA 

35 recovered. This DNA was then ligated to the vector pUClS (Smal digested 
and dephosphorylated; Pharmacia) and electrophoresed through a 1% 
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preparative agarose gel. The fragment comprising linear vector plus one 
insert was excised, purified and this process repeated to reduce any vector 
without insert contamination. The recovered vector plus insert DNA was 
blunt-ended with T4 DNA polymerase, then a final ligation to produce 
5 circular DNA was performed. Aliquots of Epicurian Coli Electroporation- 
Competent Cells (Stratagene) were transformed with the ligated DNA and 
plated out on SOB agar antibiotic diffusion plates containing X-gal and 
incubated at 37°C overnight. Colonies with inserts appeared white and those 
without inserts (vector alone) appeared blue. Plates were stored at 4°C until 
10 the white clones were picked and expanded for the extraction of plasmid 
DNA for sequencing. 

DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5ml 

15 of LB, TB or SOB broth supplemented with 50-100ug/ml Ampicillin in 96 
deep well plates. Plasmid DNA was isolated using the QIAprep Spin or 
QLAprep 96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was 
eiuted into a 96 well gridded array and stored at -20C. 

Sequencing reactions were performed using ABI PRISM Dye 

20 Terminator and ABI PRISM BIGDye Terminator Cycle Sequencing Ready 
Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems, 
Foster City, CA) using the M13 Universal forward and reverse sequencing 
primers. Sequence reactions were conducted on either a Perkin-Elmer 
GeneAmp 9700 (PE Applied Biosystems) or Hybaid PCR Express (Hybaid, 

25 UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 377 
DNA sequencers (PE Applied Biosystems). 

The sequences obtained are set out below. The relationship between 
these sequences is set out in Table 1. The initiation codon was calculated 
using a combination of sequence homology alignment (FASTA), signal 

30 sequence prediction (PSORT, SignalP) or ORF prediction (GeneMark). 
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Table 1: Reference table indicating the relationships of each sequence ID to 
the selected proteins. 



Protein 


DNA 


Amino acid 


DNA sequence of 


Amino acid 


name 


sequence of 


sequence of 


protein 


sequence of 




complete 


complete ORF 




protein 
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122 


386 
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387 
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6 
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130, 131, 132 
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21 
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409 
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22 
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146 


410 


PG12 


23 
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147 


411 
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24 
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148 


412 
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DNA 
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DNA sequence of 


Ammo acid 
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sequence of 


sequence of 
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sequence of 
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DNA sequence analysis 

5 

DNA files in FASTA format were converted to GCG format files and 
imported into a database. The DNA files were translated into amino acid 
files using the program Flip obtained from ANGIS (Australian Genomic 
Information Service, University of Sydney, Australia). A series of 

10 bioinformatic analyses were performed on the proteins in order to select 
potential vaccine candidates. The programs used were FASTA homology 
searching (1), PSORT (2,3), SignalP (4), TopPred (5), and GeneMaxk (6). The 
proteins and their bioinformatic results were stored in the custom written 
database for search and retrieval of proteins with the desired characteristics 

15 The FASTA homology results for these proteins were then examined 

for any alignment with a protein suggesting^ surface location or vaccine 
efficacy. All proteins were searched for homology against a non-redundant 
bacterial protein database compiled by ANGIS using the FASTA algorithm. 
The settings used for the FASTA searches were Ktup = 2, gap creation 

20 penalty = -12, gap extension penalty = -2, width for deriving alignment in 

opt = 16 and the Blosum 50 scoring matrix. Individual FASTA search results 
were examined for significant homology by statistical probability and amino 
acid alignments. The results are set out in Table 2. 
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Protein files were then trimmed to the first, second, third, fourth and 
fifth methionine residues using a protein trimming program (ANGIS). The 
trimmed proteins were then subjected to PSORT analysis for the detection of 
signal sequences and the prediction of cell location. Proteins exhibiting a 
5 PSORT probability of outer membrane >0.8 were considered to indicate 
surface localisation. A second signal sequence detection program SignalP 
was also performed and, in certain instances, this program detected signals 
not identified with PSORT. All proteins identified by other methods were 
also analysed by PSORT and SignalP. Previously, the C-terminal amino acid 

10 of bacterial outer membrane proteins has been shown to be important for the 
assembly of the protein on the outer membrane (7). A typical structure 
definition for outer membrane proteins has been determined as the presence 
of a signal sequence at the N-terminus and a tyrosine or phenylalanine at the 
C- terminus. A number of the selected proteins exhibit this characteristic 

15 structure. The program TopPred was used to determine the presence and 
number of membrane spanning domains (MSDs) and the presence of such 
sequences indicates a preference to be attached to membranes such as the 
outer membrane. The results of PSORT, SignalP and TopPred analyses with 
the C-terminal amino acids of the selected proteins are set out in Table 3. 

20 The 70 amino acids from the C-terminus of a number of P. gingivalis 

outer membrane proteins share 50-100% protein sequence identity. These 
proteins included RGP1, RGP2, KGP, HagA, HagC, HagD, prtH and prtT. This 
conserved motif may be involved in the attachment or sorting of proteins to 
the outer membrane. The protein data set was searched using FASTA 

25 homology as described above and a number of novel proteins were identified 
which demonstrate similar motifs at their C-termini. The results are listed in 
Table 4 

The TonRTTT box is a 30 amino acid motif present within TonB outer 
membrane receptors in a wide variety of bacteria. The TonBIE box of 
30 P. gingivalis (8) was used to search the protein data set for homology by 
FASTA as described above. Those proteins demonstrating significant 
homology are listed in Table 5. 
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FASTA homology results 
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FASTA homology results 
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FASTA homology results 
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FASTA homology results 
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Table 4: Percentage identity and percentage similarity of various proteins 
with the 70 amino acids from the C-terminal of the P, gingivalis arginine 
protease 1 (RGPl), arginine protease 2 (RGP2), and the cysteine 
protease/hemagglutinin (prtT). 



Protein 
name 




Percent identity 




Percent similarity 






RGPl 


RGP2 


prtT 




RGPl 


RGP2 


IprtT 


PG21 1 


17 


29 


21 I 


40 


57 


!49 ! 
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73 
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Table 5: Percentage identity and percentage similarity of various proteins 
with the TonBm box of P. gingivalis. 



Protein 
name 


Percent identity 


Percent similarity 


PG2 
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PG13 
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50 
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Cloning, expression and purification of recombinant P. gingivalis genes. 
PGl 

5 Oligonucleotides to the 5' and 3' regions of the deduced protein were 

used to amplify the gene of interest from a preparation of F. gingivalis W50 
genomic DNA using the TaqPlus Precision PCR System ( Stratagene) and a 
PTC-100 (MJ Research) thermal cycler or similar device. The 5' 
oligonucleotide primer sequence was GCGGC ATATGCTGGCCG AACCGGGC , 

10 the 3' oligonucleotide primer sequence was 

GCGCCTCGAGTCAATTCATTTCCTTATAGAG. The PCR fragment was 
purified, digested with Nde I, Xho I restriction enzymes (Promega) and 
ligated into the corresponding sites of the plasmid pProEx-1 (Gibco-BRL) and 
transformed into E. coli ER1793 cells (a gift from Elizabeth Raleigh, New 

15 England Biolabs). A resulting clone expressing the correct insert was 

selected and induced with or without O.lmM IPTG (Promega) for expression 
of the recombinant protein. Expression of the recombinant protein was 
determined by SDS-PAGE analysis and Western Blot using the one of the 
rabbit antisera described above or an anti-hexahistidine antibody (Clontech) 

20 that detects the hexahistidine tag that was fused to the P. gingivalis 

recombinant protein. PGl was purified by disruption of theE. coli cells by 
sonication in binding buffer (Novagen) and solubilisation by the addition of 
sarkosyl (N-Lauroyl sarcosine) to a 1% final concentration. There after the 
preparation was diluted to 0.1% sarkosyl in binding buffer, bound to a 

25 Nickel-mtrilotriacetic acid column (Ni-NTA; Qiagen), after washing bound 
:: proteins were eluted with 1M imidazole in elution buffer (Novagen) 

according to the Qiagen recommendations with 0.1% sarkosyl added to all 
buffers. Following purification samples were dialysed against 500mM NaCl, 
20mM Tris, 0.1% sarkosyl at pH7.4 to remove the imidazole, concentrated as 

30 required and stored at 4°C until used. Purity and antigenicity were assessed 
by SDS-PAGE and Western blot using selected antisera (from those described 
above) and the protein concentration was determined by the BCA assay 
(Pierce). 
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PG2 

The methods used for PG2 were essentially the same as for PGl with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CGCGGTATACATGAAAAGAATGACGC, the 3* oligonucleotide primer 
5 sequence was CGCGAGATCTGAAAGACAACTGAATACC and the PCR 

product was cloned into pGex-stop KBS(IV) (Patent application W09619496, 
JG Cox, SE Edwards, I Frazer and EA Webb. Variants of human papilloma 
virus antigens) using the BstZ 171 and Bgl II restriction sites, 2% sarkosyl 
was used to solubilise PG2 and 8M urea was added to the solublisation buffer 
10 and to all other buffers. Urea was removed from the purified protein by 
sequential dialysis (4M then 2M then 1M then 0.5M then 0M urea all in 
50mM Tris, 500mM NaCl, 0.1% sarkosyl, pH7.4). Purified protein was stored 
at 4°C until required, 

15 PG3 

The methods used for PG3 were essentially the same as for PGl with 
the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGTATACATGAAGAAATCAAGTGTAG, the 3" oligonucleotide primer 
sequence was GCGCAGATCTCTTCAGCGTACCTTGCTGTG and DNA was 

20 amplified with Pfu DNA polymerase (Stratagene). The PCR product was 
cloned directly into pCR-Blunt and transformed into E. coli 
ToplOF'(InVitrogen) before subcloning into the expression plasmid pGex- 
stop RBS(IV) using the Bst Z171 and Bgl II restriction sites and transformed 
into E. coli BL21DE3 (Pharmacia Biotech). The following modifications were 

25 made to the purification of PG3 from the PGl method. Cells expressing the 
recombinant protein were disrupted by sonication in binding buffer and the 
insoluble inclusion bodies concentrated by centrifugation. Inclusion bodies 
were then soiubilised in 6M urea (Sigma) in binding buffer and eluted with 
6M urea added to the eiution buffer. In some instances 6M guanidine 

30 hydrochloride (Sigma) was used instead of urea for these steps. Urea (or 
guanidine hydrochloride when it was substituted) was removed from the 
purified protein by sequential dialysis against reducing levels of urea (3M 
then 1.5M then 0.5M then 0M urea all in 5pmM Tris, 500mM NaCl, 8% 
glycerol, pH7.4). Purified protein was stored frozen at -8(f C until required. 

35 Protein concentration was determined by the Coomassie Plus protein assay 
(Pierce). 
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PG4 

The methods used for PG4 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CTTCTGTATACTTACAGCGGACATCATAAAATC, the 3' oligonucleotide 
5 primer sequence was TTCCAGGAGGGTACCACGCAACTCTTCTTCGAT and 
DNA was amplified with the Tth XL PCR kit (Perkin Elmer). The PGR 
product was cloned into the expression plasmid pGex-stop RBS(IV) using the 
Bst Z171 and Kpn I restriction sites and transformed into E. coli ER1793. 

10 PG5 

The methods used for PG5 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
TTGCAACATATGATCAGAACGATACTTTCA, the 3' oligonucleotide primer 
sequence was AGCAATCTCGAGCGGTTCATGAGCCAAAGC and DNA was 
15 amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24 (Novagen) using the Nde I and Xho I restriction 
sites and transformed into E. coli BL21 {Pharmacia Biotech). Removal of 
urea was not proceeded past 1M urea as the protein was insoluble at lower 
concentrations of urea. Purified protein was stored at 4°C until required. 

20 

PG6 

The methods used for PG6 were essentially the same as for PG3 with 
the following exceptions. The 5 1 oligonucleotide primer sequence was 
TAAACATATGTGCCTCGAACCCATAATTGCTCCG, the 3' oligonucleotide 
25 primer sequence was CGTCCGCGGAAGCTTTGATCGGCCATTGCTACT and 
DNA was amplified with the Tth XL PCR kit. The PCR product was cloned 
into the expression plasmid pET24a using the Nde I and Hind m restriction 
sites and transformed into E. coli BL21. 



30 PG8 

The methods used for PG8 were essentially the same as for PG3 with 
the following exceptions. The 5* oligonucleotide primer sequence was 
CGCGGTATACATGGAGTTCAAGATTGTG, the 3' oligonucleotide primer 
sequence was CGCGAGATCTGTTTTCTGAAAGCTTTTC and DNA was 
35 amplified with the TaqPlus Precision PCR System. The PCR product was 
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cloned into the expression plasmid pProEx-1 using the Nde I and Xho I 
restriction sites and transformed into E. coli ER1793. 

PG8A 

5 PG8A is a shortened version of PG8 and has the first 173 amino acids 

removed. The methods used fox PG8A were essentially the same as for FG3 
with the following exceptions. The 5' oligonucleotide primer sequence was 
CGCGGTATACATGGAAAACTTAAAGAAC, the 3' oligonucleotide primer 
sequence was CGCGAGATCTGTTTTCTGAAAGCTTTTC and DNA was 
10 amplified with the TaqPlus Precision PCR System. The PCR product was 
cloned into the expression plasmid pGex-stop RBS(IV) using the Bst Z171 
and Bgl II restriction sites and transformed into E. coli ER1793. Prior to 
dialysis of the purified protein EDTA (Sigma) was added to a final 
concentration of lOmM. 

15 

PG10 

The methods used for PGlO were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CGCGGATATCATGGATAAAGTGAGCTATGC, the 3* oligonucleotide primer 
20 sequence was CGCGAGATdTTTGTTGATACTCAATAATTC and DNA was 
amplified with the TaqPlus Precision PCR System. The PCR product was 
digested with Eco RV and Bgl Et and Ugated into the expression plasmid 
pGex-stop RBS(W) using the Bst Z171 and Bgl II restriction sites and 
transformed into E. coli ER1793. 

25 

PG11 

The methods used for PGll were essentially the same as for PGl with 
the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGTATACATGAGAGCAAACATTTGGCAGATACTTTCCG, the 3' 

30 oligonucleotide primer sequence was 

GCGCAGATCTGCGCAAGCGCAGTATATCGCC and DNA was amplified with 
Tli DNA polymerase (Promega). The PCR product was cloned into pCR-Blunt 
and transformed into E. coli ToplOF'before subcloning into the expression 
plasmid pGex-stop RBS(IV) using the Bst Z171 and Bgl II restriction sites and 

35 transformed into F. coli ER1793. PGll was purified by solubilisation of E. 
coli cells with 2% sarkosyl in binding buffer (Qiagen) which was diluted to 
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0.1% sarkosyl in binding buffer, bound to a Nickel-nitrilotriacetic acid 
column (Ni-NTA; Qiagen), after washing bound proteins were eluted with 
1M imidazole (0.7% CHAPS (Sigma) in elution buffer; Qiagen) according to 
the Qiagen recommendations. Following purification samples were dialysed 
5 against 500mM NaCl, 20mM Tris, 0.7% CHAPS, 20% glycerol (Sigma) at 
pH7.4 to remove the imidazole, concentrated as required and stored at € C 
until used. 



PG12 

10 Tiie methods used for PG12 were essentially the same as for PGl with 

the following exceptions. The 5' oligonucleotide primer sequence was 

GCGCGTATACATGAATAGCAGACATCTGACAATCACAATCATTGCCGG, 
the 3' oligonucleotide primer sequence was 

GCGCAGATCTGCTGTTCTGTGAGTGCAGTTGTTTAAGTG and DNA was 
15 amplified with Tli DNA polymerase. The PGR product was cloned into pCR- 
Blunt and transformed intoS. coli ToplOFcells before subcloning into the 
expression plasmid pGex-stop RBS(IV) using the Bst Z171 and Bgl H 
restriction sites and transformed into E. coli BL21. Purification of the 
recombinant protein was essentially the same as PGll except 0.5% DHPC 
20 (l,2-Diheptanoyl-sn-glycero-3-phosphocholine; Avanti) in 50mM Tris, 50mM 
NaCl, pH8,0 was used to solubilise the inclusion bodies instead of sarkosyl 
and the DHPC was diluted to 0.1% before addition to the Ni-NTA and 0.1% 
DHPC was added to all buffers. 



25 PG13 

The methods used for PG13 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 

GCGCCATATGCGGACAAAAACTATCTTITTTGCG, the 3' oligonucleotide 
primer sequence was 

30 GCGCCTCGAGGTTGTTGAATCGAATCGCTATTTGAGC and DNA was 
amplified with Tli DNA polymerase. The PCR product was cloned the 
expression plasmid pET24b using the Nde I and Xho I restriction sites and 
transformed into K coli BL21. Purification of the recombinant protein was 
essentially the same as PG3 using 6M urea and 1% NOG (n-octyl glucoside; 

35 Sigma) was added to the dialysis buffer. Removal of urea was not proceeded 
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past 2M urea as the protein was insoluble at lower concentrations of urea. 
Purified protein was stored at 4°C until required. 

PG14 

5 The methods used for PG12 were essentially the same as for PGl with 

the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGGCGCCATGACGGACAACAAACAACGTAATATCG, the 3' 
oligonucleotide primer sequence was 

GCGCCTCGAGTTACTTGCGTATGATCACGGACATACCC and DNA was 
10 amplified with Tli DNA polymerase. The PGR product was cloned the 

expression plasmid pProEx-1 using the Ehe I and Xho I restriction sites and 
transformed into E. coli BL21. Purification of the recombinant protein was 
essentially the same as PG12. 



15 PG15 

The methods used for PG15 were essentially the same as for PG3 with 
the following exceptions. The 5* oligonucleotide primer sequence was 
CAAAAGTATACTAATAAATATCATTCTCAA, the 3' oligonucleotide primer 
sequence was GCTTATGGTACCTTTGGTCTTATCTATTAT and DNA was 
20 amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pGex-stop RBS(IV] using the Bst Z171 and Kpn I 
restriction sites and transformed into E. coli ER1793. 



PG22 

25 Tne methods used for PG22 were essentially the same as for PGl with 

the following exceptions. The 5' oligonucleotide primer sequence was 
CCCCGGATCCGATGCGACTGATCAAGGC, the 3' oligonucleotide primer 
sequence was CCCCCTCGAGCGGAACGGGGTCATAGCC and DNA was 
amplified with the TaqPlus Precision PCR System. The PCR product was 

30 cloned into the expression plasmid pET24b using the Bam HI and Xho I 
restriction sites and transformed into E. coli BL21DE3. Once PG22 was 
purified dialysis was performed in the same manner as for PGl but in the 
presence of 1M imidazole. 
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PG24 

The methods used for PG24 were essentially the same as for PG3 with 
the following exceptions. The 5 1 oligonucleotide primer sequence was 
CGCGGTATACATGAATTACCTGTACATAC, the 3' oligonucleotide primer 
5 sequence was CGCGGGATCCGTTCGATTGGTCGTCGATGG and DNA was 
amplified with the TaqPlus Precision PCR System. The PCR product was 
digested with Bst Z171 and Bam HI and ligated into the expression plasmid 
pGex-stop RBS(IV) using the Bst Z171 and Bgl II restriction sites and 
transformed into E. coli ER1793. Due to the low level of expression of PG24 
10 purification was not proceeded with except on small scale. 

PG24A 

A modified version of PG24 was also cloned and expressed. PG24A is 
the same as PG24 with the predicted N-terminal sequence removed. The 

15 methods used for PG24A were essentially the same as for PG3 with the 
following exceptions. The 5' oligonucleotide primer sequence was 
CGCGCATATGGAGATTGCTTTCCTTTCTTCG, the 3 5 oligonucleotide primer 
sequence was CGCGCTCGAGTTAGTTCGATTGGTCGTCG and DNA was 
amplified with the TaqPlus Precision PCR System. The PCR product was 

20 cloned into the expression plasmid pProEx-1 using the Nde I and Xho I 
restriction sites and transformed into E. coli ER1793. Purification of the 
recombinant protein was essentially the same as PG3 except 8M urea was 
used to solubilise the inclusion bodies and in the buffers used for the Ni-NTA 
column purification. Urea was removed by sequential dialysis (4M then 2M, 

25 then 1M then 0.5M then 0M urea all in 50mM Tris, 500mM NaCl, 8% 

glycerol, pH7.4). Purified protein was stored frozen at -80°C until required. 

PG29 

The methods used for PG29 were essentially the same as for PG3 with 
30 the following exceptions. The 5' oligonucleotide primer sequence was 

GCGCGATATCGCTAGCATGAAAAAGCTATTTCTC, the 3' oligonucleotide 
primer sequence was 

GCGCAGATCTCTCGAGTTTGCCATCGGATTGCGGATTG and DNA was 
amplified with Pfu DNA polymerase being used. The PCR product was 
35 cloned into pCR-Blunt (InVitrogen) and transformed into E. coli 

ToplOF'before subcloning into the expression plasmid pGex-stop RBS(IV) 
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using the EcoR V and Bgl II restriction sites and transformed into E. coli 
BL21. 6M urea was used throughout the purification process. 

PG30 

5 The methods used for PG30 were essentially the same as for PG3 with 

the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TACGGAATTCGTGACCCCCGTCAGAAATGTGCGC, the 3' 
oligonucleotide primer sequence was 

10 CTATGCGGCCGCTTTGATCCTCAAGGCTTTGCCCGG and DNA was 

amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates of PG30. 10ml cultures of 

15 recombinant E. coli were grown to an OD of 2.0 (Agoo^, ) in terrific broth and 
the cells were induced with 0.5mM IPTG and samples taken for analysis at 4 
hours post induction. Purification was not done for these studies. 

PG31 

20 The methods used for PG31 were essentially the same as for PG30 

with the following exceptions. The 5* oligonucleotide primer sequence was 
CGGGGAATTCGCAAAAATCAATTTCTATGCTGAA, the 3' oligonucleotide 
primer sequence was CTATGCGGGCGCTGTATGCAATAGGGAAAGCTCCGA 
and DNA was amplified with the Tth XL PCR kit The PGR product was 

25 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed into E.coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E« coli lysates. 
Purification was not done for these studies. 

30 PG32 

The methods used for PG32 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CGCAGAATTCCAGGAGAATACTGTACCGGCAACG, the 3' 
35 oligonucleotide primer sequence was 

CTATGCGGCCGCCTTGGAGCGAACGATTACAACAC and DNA was 
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amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3, Expression studies and irrununoreactivily 
studies were carried out on whole E, coli lysates. Purification was not done 
5 for these studies. 

PG33 

The methods used for PG33 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
10 removed from the recombinant protein. 

The 5* oligonucleotide primer sequence was 

TGCAGAATTCCAAGAAGCTACTACACAGAAGAAA, the 3' oligonucleotide 
primer sequence was CTATGCGGCCGCTTCCGCTGCAGTCATTACTACAA 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
15 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed intoi?. coli BL21DE3, Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

20 PG35 

The methods used for PG35 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGAATTCATGAAACAACTAAACATTATCAGC, the 3' oligonucleotide 
primer sequence was GCGTGCGGCCGCGAAATTGATCTTTGTACCGACGA 
25 and DNA was amplified with the Tth XL PCR kit. The PCR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed intoli, coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

30 

PG36 

The methods used for PG36 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
AAAGGAATTCTACAAAAAGATTATTGCCGTAGCA, the 3' oligonucleotide 
35 primer sequence was CTATGCGGCCGCGAACTCCTGTCCGAGCACAAAGT 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
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cloned into the expression plasmid pET24a using theEco RI and Not I 
restriction sites and transformed into E. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

5 

PG37 

The methods used for FG37 were essentially the same as for PG3G 
with the following exceptions. The 5' oligonucleotide primer sequence was 
TGGCGAATTCAAACGGTTTTTGATTTTGATCGGC, the 3* oligonucleotide 

10 primer sequence was CTATGCGGCCGCCTTGCTAAAGCCCATCTTGCTCAG 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into E. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 

15 Purification was not done for these studies. 

PG38 

The methods used for PG38 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was CCTCGAATTCCAAAAGGTGGCAGTGGTAAACACT, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCCTTGATTCCGAGTTTCGCTTTTAC and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies, 

30 PG39 

The methods used for PG39 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCTGGATCCCAAGGGGTCAGGGTATCGGGCTAT, the 3* 
35 oligonucleotide primer sequence was 

CTATGCGGCCGCGAATTCGACGAGGAGACGCAGGT and DNA was 
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amplified with the Tth XL PCR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 



PG40 

The methods used for PG40 were essentially the same as for PG30 
with the following exceptions. The predicted N-tercninal signal sequence was 
10 removed from the recombinant protein. The 5 f oligonucleotide primer 

sequence was GGCTGAATTCAAGACGGACAACGTCCCGACAGAT, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCGAAGTTGACCATAACCTTACCCA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
15 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E~ coli lysates. Purification was not done 
for these studies. 



20 PG41 

The methods used for PG41 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5 f oligonucleotide primer 
sequence was GACTGAATTCCAAAACGCCTCCGAAACGACGGTA, the 3' 

25 oligonucleotide primer sequence was 

CTATGCGGCCGCTTGTTCGGGAATCCCCATGCCGTT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole F. coli lysates. Purification was not done 
for these studies. 



PG42 

The methods used for PG42 were essentially the same as for PG30 
35 with the following exceptions. The 5' oligonucleotide primer sequence was 
GTTTGAATTCGCAAATAATACTCTTTTGGCGAAG, the 3' oligonucleotide 
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primer sequence was 

GAGTGCGGCCGCTTTGCCGGACATCGAAGAGATCGTC and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
5 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG43 

10 The methods used for PG43 were essentially the same as for PG30 

with the following exceptions. The 5' oligonucleotide primer sequence was 
GC G C G AATTC AAAAAAG AAAAACTTTGG ATTGC G , the 3' oligonucleotide 
primer sequence was CTATGCGGCCGCCTTCAAAGCGAAAGAAGCCTTAAC 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 

15 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed into E. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done foT these studies. 

20 PG44 

The methods used for PG44 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCCGAATTCTGTAAGAAAAATGCTGACACTACC, the 3' 

25 oligonucleotide primer sequence was 

CTATGCGGCCGCCTTTTTCCCGGGCTTGATCCCGAT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies, 

PG45 

The methods used for PG45 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
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sequence was GACAGGATCCTGCTCCACCACAAAGAATCTGCCG, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCGAAGGGATAGCCGACAGCCAAAT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
5 expression plasmid P ET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

10 PG46 

The methods used for PG46 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CTCGGAATTCCGTTATGTGCCGGACGGTAGCAGA, the 3' 
15 oligonucleotide primer sequence was 

CTATGCGGCCGCGAACGGATAGCCTACTGCAATGT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
20 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG47 

The methods used forPG47 were essentially the same as for PG30 
25 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was 

CGCCGAATTCCAAACAGTGGTGACCGGTAAGGTGATCGATTCAGAA, the 
3' oligonucleotide primer sequence was 

30 CTATGCGGCCGCGAAGTTTACACGAATACCGGTAGACCAAGTGCGGCC 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into£,coZ/BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whoie E. coli lysates. 

35 Purification was not done for these studies. 
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PG48 

The methods used for PG48 were essentially the same as for PG30 
with the following exceptions. The predicted N-tenninal signal sequence was 
removed from the recombinant protein. The 5 f oligonucleotide primer 
sequence was TGCTGAATTCCAAAAATCCAAGCAGGTACAGCGA, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCTCGTAACCATAGTCTTGGGTTTTG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG49 

The methods used for PG49 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GAACGGATCCAACGAGCCGGTGGAAGACAGATCC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTAATCTCGACTTCATACTTGTACCA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG50 

The methods used for PG50 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GCTGGGATCCGCGACAGACACTGAGTTCAAGTAC, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCGAACTTCACTACCAAGCCCATGT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid P ET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG51 

5 The methods used for PG51 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TCTTGAATTCGGGCAAAGTCTTTTCAGCAGCGAA, the 3' 
oligonucleotide primeT sequence was 

10 CTATGCGGCCGCACTTTTTCGTGGGATCACTCTCTT and DNA was 

amplified with the Tth XL PCR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these. studies. 

PG52 

The methods used for PG52 were essentially the same as for PG30 
with the following exceptions. The 5* oligonucleotide primer sequence was 

20 AGAAGAATTCAAACGGACAATCCTCCTGACGGCA, the 3' oligonucleotide 
primer sequence was CTATGCGGCCGCGAAGTCTTTGCCCTGATAGAAATC 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into E, coli BL21DE3. Expression studies 

25 and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

PG53 

The methods used for PG53 were essentially the same as for PG30 
30 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCTGAATTCGCGAATCCCCTTACGGGCCAATCG, the 3' 
oligonucleotide primer sequence was 

CTATGCGGGCGCGTCCGAAAGGCAGCCGTAATAGG and DNA was 
35 amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
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transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

5 PG54 

The methods used for PG54 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was CGCTGAATTCCAGATTTCGTTCGGAGGGGAACCC, the 3* 

10 oligonucleotide primer sequence was 

CTATGCGGCCGCCTGCTTCACGATCTTTTGGCTCA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

15 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG55 

The methods used for PG55 were essentially the same as for PG30 
20 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5 1 oligonucleotide primer 
sequence was CGAGGGATCCGAGCTCTCTATTTGCGATGGCGAG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCTTACCTGACTTCTTGTCACGAAT and DNA was 
25 amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 

PG56 

The methods used for PG56 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
AAATGGATCCCGAAAAATTTTGAGCnTTTTGATG, the 3' oligonucleotide 
35 primer sequence was CTATGCGGCCGCTTTGATTCGTAATTTTTCCGTATC 
and DNA was amplified with the Tth XL PCR kit. The PCR product was 
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cloned into the expression plasmid pET24a using the Bam HI and Not I 
restriction sites and transformed intoi?. coli BL21DE3. Expression studies 
and immunore activity studies were carried out on whole E, coli lysates. 
Purification was not done for these studies, 

5 

PG57 

The methods used for PG57 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
10 sequence was TGCTGGATCCCAAGAGATCTCAGGCATGAATGCA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCGGCCTCTTTATCTCTACCTTTTC and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
15 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole £*. coli lysates. Purification was not done 
for these studies. 

PG58 

20 The methods used for PG58 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CGGTGAATTCCAAACCCGAGGAAATACAGAAACC, the 3* 
oligonucleotide primer sequence was 

25 GAGTGCGGCCGCTGAAAGTCCAGCTAAAACCGGCGAA and DNA was 
amplified with the Tth XL PGR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco Rt and Not I restriction sites and 
transformed into E. coli BL21DE3, Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

30 for these studies. 

PG59 

The methods used for PG59 were- essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
35 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was TGCTGAATTCCAACAAGAGAAGCAGGTGTTTCAT, the 3' 
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oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGATGCTCTTATCGTCCAAACG and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG60 

The methods used for PG60 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GGCGGAATTCCAGATGCTCAATACTCCTTTCGAG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGAGGTAGGAGATATTGCAGAT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG61 

The methods used for PG61 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was AGCAGAATTCCCCGTCTCCAACAGCGAGATAGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAATCGATTGTCAGACTACCCAG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3, Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 
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PG62 



10 



The methods used for PG62 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCCAGCGGTTTCCGATGGTGCAGGGA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGTGAAATCCGACACGCAGCTG and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG63 

The methods used for PG63 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGC AGAATTCC AAG AAGCAAACACTGCATCTGAC , the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAAGTGTACGCAACACCCACGCC and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into 5. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
25 for these studies. 

PG64 

The methods used for PG64 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAGAGTCGTCCTGCTCTTAGACTG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGCGAACACCGAGACCCACAAA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
35 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG65 

5 The methods used for PG65 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGGATCCATCGGACAAAGCCGCCCGGCACTT, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTAAAGCGGTAACCTATGCCCACGAA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole 2?. coli lysates. Purification was not done 

15 for these studies. 

PG66 

The methods used for PG66 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was GTTTGAATTCCAAGACGTTATCAGACCATGGTCA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTAAAATGAGTGGAGAGCGTGGCCAT and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG67 

The methods used for PG67 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GAACGAGCTCGCGGAACGTCCTATGGCCGGAGCA, the 3' 
35 oligonucleotide primer sequence was 

GAGTGCGGCCGCTATACCAAGTATTCGTGATGGGACG and DNA was 
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amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 

PG68 

The methods used for PG68 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GCTTGCGGCCGCCCTTATGAAAGATTTGCAGAT, the 3' 
oligonucleotide primer sequence was 

GGTGCTCGAGTATACTCAACAAGCACCTTATGCAC and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
15 expression plasmid pET24a using the Not I and Xho I restriction sites and 

transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

20 PG69 

The methods used for PG69 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was TGCTGAATTCCAGGAAGGGGAGGGGAGTGCCCGA, the 3' 

25 oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGCTGTAGCGGGCTTTGAACCA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG70 

The methods used for PG70 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
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sequence was CGGTGGATCCTCGCAAATGCTCTTCTCAGAGAAT, the 3* 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTAAACGAAATATCGATACCAACATC and DNA was 
amplified with the Tth XL PCR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG71 

The methods used for PG71 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCCAGAACAATACCCTCGATGTAC AC , the 3* 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTGCCGGTAGGATTTCCTTGTCC and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole F. coli lysates. Purification was not done 
for these studies. 

PG72 

The methods used for PG72 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was TGCTGAATTCGGAGAGCGACTGGAGACGGACAGC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATGATTGCCrrTTCAGAAAAGCTAT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed TGCTGAATTCGGAGAGCGACTGGAGACGGACAGC into£. coli 
BL21DE3. Expression studies and immunoreactivity studies were carried out 
on whole E. coli lysates. Purification was not done for these studies. 
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PG73 

The methods used for PG73 were essentially the same as forPG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
5 sequence was CGGTGAATTCCAACAGACAGGACCGGCCGAACGC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTAAGAAAGGTATCTGATAGATCAG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG74 

15 The methods used for PG74 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCCAAGAAAATAATACAGAAAAGTCA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTGAGGTTTAATCCTATGCCAATACT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E« coli lysates. Purification was not done 

25 1 for these studies. 



PG75 

The methods used for PG75 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was GGCGGGATCCGCTCAGGAGCAACTGAATGTGGTA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGTGGAACAAATTGGGCAATCCATC and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
35 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG76 

5 The methods used for PG76 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5 1 oligonucleotide primer 
sequence was AGCAGAATTCGGAAACGCACAGAGCTTTTGGGAA, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTTACCTGCACCTTATGACTGAATAC and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG77 

The methods used for PG77 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAAGAGAAAAAGGATAGTCTCTCT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCTTCTTATCGCCATAGAATACAGG and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG78 

The methods used for PG78 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCAGAATTCCAGGATTCTTCCCACGGTAGCAAT, the 3' 
35 oligonucleotide primer sequence was 

GAGTGCGGCCGCTATCATGATAGTAAAGACTGGTTCT and DNA was 
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amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG79 

The methods used for PG79 were essentially the same as for PG30 
with the following exceptions. The 5* oligonucleotide primer sequence was 
TGCTGAATTCGTAGTGACGCTGCTCGTAATTGTC, the 3' oligonucleotide 
primer sequence was 

GAGTGCGGCCGCTGCCGTCCTGCCTTTCTGCCTGACG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG80 

The methods used for PG80 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGAATTCCAAAACGTGCAGTTGCACTACGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGTTGAAAGTCCATTTGACCGCAAG and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG81 

The methods used for PG81 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GTTTGAATTCCAGGATTTTCTCTATGAAATAGGA, the 3' 
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oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTGTTTATTACAAAAAGTCTTACG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression pi as mid pET24a using the Eco RI and Not I restriction sites and 
5 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG82 

10 The methods used for PG82 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was G AAC G AATTC CAGAAC AAC AACTTTAC C G AGTC G, the 3' 
oligonucleotide primer sequence was 

15 GAGTGCGGCCGCTGTTCAGTTTCAGCTTTTTAAACCA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoi?. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

20 for these studies. 

PG84 

The methods used for PG84 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
25 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was TGCTGGATCCCAGAATGATGACATCTTCGAAGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTGCGTCCCCGGCCACTACGTCC and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
30 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole £*. coli lysates. Purification was not done 
for these studies. 
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PG85 

The methods used for PG85 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
5 sequence was CGGTGAATTCGTACCAACGGACAGCACGGAATCG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCAGATTGGTGCTATAAGAAAGGTA and DNA was 
amplified with the Tth XL PCR kit The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG86 

15 The methods used for PG86 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGGATCCCAAACGCATGATCATCTGATCGAA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTGTGGTTCAGGCCGTGGGCAAATCT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

25 for these studies. 



PG87 

The methods used for PG87 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was GGCGGAATTCCAGAGCTATGTGGACTACGTCGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTACTGTGATTAGCGCGACGCTG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
35 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG88 

5 The methods used for PG88 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCAGAATTCGCCGAATCGAAGTCTGTCTCTTTC, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTCGGCAAGTAACGCTTTAGTGGGGA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E* coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG89 

The methods used for PG89 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAATCGAAGTTAAAGATCAAGAGC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTAGTCCAAAGACCCACGGTAAA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivily 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG90 

The methods used for PG90 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5 f oligonucleotide primer 
sequence was GGGAGAATTCCAAACAACGACGAACAGTAGCCGG, the 3' 
35 oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTTTTGTTGTGATACTGTTTGGGC and DNA was 
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amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoi?. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 

PG91 

The methods used for PG91 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. The 5' oligonucleotide primer 
j . sequence was TGCTGAATTCCAGACGATGGGAGGAGATGATGTC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTCCACGATGAGCTTCTCTACGAA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
15 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

20 PG92 

The methods used for PG92 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGAATTCGCCGATGCACAAAGCTCTGTCTCT, the 3' 

25 oligonucleotide primer sequence was 

GAGTGCGGCCGCTTCGAGGACGATTGCTTAGTTCGTA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intol?. coli BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG93 

The methods used for PG93 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
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sequence was GGCCGAGCTCCAAGAGGAAGGTATTTGGAATACC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGCGAATCACTGCGAAGCGAATTAG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
5 expression plasmid pET24a using the Sac I and Not I restriction sites and 

transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

10 PG94 

The methods used for PG94 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGAGCTCCAAGAGGAAGGTATTTGGAATACC, the 3' 
15 oligonucleotide primer sequence was 

GAGTGCGGCCGCTITGTGCTACCACGATCATTTTCTT and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
20 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG95 

The methods used for PG95 were essentially the same as for PG30 
25 with the following excepti ons. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5 r oligonucleotide primer 
sequence was GGCCGAGCTCTGTGGAAAAAAAGAAAAACACTCT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTAACTGTCTCCTTGTCGCTCCCCGG and DNA was 
30 amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 
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PG96 

The methods used for PG96 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
5 sequence was TGCTGAGCTCCAAACGCAAATGCAAGCAGACCGA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTTGAGAATTTTGATTGTCTCACG and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
10 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG97 

15 The methods used for PG97 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GGCGGGATCCCAGTTTGTTCCGGCTCCCACCACA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTCTGTTTGATGAGCTTAGTGGTATA and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

25^ for these studies. 

. PG98 

The methods used for PG98 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was AGCAGAATTCCAAGAAAGAGTCGATGAAAAAGTA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTAGCTGTGTAACATTAAGTTTTTTATTGAT and DNA 
was amplified with the Tth XL PCR kit The PCR product was cloned into 
35 the expression plasmid pET24a using the Eco RI and Not I restriction sites 
and transformed into E. coli BL21DE3. Expression studies and 
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imrnunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

PG99 

5 The methods used for PG99 were essentially the same as for PG30 

with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCAAGGACAATTCTTCTTACAAACCT, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTTCGAATCACGACTTTTCTCACAAA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and imrnunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG100 

The methods used for PG100 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was GGCAGAATTCCAGTCTTTGAGCACAATCAAAGTA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGATAGCCAGCTTGATGCTCTTAGC and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and imrnunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG101 

The methods used for PG101 were essentially the same as for PG30 
with the following exceptions. The 5* oligonucleotide primer sequence was 
TGCTGAATTCAAAGGCAAGGGCGATCTGGTCGGG, the 3' oligonucleotide 
primer sequence was 
35 GAGTGCGGCCGCITCTCTTCTCGAACTTGGCCGAGTA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
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expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

5 

PG102 

The methods used for PG102 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
10 sequence was GGCCGAATTCCAGATGGATATTGGTGGAGACGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCTCTACAATGA'I-rrriTCCACGAA and DNA was 
amplified with the Tth XL PCR kit. The PCR product was cloned into the 
expression plasmid pET24a using the Eco KI and Not I restriction sites and 
15 transformed into J?, co/i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG104 

20 The methods used for PG104 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GAACGGATCCAACGTGTCTGCTCAGTCACCCCGA, the 3* 
oligonucleotide primer sequence was 

25 . GAGTGCGGCCGCTTCTGAGCGATACTTTTGCACGTAT and DNA was 
amplified with the Tth XL PCR kit The PCR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

30 for these studies. 

Animal antisera and human patient sera. 

Various antisera were raised for detecting the expression and 
35 refolding of the recombinant P. gingivalis proteins. A whole cell antisera was 
raised by injecting New Zealand White rabbits with 3 doses of sonicated 



WO 99/29870 



PCT/AU98/01023 



77 

P. gingivalis (strain W50) containing approximately 2mg of protein. The first 
dose was given in Freunds complete adjuvant (FCA) and the second and third 
doses were given in Freunds incomplete adjuvant (IFA) at 3 week intervals. 
Doses (1ml) were given intramuscularly into the hind legs and rabbits bled 7 
5 days after the last dose, the blood clotted and serum removed and stored at - 
20°C until required. A second rabbit antisera was produced in a similar 
manner but using a sarkosyl insoluble fraction (each dose was 0.69mg of 
protein) derived from P. gingivalis W50 according to the method of Doidg and 
Trust T. et al 1994 as the immunogen. A third rabbit antisera was produced 

10 in a similar manner to the first only the sarkosyl soluble fraction (lmg of 
protein per dose) derived from P. gingivalis W50 cells according to the 
method of Doidg P. and Trust TJ, (1994 Infect Immun 62:4526-33) was used 
as the immunogen. 

A "protected rat serum" pool was also used in these studies and was 

15 obtained from rats immunised with formalin killed whole P. gingivalis cells 
in FIA (strain ATCC 33277; 2 doses of 2xl0 9 cells, 3 weeks apart). Rats were 
then challenged 2 weeks after their last dose with live P. gingivalis cells 
(strain 33277) given orally as previously described (Klaussen B. et al. 1991, 
Oral Microbiol Immunol 6:193-201) and the serum obtained from these rats 6 

20 weeks after the final challenge inoculation at the time of sacrifice. 

Human sera were obtained from adult patients undergoing treatment 
or assessment for periodontitis at an outpatient clinic. These patients had at 
least 6 teeth with 6mm attachment loss and had P. gingivalis present in their 
sub-gingival plaque as detected using a P. gingivalis specific DNA probe. 

25 Sera was pooled from these patients and compared to a pool of sera from 
periodontally healthy patients. 

Immunization and Murine Lesion Model Protocols 

30 The mouse abscess model was used to assess the efficacy of 

immunising mice with recombinant P. gingivalis proteins in protecting mice 
from formation of a subcutaneous abscess. This model has been used by 
others as a predictor of potential vaccines against periodontal disease (Bird 
PS, et al. 1995 J. Periodontol. 66:351-362. BALB/c mice 6-8 weeks old were 

35 immunised by subcutaneously injecting them with 0.1 ml containing either 
10 or 20ug of recombinant P. gingivalis protein, 20ug of E. coli lysate protein, 
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2 x 10 9 formalin killed cells of P. gingivalis strain 33277 emulsified in 
incomplete Freund's adjuvant (IFA; Sigma) on day 0. At day 21 mice were 
re-injected with the same dose and then bled 1 week later and evaluated for 
antibody levels. At day 35 mice all mice were challenged with 
5 approximately 2 x 10 9 cells of live P. gingivalis (ATCC 33277) by 

subcutaneous injection in the abdomen. Following challenge mice were 
monitored daily for weight loss and the size of the lesion measured for the 
next 10 days. Lesion sizes were measured by length and width and expressed 
as mm 2 . Groups were statistically analysed using a Kruskal-Wallis one-way 

10 ANOVA and were also individually examined using the unpaired t test or 
Mann- Whitney rank sum test using the Ins tat statistical package. 

Figure 1 shows the results of one experiment at day 4 after challenge 
(lesions were at maximum size at this time point). Control mice immunised 
with E. coli lysate showed large lesions while mice immunised with killed 

15 cells of P, gingivalis strain 33277 were fully protected. This indicates that 
whole cells provide protection against P. gingivalis while E. coli protein 
immunised mice were not protected. Mice given the various PG recombinant 
proteins showed significant levels of protection for PG2, PG22, PG24 and 
PG29 (p<0.05 unpaired t test) while PG8A was not quite significantly 

20 different (p=0.07) compared to the E. coli control group. 

Figure 2 shows the results of a separate experiment using 
combinations of recombinant proteins. Mice given PGl 4- PG2 showed a 
significant level of protection compared to control mice give E. coli lysate 
(p<0.026 unpaired t test), 

25 

Immunoscreening 

Cloned candidates were cultured in 15ml of Terrific broth, induced 
with IPTG and sampled at 4h post-induction. One ml of culture was 

30 removed, pelleted and the cells resuspended in a volume of PBS determined 
by dividing the OD A^^ of the culture by 8. An aliquot of lysate (lOOul) 
was added to lOOul of 2x sample reducing buffer (125mM Tris pH 6.8, 20% 
glycerol, 4% SDS, 80mM DTT, 0.03% bromophenol blue) and boiled for 
lOmin. SDS-PAGE was performed according to the method of Laemmli UK. 

35 1970 (Nature 227:680-685) using 4-20% 1.0mm Tris-Glycine gels (Novex) 

according to the manufacturers recommendations. Proteins were transferred 
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onto Hybond-C Extra nitrocellulose membranes (Amersham) by transblotting 
and the membranes were then blocked for 2h at room temperature (RT) in 5% 
skim milk in 20mM Tris, 0.5M NaCl, 0.05% Tween-20, pH 7.5 (TTBS). 
Immunoscreening was performed separately with the rabbit 
5 anti-P. gingivalis whole cell serum, the rat protective serum, a pool of human 
periodontal patients serum, and in many cases an anti-T7-Tag antibody HRP 
conjugate (Novagen). Prior to use, the rabbit, rat and human sera were 
diluted 1/5000, 1/1000 and 1/500 respectively in 5% skim milk in TTBS and 
absorbed with lOOui (for the rabbit serum) or 250ul (for the rat and human 

10 sera) E. coli extract (20mg/ml; Promega) for 6h at RT, 

Membranes were incubated overnight at RT with the absorbed 
antisera, or for 1 hr at RT with 1/5000 diluted anti-T7-Teg conjugate. 
Following 3xl0min washes with TTBS, HRP-conjugated anti-rabbit (Silenus), 
anti-mouse (Silenus) or anti-human (KPL) antibody, diluted 1/5000 in 5% 

15 skim milk in TTBS, was added for lh at RT. Membranes were washed as 
before, prior to addition of TMB membrane peroxidase substrate (KPL) for 
detection of immunoreactive proteins. Results of reactivity for the 
recombinant P. gingivalis proteins is shown in Table 7. 

In addition some of the sera (pooled sera diluted 1/1000) from the 

20 mice immunised with P. gingivalis recombinant proteins (prior to challenge) 
were analysed for their reactivity against Western blots of whole native W50 
P. gingivalis proteins using similar techniques as those outlined above. PG2, 
PG8A, PG29 and PG3 ail showed bands at a similar molecular weight to that 
of the recombinant PG protein in the native W50 blot This indicates that PG 

25 proteins are expressed in the W50 strain and that the recombinant proteins 
have at least some identical immunogenicity to the native proteins. 

m-RNA analysis 

30 Hot Phenol RNA Extraction 

P. gingivalis W50 cells (150ml culture) were grown anaerobically to 
mid log phase (OD A fl0f) =0.18) mixed with 50% glycerol and stored at -70°C 
until RNA extraction. Cells were pelleted by centrifugation at 6000g, and 
35 resuspended in 8ml ASE (20mM NaOAc, 0.5% SDS, ImM EDTA). An equal 
volume of 20mM NaOAc(pH 4.5)-saturated phenol was added and mixed by 
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shaking for 30 seconds, incubated at 65°C for 5 minutes, followed by a 
further 5 second shaking and repeated incubation. After cooling, 2ml 
chloroform was added and mixed by shaking for 5 seconds, and the mixture 
spun at lOOOOg for 10 minutes at 4°C. The top aqueous phase was transferred 
5 and re-extxacted by repeating the phenol and chloroform steps. The aqueous 
phase was transferred again and 100U RNase inhibitor (RNAsin; Promega} 
were added. RNAwas precipitated with 3 volumes 100% ethanol at -2(fC 
overnight. The RNA precipitate was recovered by centrifugation at lOOOOg at 
4°C for 15 minutes, then washed with 100% ethanol, dried and re suspended 
10 in 600jul sterile, deionised, dH 2 0 with 1^1 of fresh RNase inhibitor. RNA was 
aliquoted and stored at -70°C. The RNA concentration was determined 
spectrophotometrically. A formaldehyde RNA gel confirmed RNA integrity 
(Sambrook J. et al. 1989, Molecular Cloning. A laboratory manual. Cold 
Spring Laboratory Press, New York. 2nd Edition). 

15 

RT-PCR 

The isolated RNA was used as a template for Reverse Transcription 

20 (RT) to produce cDNA. Varying RNA concentrations were used for the RT as 
each RNA transcript was potentially present at different levels. Subsequent 
amplification of the cDNA was performed using Polymerase Chain Reaction 
(PCR). RT-PCR was performed using GeneAmp® RNA PCR Kit (Perkin 
Elmer) according to the manufacturer's protocol with the following exception 

25 to the PCR; 35 cycles were performed as follows: Melt phase 95°C for 30 
seconds, Anneal phase varied between 50-60°C for 30 seconds, Extension 
phase 72°C for 1 minute. Amplification was performed in a PTC-100 
Programable Thermal Controller (MJ Research Inc.). As a control to 
demonstrate that the amplified product did not arise from contaminating 

30 DNA, Reverse Transcriptase (RTase) was omitted from a parallel tube. The 
PCR products were examined against DNA markers (GIBCO lkB ladder) on a 
1% agarose gel stained with ethidium bromide. 

RT-PCR results are shown in Table 6 using the oligonucleotide 
primers as used in "Cloning, expression and purification of recombinant 

35 P. gingivalis genes 1 ' section described above,except for the following changes. 
For PGl the 3* reverse primer used was 
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GCGCCTCGAGATTCATTTCCTTATAGAG, for PG4 the 5' forward primer 
was CTTCTTGTCGACTACAGCGGACATCATAAAATC and the 3' reverse 
primer was TTCCACCTCGAGTTAACGCAACTCTTCTTGGAT, for PG6 the 5' 
forward primer was TAAAGAATTCTGCCTCGAACCCATAATTGCTCCG, for 
5 PG10 the 5' forward primer was CGCGCATATGGATAAAGTGAGCTATGC 
and the 3' reverse primer was 

CGCGCTCGAGTTTGTTGATACTCAATAATTC, for PG13 the 5' forward 
primer was GCCCGGCGCCATGCGGACAAAAACTATCTTTTTTGCG and the 
3' reverse primer was 

10 GCCCGGCGCCTTAGTTGTTGAATCGAATCGCTATTTGAGC. Amplification 
of P. gingivalis transcripts is a likely indication that RNA for a specific 
candidate is present and that the protein is produced. However, where there 
is no amplification achieved this does not indicate that this gene is never 
transcribed and may be the result of the culture conditions or the state of the 

15 cells when harvested. 

Table 6. Expression of PG m-RNA within vitro grown P. gingivalis W50. The 
symbols are + band visible on agarose gel, - no band present on agarose gel, 
ND not detected. 

20 



PG# 


RNA 
"g 


Annealing 
temp. °C 


RT-PCR 


PCR (-RT) 


Approx. 
fragment 
size bp 


Expected 
fragment 
size bp 


1 


0.15 


55 






1300 


1362 


2 


1.0 


50 






3200 


3051 


3 


0.15 


60 


+ 




720 


690 


4 


2.9 


55 






N.D. 


2000 


5 


0.02 


50 


+ 




1000 


947 


6 


1.0 


55 


+ 




1000 


972 


8A 


0.15 


50 






1200 


1278 


10 


0.15 


55 






590 


585 


11 


0.10 


60 


+ 




960 


942 


12 


0.02 


60 






880 


831 


13 


1.0 


50 


+ 




2150 


2274 


14 


0.15 


60 






1050 


996 
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PG# 


RNA 
US 


Annealing 
temp. °C 


RT-PCR 


PCR (-RT) 


Approx. 
fragment 
size bp 


Expected 
fragment 
size bp 


22 


1.0 


60 






N.D. 


228 


24 


1.0 


55 


+ 


+ 


1150 


1194 


29 


0.15 


60 


+ 




880 


885 



Table 7: Immunoblot results of proteins expressed in E.coli against rabbit, rat 
and human antisera. Deduced MW was calculated from amino acid sequence 
of the P. gingivalis proteins, some of which had their N-terminal signal 
sequences removed. Apparent MW was determined from SDS-PAGE gels. 
The N- and C-terminal tags add approximately 2.5 KDa to the deduced MW of 
the recombinant proteins. The symbols are + positive, - negative, +/- weak 
positive, ND not done. 



Protein 


Deduced 


Apparent 


Antisera reactivity 


number 


MW (KDa] 


MW (KDa) 
















T7 


Rabbit 


Rat 


! Human 


PGl 


47.5 


63 


ND 






1 


PG2 


112.4 


125.7 


j ND 


+ 




PG3 


! 22.6 


18.3 


ND 


a 




| 


PG4 


75 


90.6 


ND 






— — — - i 


PG5 


34.9 


43.8 


ND 






— * —t 


PG6 


36.7 


47.1 


ND 






j 


PG8 


67.5 


63.1 


ND 


b 






PG8A 


47.7 


90.6 


ND 






PG10 


21.3 


25.5 


ND 


+ r — 




PG11 , 


36.2 


42.4 


ND j 


- ! - 




PG12 


30.7 


30.6 


ND j - j - 




PG13 


84.5 


101 


ND j - j - 




PG14 


36 


42.4 


ND ! 




+ 


i 

+ i 


PG22 


8.6 


11.1 j 


.ND ! 


i 
: 




- i 


! PG24A j 

I -•; 


47 | 


63.1 | 


ND j 


j 




- J 


• PG29 ! 


31.1 | 


40.9 ! 


ND 1 


+ i 




+ I 
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Protein 
number 


Deduced 
MW(KDa) 


Apparent Antisera reactivity 
MWfKDa) 1 






I T7 


\ Rabbit 


Rat I Human 


PG30 


35.1 


| 46.9 j + ] 




j PG31 


i6.7 i - i . I 






! PG32 


41.2 ! 59.5 


. + ! + 


j + 




PG33 ! 39.9 


52.7 




1 + 




: 


PG35 | 92.6 


1 116.6 


+ 








PG36 


98.9 


120.2 




I 




PG37 


18.8 


23.1 


+ 


_L 

i i 






PG38 


16.1 


22.9 


+ 








PG39 


87.9 


116.6 






* - r 


PG40 


76.6 


103.1 


+ 






PG41 


48.3 


81.1 








PG42 


59.3 


73.9 










PG43 


27.1 


50.3 


+ 








I PG44 


28.6 


32.3 


+ 




+ 




I PG45 


84 


100.6 




- 






PG46 


— 8 1 . 


97.7 


+ 






| 


PG47 


93.7 


42.5 | + 


+ 




+ i 


PG48 


45.2 


37.9 








1 


PG49 


33.3 1 


64.1 






+ 


i 

" : 


PG50 


91.9 


113.2 


+ 


+ 




: 


PG51 


19.6 


27.2 


+ 








PG52 


50.4 1 


64.4 


+ 


+ 




+ 


PG53 


47.4 


45.4 


+ 






PG54 


101.4 


46.7 




+ 


1 


PG55 


70.4 


68.4 






t 




PG56 


142.3 












PG57 


100 


134.5 


+ 


+ 


+ 


+ 


PG58 


63 


82.9 










PG59 


33.3 


43.6 


+ 








! PG60 


55.6 


77.8 


+ 1 - 






PG61 


81.5 


107.3 


+ I - ! 
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Protein 
number 


Deduced 
MW fKDa) 


Apparent 
MW fKDa) 


Antisera reactivity 










T7 


1 Rabbit 1 Rat 


! FTlirnnri 


1 PG62 


i 


51.9 


| 58.4 




+ 




PG63 


1 


29.6 


43.6 


+ 




-* 

_ 1 


PG64 


.4. 
1 


18.5 


J 26 *9 










PG65 




25.9 


*?A ft 


i 
I 
: 










PG66 




22.2 


9^ 1 




+ 


+ 






PG67 




103.7 


IUO 


t 

i 


+ 






PG68 


j 


133.3 


i OU. / 


I 
i 




: i 

1 - i 4- 


j 


PGC9 




44,4 


^ OU.o 




+ 




PG70 


j 25.9 


on ft 
; OU.O 


i 

i 


+ 


. ! . 


: 1 


I PG71 


! 88.9 


1UO.J 


-h 






-| l_„j 


I PG72 


40.7 


A.Q ft 




+ 


1 




_| • i 

i 


1 PG73 


40.7 


zy 




+/- 


1 • — 






PG74 


22.2 












PG75 


40.7 


46.7 


+ 


f 




PG76 


48.1 


55.6 


4- 




+ 


PG77 


29.6 j 


36.9 


+ 






- t 


PG78 


j 33.3 ! 


35.4 








1 

I 


PG79 


33.3 




s 








1 


PG80 


25.9 


20.5 








PG81 


23 


25.8 






PG82 


44.8 


48.5 




— j _ U „~ j 

- i . 1 . 1 


PG84 




41.7 { 




52.4 1 




+ 






j PG85 1 




62.7 ! 


72.4 f 










i 


! PG86 


21.7 


27.4 ! 










| FG87 


83 


91.3 


+ i 






+ 


PG88 


27 


40.1 \ 








PG89 


26.2 ! 


29.4 










PG90 


23 


28.4 


+- 






j 


PG91 ! 


57.2 


85.7 ! 




+ 




+ \ 


PG92 ! 




83.6 


110.4 i 




+ 






" i 

+ ! 


PG93 1 




83.4 


110.4 1 




+ 


_ t 


+ ! 
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Protein 
number 


Deduced 
MW(KDa) 


Apparent 
MW(KDa) 


Antisera reactivity 








T7 


! Rabbit 


Rat 


Human 


PG96 


59.3 


70.3 


I + 


+ 


+ 


+ 


PG97 


44.4 


57.5 


+ 








PG98 


33.3 


36 


+ 








PG99 


40.7 


55.6 








+ 


PG100 


29.6 


10.8 


+ 








PG101 


14.8 


19.7,14.1 










PG102 


59.3 j 


70.3 


+ 




! + 


PG104 


40.7 


57.5 






! + 



a. Positive reaction detected with the rabbit antiserum to sarkosyl insoluble 
5 P. gingivalis antigen. 

b. Purified protein demonstrated weak positive reaction with the rabbit 
antiserum to whole P. gingivalis. 

It will be appreciated by persons skilled in the art that numerous 
10 variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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CLAIMS:- 

1. An isolated antigenic Porphorymonas gingivalis polypeptide, the 
polypeptide comprising; 

an amino acid sequence selected from the group consisting of SEQ. 
5 ID. NO. 265 to SEQ. ED, NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 

532; or 

an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 
10 531 and SEQ. ID. NO. 532; or 

at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, 
SEQ. ID. NO. 531 and SEQ. ID. NO. 532. 
15 2 - A polypeptide as claimed in claim 1 in which the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 
532. 

3. A polypeptide as claimed in claim 1 in which the polypeptide 
20 comprises an amino acid sequence at least 85%, preferably at least 95%, 

identical to an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 
532. 

4. A polypeptide as claimed in claim 1 in which the polypeptide 

25 comprises at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected from the 
group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 
and SEQ. ID. NO. 532. 

5. A polypeptide as claimed in claim 1 in which the polypeptide 
30 comprises: 

an amin o acid sequence selected from the group consisting of SEQ. 
ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532; or 
an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 
532; or 



35 
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at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 
and SEQ. ID. NO. 532. 
5 6. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532. 

7. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises an amino acid sequence at least 85%, preferably at least 95%, 

10 identical to an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532. 

8. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected from the 

15 group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 
532. 

9. A polypeptide as claimed in claim 6 in which the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386, SEQ. ID. NO. 424, SEQ. ID. NO. 425, SEQ. ID. NO. 434 , 

20 SEQ. ID. NO. 447, SEQ. ID. NO. 458, SEQ. ID. NO. 475, SEQ. ID. NO. 498, 
SEQ. ID. NO. 499, SEQ. ID. NO. 500, SEQ. ID. NO. 501, SEQ. ID. NO. 387, 
SEQ. ED. NO. 400, SEQ. ID. NO. 411, SEQ. ID. NO. 419, SEQ. ID. NO. 420, 
SEQ. ID. NO. 427, SEQ. ID, NO. 429, SEQ. ID. NO. 433, SEQ. ID. NO. 437, 
SEQ. ID. NO. 438, SEQ. ID. NO. 443, SEQ. ID. NO. 444, SEQ. ID. NO. 448, 

25 SEQ. ID. NO. 449, SEQ. ID. NO. 452, SEQ. ID. NO. 455, SEQ. ID. NO. 457, 
SEQ. ID. NO. 459, SEQ. ID. NO. 461, SEQ. ID. NO. 462, SEQ. ID. NO. 463, 
SEQ. ID. NO. 467, SEQ. ED. NO. 468, SEQ. ID. NO. 469, SEQ. ID. NO. 482, 
SEQ. ID. NO. 484, SEQ. ED. NO. 485, SEQ. ID. NO. 494, SEQ. ID. NO. 508, 
SEQ. ID. NO. 509, SEQ. ID. NO. 510, SEQ. ID. NO. 520, SEQ, ID. NO. 521, 

30 SEQ. ID. NO. 522, SEQ. ID. NO. 525, SEQ. ID. NO. 526, SEQ. ED. NO. 528, 
SEQ. ED. NO. 389, SEQ. ID. NO. 390 and SEQ. ED. NO. 391. 

10. An isolated antigenic Porphorymonas gingivalis polypeptide, the 
polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532 

35 less the leader sequence set out in Table 3. 
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11. An isolated DNA molecule, the DNA molecule comprising a 
nucleotide sequence which encodes the polypeptide as claimed in any one of 
claims 1 to 10 or a sequence which hybridises thereto under conditions of 
high stringency. 

5 12. An isolated DNA molecule as claimed in claim 11 in which the DNA 
molecule comprises a nucleotide sequence selected from the group 
consisting of SEQ. ID. NO. 1 to SEQ. ID. NO. 264, SEQ. ID. NO. 529 and SEQ. 
ID. NO. 530. 

13. A recombinant expression vector comprising the DNA molecule as 
claimed in claim 11 or claim 12 operably linked to a transcription regulatory 
element 

14. A cell comprising the recombinant expression vector as claimed in 
claim 13. 

15. A method for producing a P. gingivalis polypeptide comprising 
culturing the cell as claimed in claim 14 under conditions that permit 
expression of the polypeptide. 

16. A composition for use in raising an immune response directed 
against P. gingivalis in a subject, the composition comprising an effective 
amount of at least one polypeptide as claimed in any one of claims 1 to 10 
and a pharmaceutically acceptable carrier. 

17. A composition as claimed in claim 16 in which the composition 
further comprises at least one DNA molecule as claimed in claim 11 or claim 
12. 

18. A composition as claimed in claim 16 or claim 17 in which the 
pharmaceutically acceptable carrier is an adjuvant 

19. A method of treating a subject for P. gingivalis infection comprising 
administering to the subject a composition as claimed in any one of claims 
16 or claim 18 such that treatment of P. gingivalis infection occurs. 

20. A method as claimed in claim 19, wherein the treatment is a 
prophylactic treatment. 

21. A method as claimed in claim 19, wherein the treatment is a 
therapeutic treatment 

22. A composition for use in raising an immune response directed 
against P. gingivalis in a subject, the composition comprising an effective 
amount of at least one DNA molecule as claimed in claim 11 or claim 12 and 
a pharmaceutically acceptable carrier. 
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23. A composition as claimed in claim 22 in which the pharmaceutical^ 
acceptable carrier is an adjuvant. 

24. A method of treating a subject for P. gingivalis infection comprising 
administering to the subject a composition as claimed in claim 22 or claim 23 

5 such that treatment of P. gingivalis infection occurs. 

25. A method as claimed in claim 24, wherein the treatment is a 
prophylactic treatment. 

26. A method as claimed in claim 24, wherein the treatment is a 
therapeutic treatment. 

10 27. An antibody raised against a polypeptide as claimed in any one of 
claims 1 to 10. 

28. An antibody as claimed in claim 27 in which the antibody is 
polyclonal. 

29. An antibody as claimed in claim 27 in which the antibody is 
15 monoclonal. 

30. A composition comprising at least one antibody as claimed in any 
one of claims 27 to 29, 

31. A composition as claimed in claim 30 in which the composition 
adapted for oral use. 

20 32. A nucleotide probe comprising at least 18 nucleotides and having a 
contiguous sequence of at least 18 nucleotides identical to a contiguous 
nucleotide sequence selected from the group consisting of SEQ. ID, NO. 1 to 
SEQ. ID. NO. 121, SEQ. ED. NO. 529 and sequences complementary thereto. 

33. A nucleotide probe as claimed in claim 32 in which the probe further 
25 comprises a detectable label. 

34. A method for detecting the presence of P. gingivalis nucleic acid in a 
sample comprising: 

(a) contacting a sample with the nucleotide probe as claimed in claim 
32 or claim 33 under conditions in which a hybrid can form between 

30 the probe and a P. gingivalis nucleic acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a P. gingivalis nucleic acid in the 
sample. 
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PCT/AU98/01023 



Day/Month/Year Filed 

10 Decsmb^r 1998 



Status: patented 
pending, abandoned 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to 
be true; and further that these statements were made with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of thQ 
application or any patent issued thereon. And on behalf of the owner(s) hereof, I hereby appoint NIXON & VANDERHYE P.C., 1100 North Glebe Rd., 
8* Floor, Arlington, va 22201-4714, telephone number <703) 816-4000 (to whom all communications are to be directed), and the following 
attorneys thereof (of the same address) individually and collectively owners/owners' attorneys to prosecute this application and to transact all business 
in the Patent and Trademark Office connected therewith and with the resulting patent: . 1 also authorize Nixon & Vanderhye to delate any 

attorney names/numbers no longer with the firm and to act and rely solely on instructions directly communicated from the person, assignee, attorney, 
firm, or other organization ©ending instructions to Nixon & Vanderhye on behalf of the owner(s) 



3. 



Inventor's Signature: 
Inventor. 

Residence: (city) 
Post office Address: 



Inventor's Signature; 
Inventor: 

Residence: (city) 
Post Office Address: 



Inventor's Signature: 
Inventor. 

Residence: (city) 
Post Office Address: 
(Zip Code) 

Inventor's Signature: 
InvBntor 

Residence: (city) 
Post Office Address: 
(Zip Code) 



Michelle 
(first) 

Victoria /\U X 



Date: 



Ml 



(Zip Code) 3021 



14 Cottereil Way. Layerton, Victoria, Australia 



Patterson 
(last) 

(state/country) Australia 



Australian 
(citizenship) 



Catherine " 



Catherine 
(first) 

Victoria Aux 



250 Elgar Road, Box Hill South, Victona, Australia 



~L Agfus 

Ml Tiasty 

(state/country) Australia 



Date: 



Xo - 6*OQ 

Australian 
(citizenship) 



(Zip Code) 3128 



Date: 



6-G-Qo 



Linda 
(first) 



J 
Ml 



RotheJ 



Victoria 



Aux 



(last) 

(state/country) Australia 



Australian 
(citizenship) 



10 Kotnschild Street, Glenhuntiy, victoria, Australia 




B 
Ml 



Marfletts 

(last) 

(state/country) Australia 

92 Bent Street, Mor^ Ponds" Victor ia, Australia v - i x . . 

3039 



Australian 
(citizenship) 



Victoria 



FOR ADDITIONAL INVENTORS, check box H and attach sheet with same information and signature and data for each. 
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RULE 63 (37 C.F.R. 1 .63) 
INVENTORS DECLARATION FOR PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

As a below named inventor, l hereby declare thai my residence, post office address and citizenship are as stated below next to my name, and I believe I 
am it.** original, first and sole inventor (if only one name is listed below) or an original, first and Joint Inventor (If plural names are listed below) of the 
subject matter which is claimed and for which a patent la sought on the Invention entitled: 

*_ P. GINGIVALIS POLYPEPTIDES AND NUCLEOTIDES . 

the specification of which (check applicable box(s»: 

□ is attached hereto ^ /A+ . K . iOQX 

H was filed on as U.S. Application Serial No. Unaligned (AttyDkt No. 47-138) 

H was filed as PCT International application No. PCT/AU98/01023 on , 10 December 1 998 

and (if applicable to U.S, or PCT application) was amended on — .._ _____ 



l hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as amended by any 
amendment referred to above. I acknowledge the duty to disclose information which is material to the patentability of this application m accordance with 
37 C F R 1 .56. I hereby claim foreign priority benefits under 35 U.S.C. 1 1 9/365 of any foreign application (s) for patent or inventor's certificate listed 
below and have also identified below any foreign application for patent or inventor's certificate having a filing date before that of the applicabon on which 
priority is claimed or t If no priority Is claimed, before the filing date of this application: 

JSSSSXSS^ Country -^'^ 

PP0839 Australia „ 10 December 1997 



I hereby claim the benefit under 35 U.S.C. §1 19(e) of any United States provisional application's) listed below. 
Application Number Date/Mo nth/Year Filed 



l hereby claim the benefit under 35 U.S.C. 120/365 of all pnor United States and PCT international applications listed above or below and. insofar as the 
subject matter of each of the claims of this application Is not disclosed in such prior applications in the manner provided by the first j^aoraph of 35 
U.S.C. 112, l acknowledge the duty to disclose material Information as defined in 37 C.F R. 1.56 which occurred between the filing date of the prior 
applications and the national or PCT international filing date of this application: 

Prior U.S7PCT Applications): n^^K^n^ 
Application Serial No, Day/Month/Y^r Filed pending, abandoned 

PCT/AU9S/01023 1 <> December 199S 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and bel.ef are believed to 
be true- and further that these statements were made with the knowledge that willful false statements and the like so made are punishable by tine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may jeopardy the v altdrtyj the 
Z S or any patent issued thereon. And on behalf of the owner(s) hereof, I hereby appoint NIXON & VANDERHYE Rd *' 
8 E Floor. Arlington, VA 22201-4714, telephone number (703) 616-4000 (to whom all communications are to be directed), and the following 
attorneys thereof (of the same address) individually and collectively owner's/owners' attorneys to prosecute fhis applicat^ an^ d t<r d^siness 
in tne Patent and Trademark Office connected tnerewitn and with the resulting patent: . I also authorize Nhcon & vandemye to delete any 
attorney names/numbers no longer with the firm and to act and rely solely on instructions directly communicated from the person, assignee, attorney, 
firm, or other organization sanding instrucnons to Nixon & Va^rhyean behalf of the owner(s). , ^- 
7.^ invents Signature: ^MZT~ 6 ZSSZ 

InV8nt0r ^ (Unship) 

Residence: (city) victoria. (state/country) Australia 

Post Office Address: 



(Zip Code) 

Inventor's Signature: 
Inventor: 

Residence: (city) 
Post Office Address: 
(Zip Code) 



49 lllawarra Road. Victoria, Australia ^-v . _ 

^nr^i — 




Date: 


<E> June. -Z^tro 


BizaSeth A. 
(first) Ml 
Victoria 


Webb 
(last) 

(state/countrv} Ausualia 


Australian 
(citizenship) 


36 Zigzag Road, Victoria, Australia ^< * .v 

3422 



430944 



JL HI™ 43 Cm 



416 Rec'd PCT/PTt) 1 2 JUN 



WO 99/29870 



PCT/AU98/01023 



1 /490 



10 



15 



20 



25 



30 



35 



40 



45 



( 2} INFORMATION FOR SEQ ID NO; I 

(I) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: HO 

tiv) AN Tl —SENSE : NO 

fvi) ORIGINAL SOURCE : 

(A) ORGANISM: PORY PHYROMON AS GINGIVALIS 

tlx} FEATURE : 

(A) NAME/ KEY: misc feature 
( B J LOCATION 1 . . . 1362 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



TTCTGTGTCA 

AAACCGCTGC 

GACGATTTTCJ 

GGCTCAGCAC 

GGCGAAGTGA 

CCGGACGGAC 

GAACAGATCA 

TACGACATAG 

ACTGCACCAT 

ACTGCCATCG 

GGTTCATCTC 

GGTAACGTGG 

ACGCTCAAGG 

TTTACCAGAA 

ATGCCGGGTT 

GAGCGTGTGA 

CTGTCAGCCC 

TTCGGGTGGG 

TTGCAGGGGA 

ATCATCATGA 

CTCAAGGCTA 

GCTCCGGAGG 

ATCGTTCGCG 



TGGCAAAAGT 
CCGAGATGCT 
AAGGTGTTAT 
TGATGCACCA 
TCGCGCTGAA 
TGAACGAATA 
AGGAGCTTTT 
TGGCTACACC 
TGGCTCCGGA 
ATGCCTTGGC 
TGGGCTTGCA 
GCGTGCTGAT 
CTACCGACCT 
TGATTGCCAT 
GCAATGTCTT 
TCGATGGCAA 
GGTGTGACCA 
CTGCACCCCG 
AAAACAAAGA 
GCAACGAGTA 
TTATAGCATT 
ACTTT GCCAC 
AGGGCTTGGA 



TATAAAAACA 

GGCCCAACCG 

CCCCAAGGTG 

CAAGGCATAT 

TCGCGGTGCC 

CGAGTCATTC 

ACTGTCGAGC 

GGATATAGCT 

CTTCGATTTC 

CAAACTCACG 

CAATGCAGAA 

CAATCATACG 

GATCGTGATC 

GACCGGCTCA 

TGCTTCCTTC 

TGTGCTGACC 

GATCACGGTG 

TCTCGATCAG 

GTACGTACTC 

TGACCGCGTT 

CGACATCGAC 

TTGCGAATTT 

TATGCTCTAT 



AAAAAAGGCC 
GCCCAAAGTC 
ACGGCTCGTC 
CCGGAGATGA 
AAGCGCAAGG 
CCTGTCGGGG 
GGTATGTGGG 
CCACGCGACA 
ATCGTTCGAG 
ACAGGAAAGG 
ATCGTAGAAG 
AAGCCAATCA 
GGACGTTTCC 
GACGCTGCAG 
CCCGGCCGAC 
GGTAAGAAGC 
ATCCCCGAAG 
TACAGCATGA 
GATCCCCGGA 
TTCCCGATGG 
AAGATGGAGG 
GTGGATACAT 
AAGGAAATGA 



TTGCACTTAA 

CTACTTACGC 

CG3GGGATAA 

AGTTTACAAG 

TGTTGAGCAT 

ATCCGTCTGC 

GTTTl'ATTAA 

TTTATATTAC 

GAGAAGAACG 

TGTATGTGGG 

TACACGGACC 

ATCGGGGCGA 

TGCTTACGGG 

CTCACGGATA 

TGACAATAAA 

TCTGCGAGAA 

GCGACGATGT 

GCAGAGCTTA 

TCAAGGGTGG 

ACATCT AT CC 

ACTTAGGCAT 

CCAAGATCGA 

AT 



TCTGAAAGGA 

GGTCGTGCCC 

GGTGCGTGCC 

TCCGGTTAGC 

CGAGGTGAAA 

CCTCTCTG CC 

G C AACGT CCT 

TGCCAACTTT 

CGCCCTGCAG 

CCTGAAGCCG 

TCATCCGGCA 

AACGGTGTGG 

CAAAGCCGAT 

CGTCCGTATT 

GGAATCTCAC 

GGAGCCTTTC 

CGACGAACTC 

TTTCTCTTGG 

CGAACGTGCT 

GGAGTATTTG 

ATATGAAGTG 

GCTGCAGCGT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1362 



50 



55 



60 



65 



70 



75 



12) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A J LENGTH: 603 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(II) MOLECULE TYPE: DNA (genomic) 
liii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMON AS GINGIVALIS 
(ix) FEATURE: 

(A) HAI IE/KEY: mi sc_f eature 

IB) LOCATION 1 . . . 603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

GAACTAAGCA AGTGTTACAT GGATAAAGTG AGCTATGCTC TGGGATTGAG CAT^rrTAAT 
AATTTCAAGT CTTCGGGCAT CGACAGCGTC GTTATGGATG A TTTCATGCA S^J^I 

SSSSH HKSS SSSg BESS ™ EES 



60 
120 
180 
240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



1LJ «»i ~L n»r jd tbi* m n Oi il,h 



30 



50 



60 



65 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 3 



<2l IN FORMAT T OH FOR SEQ ID NO: 4 



(i) SEQUENCE CHARACTERISTICS: 

{A> LENGTH : 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 
55 [0) TOPOLOGY: circular 



(ij) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 
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TTCCTCAAGA TAAATGCACA CAAGGAAGGT GTGACGACCT TACCGAGCGG CTTGCAATAC 300 

GAAGTCATTA AGATGGGAGA GGGCCCGAAA CCCACCCTTT CGGACACGGT AACCTGTCAT 360 

TATCACGGTA CGCTCATCAA CGGTATCGTT TTCGATAGCT CTATGGACAG GGGAGAACCG 4 20 

GCCAGTTTCC CTCTAAGAGG AGTTATAGCC GGCTGCACGG AGATTCTTCA ATTAATGCCT 4 80 

5 GTAGGATCCA AGTGGAAAGT AACTATACCG AGCGATCTGG CGTATGGAGA TCGTGGTGCC 540 

GGCGAACATA TCAAACCGCG TAGTACGCTC ATTTTTATAA TCGAATTATT GAGTATCAAC 600 

AAA 603 

10 (2) INFORMATION FOR SEQ ID NO:3* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 
(DJ TYPE: nucleic acid 
15 (C) STRANDE DNESS : double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DHA (genomic) 

20 (ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

tvi) ORIGINAL SOURCE: 
25 (A) ORGANISM: POR Y PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : nviscjeatute 

( B) LOCATION 1. . .837 



CAAAAAAACA AA.CGAAAGAT GAAAAAAGCA TTACTTATTG GTGCTGCTCT TTTGGGAGCA 60 

GTCAGTTTTG CAAGTGCTCA GTCTTTGAGC ACAATCAAAG TACAGAACAA TTCAGTACAG J 20 

35 CAACCTCGTG AGGAAGCCAC TATTCAGGTT TGTGGAGAAT TGGCAGAGCA AGTTGACTGC 180 

ATTGGGACAG GTAATTCTGC AATCATAGCC GCTGCAGCGA AATTT GAAAG CGATGATCTC 24 0 

GAAAGCTATG TTGGCTGGGA GATCATGAGT GTTGATTTCT TCCCTGGATA TAAAGCGTGC 300 

AAGTACACAT CTGCAGTCTG GGCTGATGAT ATGACCATTT TGGGCCAATC AGAAGATAGT 360 

GATCCCGAAA TGCAGACTAT CAACAATCTT GCTCTCAAGA CTAGTGTCAA GATTGAAGCC 420 

40 GGCAAGAATT ACATAGTTGG TTATATTGCT AATACCGCAG GTGGACATCC TATCGGATGT 4 B0 

GATCAGGGCC CTGCCGTTGA TGGTTATGGA GATTTGGTTT CTATATCAGA AGATGGTGGT 54 0 

GCTACTTTCC CTCCGTTCGA ATCTCTTCAT CAAGCAGTTC CTACCTTAAA TTACAACATC 600 

TATGTCGTTG TTCATTTGAA GAAGGGTGAA GGTGTTGAGG CTGTTCTTAC CAACGACAAG 660 

GCTAATGCTT ATGTTCAGAA TGGCGTTATC TATGTAGCCG GAGCTAATGG TCGTCAGGTA 720 

45 TCTCTGTTCG ACATGAACGG TAAGGT TGT T TATACCGGCG TTAGCGAAAC GATTGCAGCT 7B0 

CCTCAGAAGG GCATGTATAT CCTCCGTGTA GGTGCTAAGA GCATCAAGCT GGCTATC fl.V7 



(ix) FEATURE: 

(A) WAME/*XY; miscf eature 

(B) LOCATION 1. . . 471 

70 txi) SEQUENCE DESCRIPTION: SEO ID NO: 4 

CGTGnATTCC TGCCGGAGAA AGCTCTCTAT ATCGGCTGCC GCGTGGAGAC GCAAGAGGGG 60 

CATGCCGTAG GTTTCGGACT GGATGACGGC CCT GCGATGA AAGGCAAGGG CGATCTGGTC 120 

GGGAGCTATC TTCCCGGTGC TGCTCCGATG CCTTTTGTCC CGCTTTCTGA TATTCCGGCT 180 

75 CGTTCGATGG ATGCCAATTT TTATATCTAT TCTCGTATTT CACTGGGTTC GGGCACGCAG 240 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



iiJi ""n» n,'.',b .JL 11™, iJT"h *i"3i uj jLJ* tI3 iuM il 
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GATGTTCTCC AGCATAGGAT GAAAGTGTAT CCCAACCCTG CCACGACGGA GCTGCATGTG 300 

GAAGCCCTGT CCGCATGGGT GGGCGAGCAG GCTGCGGTAT ATGATATGCG TGGTCGTCGG 360 

GTATCGGCTC GGACGGTGGA TAGCGAGAAG CTGTGCATCG ACATT GCCTC ACTGCCCGTG 4 20 

GGCGTCTATA TGCTGCGCAT CGGCAGCTAC TCGGCCAAGT TCGAGAAGAG A 4 71 



10 



15 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D> TOPOLOGY: circular 

{ID MOLECULE TYPE: DNA (genomic) 

(iil) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PH YROHONAS GIWGIVALIS 

(IX) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...16B6 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:5 



30 



35 



40 



45 



50 



55 



60 



65 



70 
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ACAAGAAATG 
CTGCTGCTGG 
ATTGGTGGAG 
TTTTATTACA 
CGTATTTATT 
AAGTTAGGGT 
ATTGTAACGG 
TATGTAGGTG 
AATAATACAG 
ATCGCCTCCA 
GCTGTCGCCG 
GATGCTGGAG 
AATAGAGTAG 
CTAATGGGAG 
TCCAACTTTG 
GACTGTGGAT 
TTGGATGACA 
TACCCGCCCC 
ACAAAGAAAG 
GAAGGCGATC 
AAATACGATG 
GCTAAAGATA 
TGTTGGGTAT 
GTAATTATTA 
GACGTATTGC 
CTGATCAACC 
ATCGTGGAGA 
AAAGGTACGT 
GTAGAG 



TTTCACTCAT 
CGACTTTTGC 
ACGATGTATT 
AGATGGCTGT 
TTGATGATGT 
CTTTGGGGTC 
GAAAGGATGA 
GTACTATTGG 
AGTGTGTGTA 
ACTACCGTGC 
TTAGTGGCTC 
TACACTTTGA 
ACCTTTCATT 
TCGTATTCGA 
TGGACTATGA 
GGACTGATTT 
ATTCGGATAA 
ATTACGTATA 
ATCTGGTCTT 
GTTATCTGAC 
ACATTAACTC 
AAAAGAGGCG 
GGCATACTCG 
AACATTTCCT 
AGAAGGAGGG 
TACCCAAAGA 
AAGTTTCATT 
ACATGCTGAA 



AAAAAT G CCA 
CATAGTTTTA 
GATCGAGACG 
GGCAGACAAT 
CAGGCTGTAT 
TTTGGTGCCT 
AGATGATATC 
CAATGGCGTT 
CAAGAAGGAT 
GCCCTCTCCT 
CGGAAGCGAT 
GCAAAAGCGT 
AGGCAGTACA 
AATGAATAAG 
TCCCCGCTAT 
TAATCCTTTG 
TACCGTGGGT 
TCCGAAGCAA 
TAAACACTGT 
TACTTTTCAA 
TTTTTATGGT 
CCGTCCGCAA 
CAAGAGCCCA 
ATGGTCCGAT 
TAGCATGAAG 
AGGGGGGCAC 
TTCAGGGAAA 
AGTTGTAGCG 



AGAATTATGA 
TTTAGCCCTC 
ATGTCCACCC 
GGATGGATCT 
CGTT CCAAAG 
TATGACTTCG 
AATGTTTGGA 
TTGCTGATGC 
TTCCCTAATA 
TACGGTTTGG 
CACAGCTTCT 
ATTTACACAA 
TCTCCTTCTC 
AACCTTGATG 
GCGTGGTCTG 
GGAGCACTAA 
GGAGAACGCT 
TCTTTCAATT 
ATAGGTATTC 
GATCACAATC 
TGGAGTTCGC 
GTAGCACTCA 
TATGATGAAA 
ACGGAGTGGG 
CTCTACCCCA 
GAGGCAGTCG 
GAATATAAGC 
GATACGGAGT 



AATT AAAAAT 
TGGCCAAGGC 
TATCAGGATA 
ATGTGATGTT 
ACGGTGGTGC 
ATGTCTCGCA 
CAGTCATGAC 
ATCGCCATGA 
ATAGACTGAT 
GGGGCGATCC 
TGGACTATAT 
GACCCCAAAA 
TTGGATTTAA 
GCTTCGACAT 
AACCGATAAT 
GTATAGAGAT 
CCCATAACTT 
ATTCTCCCGG 
CGGCTTTGGC 
TAATGAGATA 
CATAT GTATA 
ATCCTACCAA 
CCAAACCACA 
TACATGCTCT 
ATCCTGCCAA 
TATACGACAT 
TGAATGTGCA 
ATTTCGTGGA 



TGCACTCAGA 
CCAGATGGAT 
TTCAGAGGAT 
GGATTTCTCT 
TACTTACCAA 
TTGCGATTTT 
AGCATTCGAA 
TGCAGATATC 
GGGTGTAGCC 
TTTTGCTCTC 
TTTTTCGTTA 
ACT G ACT AT C 
TACTTGGCCA 
TGGTTTCATT 
AATAGAAGAA 
CCAAATGATG 
CCTGATCACT 
ACATACACCG 
ATACGATAAG 
CAGATGGATC 
TGCAAAAGAA 
TGGAAAGGCT 
TCCTACTCCT 
GGACGTGGGG 
AGAATATGTT 
GCAGGGCCGA 
GTATCTGTCC 
AAAAATCATT 



(2) INFORMATION FOR SEQ ID NO: 6 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1173 base pairs 

( B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 



60 
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240 
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360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
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1560 
1620 
1680 
16B6 



suBsnnrm sheet (Rule 26) (ro/au) 
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(A) ORGANISM: PORV PI I Y ROMONAS GINGIVALIS 

(ix) FEATURE: 

(Al NAHE/KEY: misc_f eature 

(B) LOCATION 1...1173 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: S 



CAAATGAAAC 
TCTGCTCAGT 
CGTTATCGAC 
AAAGAAGAAA 
GATTCGCTTT 
GAATTAAAAC 
CGCAAGTCGC 
AGCGATATGC 
CTTAACTATC 
GCCGAATTGA 
CTGCTTCGTG 
CGTTATACAT 
CTCACCCTTG 
CCCACGCATG 
AAGCAAACGT 
TTCTATACCG 
ATTTATCCTC 
TCGCTATTCG 
ATTATCGGAG 
AGCAAAACCA 



GATTACTCCC 
CACCCCGAAT 
TGGACAAGAT 
CGATACCCAC 
TCTATGAAGA 
AAGCGGAGAA 
CGACAGACGA 
CTTTTGACAT 
TGAATGGAAA 
TCGAAACGGG 
ACAGTGTATT 
ACGACAATAA 
CCAACAACTT 
AGGAGTTCTT 
ATTTCAATAA 
ATATGCAGGG 
GTCCTGCCAC 
ACATGAACGG 
TTGCATCTCT 
TACGTGCAAA 



CTTTCTCCTT 
CCCTCAAGTG 
CAGTGTCCCG 
TAAAATACAA 
CGACAGGTTG 
GTATGTATAC 
AACGCCAATA 
TACGACAGAG 
GATTGCCCGA 
TAGAATCGTA 
TCTTCCTCTT 
GCATAATTGT 
CGAATACGAC 
CCGTCCTCTT 
CTCCGGAGAA 
TAATGCACTG 
GGATTTTCTG 
GAAGCTCATC 
TCCGAGAGGC 
AGTATCGCTC 



TTAGCAGGAC 
GATGTACACA 
GATTCTCCTC 
ACGACCACAG 
GTTCAGGTGC 
GACGGTTCTA 
AAGAAAGTTA 
ATGAGCGATG 
AT AG AT AT CA 
TATGAGTTTG 
CAAAACAAGT 
ATTCGTTGGG 
ACCACTATCC 
CTTCCCAATT 
GGCTTGTCAG 
ACCGATGTTG 
CGTATAGAAC 
AGAGCTACCG 
ACT TACATCG 
AGA 



TCGTAGCCGT 
CTCGCATCGC 
AGATATTCGA 
GAGGTGCAAT 
GCTATTTTGA 
AGCTGGTCCT 
GCTATCACTA 
GCTATTTTGA 
TGACTCAACA 
ATGCCAATAA 
GGGTAGAAAT 
AACAAGACGA 
CTCTGTCGTC 
TTATGAAGCA 
AGGTATGCGA 
CCGTGAACGA 
GTTCGCAACT 
hATTGACAGG 
CAGAAATAAC 



AGGAAACGTG 
AAGAAATGCC 
TTACTTCTAT 
TACAAGCATC 
CAATAACCTT 
TCGAGAAAT'I* 
TCTCTGTGGC 
AAGCCATACG 
GAACCCATCG 
TGATGCTGTA 
GTTTACTCAC 
ATTCGGCACC 
TGTATTGTTC 
TATGCGTACG 
TTACAACTAC 
ATCGATCAAG 
GCTTCGCCTT 
CGATTTGGCC 
TGCTGCAAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
' 660 
720 
780 
840 
900 
SCO 
102C 
1080 
1140 
1173 



(2) INFORMATION FOR SEQ ID NO: 7 



35 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



40 



tii> HOLECULE TYPE; DNA (genomic) 
(til) HYPOTHETICAL: HO 



45 



Uv) ANT I -SENSE: NO 



<Vi| ORIGINAL SOURCE: 

<A) ORGANISM: PORY PH Y ROMONAS GIHGIVALIS 



50 



(ix) FEATURE: 

(A) HAI-tF./KEY: miscf ea ture 

(B) LOCATIOJI 1...12G4 



(XI J SEQUENCE DESCRIPTION: SEQ ID NO: 7 



55 



60 



65 



70 



75 



AAGAGCJAATC 
CATTTATCTC 
AGAACAGTCC 
GGCTTCCGAC 
GAGGGATTGA 
AGCATGGCTA 
TCGCTACTGG 
GGCGGCGAGC 
TGGGTAGGAT 
TCCACCGTTC 
AAGGAGCGAC 
CAAACGCCCG 
TATTTCCGCA 
TCCGATCACG 
CCTGCGGACA 
TCCTATACCG 
AAGGGGGAGG 
GTGATTCCGG 
GCGGCACATG 
CCCGATTTCC 
ATATGCAGGC 
GGCCTGATCC 



CCTTACCTTT 
TATTTCCTTT 
GACAAACACC 
CGGACTATAC 
GCGGGTCGCT 
CGGGGCTTTA 
GCATCTTTCG 
CGGTTTGGAA 
CCGAAACGGC 
CGTTTCGTGA 
CGCGCTTGCT 
AAAGCCCGCT 
AGCGGTTGGA 
GTATGGCCAC 
GTTTCCTCTA 
AGCGAGCCTA 
TGCCCAAGCG 
ACATAGGCTC 
GCTATGACAA 
GTCCGGGCAG 
TGTTGGGTAT 
GAGACAAACG 



GACGGCATCA 
TATCTTGTTT 
TAAGCAGTCG 
CGATCGGGCA 
CCAACCATGC 
CCCCGATCAT 
TATATCCGAC 
TACGGCCGCA 
XGTGAACGGA 
CCCTGCCGAC 
CAT GTGGT AC 
GAC ACT GGCft 
CTCTCTGCCC 
GTACGAAAAT 
CATGGCCACC 
TGAGATCCTG 
TT'J'GCGCTGT 
CACCGTCi'TT 
CCAAGCACCG 
TAGGGTGGAA 
AGAGCCTGCA 
ACCA 



AATCGAAAGA 
CTGCTTCTTG 
GAACGGTACG 
CGTACACCGG 
TTCCCCTCGC 
CACGGTATCG 
CGAAAAGCCG 
CGCCAAGGCA 
AATCGGCCGT 
TCCGTCATCG 
ATCGAGGAGC 
ATGGTAGAGC 
ATAGCCGCAC 
GAGAAATGTG 
GGGGCCTTCA 
CGGGCCATTC 
GGCACCAATC 
TTCGCAATAA 
GAAATGCGGG 
AACCTGCCGA 
CCCAACGATG 



TTTTTATAAA 
CCTTCTCTTA 
TCGTAGTCCT 
CGTTGGATCG 
TTACATTTCC 
TAGCCAATGA 
TGGACACCCC 
TCCGTACCGG 
GGCGGTGGAA 
CGTGGCTCGG 
CGGATATGAT 
GGTTGGACAG 
AGACCGACTT 
TCAATCTGTC 
CCCACTTGTA 
CACATATATC 
CTCGTTTGGG 
ATGAAGACGT 
CTTTACTCCG 
ATATCACCAT 
CGGACGAAAC 



AATGAGACAG 
TG'i'CGGTTGC 
GTCTTTGGAC 
GATGGCACAG 
CAATCATTAC 
GTTTGTGGAT 
CGGATTTTGG 
TGTCTACTTT 
AAAATTCTCC 
ACTGCCCGAA 
CGGACACAGC 
TGTGCTCGGC 
CATCATAGTA 
GCATTATCTG 
CCCGAAGCCC 
GGTTTACCGC 
CGAACTGGTC 
TCGTCCGGGA 
GGCTGTCGGA 
CTATCCGCTC 
GTTGCTGAAC 



60 
120 

iao 

240 
300 
360 
420 
480 
540 
600 
660 
720 

7ac 

840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1284 
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t2) INFORMATION FOR SEQ ID NO; 8 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 base pairs 
fB) TYPE: nucleic acid 
(C) STRAMDEDNESS: double 
(DJ TOPOLOGY; circular 

(ii) MOLECULE TYPE: DHA (qenomic) 

( i J i ) HYPOTHETICAL : NO 

^5 |lv» ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE ; 

(A) ORGANISM: PORY PHYROMONAS GINCIVALIS 
20 (ix) FEATURE: 

(A) NAME/ KEY: ml .«*c f eature 
<B) LOCATION 1 ... 846 

25 < xl > SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CTTTCCCGTG GAG PTTTTCC CCTGATGTAT GGCCGAAGAG GAAGCATTCG TGCCTCTTCA 60 

GGGCATAGGG ACAAAATTTT TAAGAATACA ATTATCAGAT TTATCACAAT GAAAG^^T ill 

^JZ^l° C ^^ATGT CAATGCAGTG TATCCGGAAG TGGGTATCGC 180 

CTGCTGAAGA GTTTGGACAT AGATGTCGAC TACCCGATGG ATCAGACATG TTGCGGCCAT Vln 

30 CCTATGCCCA ATGCCGGATT CGAACAGAAA GCTCAAAAGC TGGCmSSs ^CGA^GAG loo 

CGTATGATGT AGTCGTAGGG CCATCGGCCA GTTGCG?TGC ScGT^A^ 360 
S^^TS ATCAT ATCCT CAGACCGACA GGACATGTCT GCAAGTCGGC AGOTWOT 

CGGGATATAT GCGAGTTCTT GCACGATGAC CTGAAGATCA CCAGCCTCCC CTCCCGA^ 2 An 
GCCCATAAGG TGACCCTGCA CAACAGTTGC CACGGTGTGO GCGAA^gS T^OTCcSc 
35 CCCAGTGAAG TGCACCGACC GTACCACAAC AAGGTGCGCC gGCTATTgS GAT^^rAr 

$^?^ G TATTCGAGCC GAAGCGAATA GACGAATGCT G^G^tTcGG 660 

lnn^l^ G AGCCGGAGGT ATCCACCTGT ATGGGGCATG ACAAGGTGCT GGA^ACA^A 7^0 

^n^^ G CGGAG TACAT CACAGGGCCG GACAGCT CGT GCCTCATGCA TATGCAGGGA 780 

40 S W GAGAGAAATT GCCGAT CAAG ACAATT CAT G CAGTaJa^ T^TAG^S 840 

846 



(2) INFORMATION FOR SEQ ID NO: 9 
45 !i> SEQUENCE CHARACTERISTICS: 



50 



65 



{A) LENGTH: 7 53 bare pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 
<DJ TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
55 (iv) ANTI- SENSE : NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 
60 (ix) FEATURE: 

(A) NAME/ KEY : miscj eature 

(B) LOCATION 1 . . .753 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9 



CCTCTGAAAA AACGAATGGA TATTGTAAGT ATGGCCGATA AAGCTCTTGT APTrrnrn-rr 
AGAGATGTGA CGCTCTGTCA GGAGGAAAAC GTCATTTTTC ^^t7gZ iSS^S? 
££^25 ACTTCGTCTA TCTGATAGGC TCAGTGGGAT CGGGGAAGAG C^^TgSg 
^SIITS ATCCTGAGGT GCCTATCTCT GCCHGTTATG CCCGCGTGAT AGATTATCAT 
S^SJi^ TGAAACGGAA GCAGTTGCCC TATCTGCGCA GGAATTTGg£ «™gSotS? 

n^lll^ agttgctgaa cggacgtact gttgcggaga atttggattt cg?t??gcg! 

GCTACGGACT GGAAAAACCG AGCCGATCGC G AGCAGCG T A TCGAGGAGGT TTTGArrrr? 

gtgggaatgt ctcggaaggc ttataagaga ccgcacgaac tgt^aS gg^Sca! 

CGTGTGGGTA TAGCCAGAGC TTTGCTGGCG AAGCCTGCGT TGATCCTGGC C^A^C^ 
ACAGGCAACC TCGATTCGGT GACCGGATTG CAGATCGCTT ct£gS™ CG^Ag5 
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(2) INFORMATION FOR SEQ ID HO: 11 

^ tl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1812 base pairs 
(BJ TYPE: nucleic acid 
(C) STRAHDEDNESS : double 
tD) TOPOLOGY: circular 

(li) MOLECULE TYPE: DKA (genomic) 

(iii) HYPOTHETICAL: WO 

55 (iv) ANT1 -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

<ix) FEATURE: 

(A) NAME/ KEY : mlsc feature 

(B) LOCATION 1, . .181? 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



5SSE222 SSSSSS ESSE? 

70 SEE F™ — ™ — — sssEs EES 
ssssss ssks ssss s = 

^i TGGCAT CGGC ^AGGA ATTGCGTCGC GCGTTGCAGG ATTTCAAGAT GT^^Ir 
TTCGTCGTAT CCTATGCCGA CAGATACACC CAAAAGGGTT ACTACCTCTC 



PCT/AU98/01023 



GOTcS^S? ™™ ^ GCAC ° CAC **<^A GCCTOCTGTC GCATCTGCCG 660 
~S JJSSS ^ T TGCCTCCT CTTTCGTC - GCTGAGTGCA 7,0 



(2) INFORMATION FOR SEQ ID NO: 10 

(1) SEQUENCE CHARACTERISTICS: 
- n (A) LENGTH: 714 base pairs 

1U (B) TYPE: nucleic acid 

fC) STRAMDEDMESS: double 

(D) TOFOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 
UiiJ HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 
Ux) FEATURE: 
0 c <A) NAME /KEY : misc_f eature 

Z3 (H) LOCATION 1...714 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 10 

10 nS?nn^ TT GTCCGG ^*TG TCGCTCTTCC TTTCACCTCA TAAAAACAAG TAAAACAATG 
££^1™ GCAACCTCAC CAAGGTTTTC AGAACAGAAG AAATAGAGAC GGT^^C 
cSa rS^T^ GAATTTAT ^ CCATAATGGG GCC^cSga 
TACAAGCTCG A^Srr^I ^^ TCCTC GGCCTTCTCG ACAATCCCAC TTCCGGTATC 
ATCr GGGCAG A AGTGGGCAAC CTCCGCGAAA AAGACAGGAC TGCCGTCCGT 
35 ?£SSS£ IS 00 ™ 1 " ATTCCAGAGC TTCAACCTCA TCGAAGAGAT 

= S3ESS SSS3SSS ESSES c c ^? sdS 
SS|ES SSSSS SEES SEES EES iSS 

SS555E SSSSS aatctgttcg Z2SS3SS dF"" - 



60 
120 
180 
240 
300 
360 
420 
430 



714 
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10 



15 



20 



ACCTACAAGG 
CAAATCACCA 
CGCAAGACGG 
GCCGAGAAAG 
AAAGAACTCA 
TCTCAGGTTC 
TTTGCCGAAG 
TCCATCACAC 
GCCGTAGTAC 
AAGCAGGTAG 
GCCTCGGGCG 
CTGACCGGCT 
ATAGGAGTGA 
GCTCCGATGA 
CTCTTCCTCA 
GOTCAAGGCC 
GGAGGTTTGG 
AGCATAGAGT 
GCGGATATGA 
CGCGAACTCC 
TTCATGCCGT 



CAGCCGTAGA 
CATACATAAA 
CAATGGATTC 
CGGTGGAGAT 
AAAAGATGTC 
TGGCCAATGG 
GTGAAATAAC 
AAGAACT CGC 
TTCGTGTCAA 
CCGATCTCAA 
GATACTACAT 
CCATCGGCAT 
ATATGGACGT 
CGGTCGAAGA 
CTCGCGTATC 
GTGTATGGCT 
ACACAGCTAT 
ATGGCAAGAC 
AGTCTGCCAT 
GCTCCATGCC 
AC 



GCCATT CAT G 
CGGGCTTTGG 
CGT GAAAATG 
GAAGCTCGTG 
CCAACGCGGA 
CCCGATGAAC 
GGAAGAAATA 
CAAAGAAATC 
TTCTCCGGGA 
GGCCAAAAAG 
AGCCTGCGCA 
ATTCGGCATG 
CGTACAGACA 
TCGTGCCCTC 
GGAAGGCCGC 
CGGCGACAAA 
CAAACGGGCC 
CAAGCGCAAC 
CCTGAGTACC 
GCCCCGTCCT 



CTCAACAGGA 
GACAAGATCA 
TTTGCCGACA 
GATGAGCTGG 
GAGAAAGATG 
AAAACGAAAG 
ATAAAGAAGC 
AAGGCAGCAG 
GGTAGTGCTT 
CCTATCGTGG 
GCCAACAGTA 
TTCCCGAACT 
TCCAAGTATG 
ATCCAACGCT 
AACCGCACCA 
GCTCTTGCAC 
GCGAAGCTGG 
TTCTTCGAAG 
ATTCTCTCCG 
TCCGGCATAC 



TGAGCGATGC 
CATCCGATAT 
AAGGCGAAAT 
CTTACCGTAC 
AACTTCGGTT 
GCAGTCGGAT 
CGTTCGACAC 
CCGATGACGA 
TCACTTCCGA 
TCTCCATGGG 
TCGTGGCAGA 
TCGCGGGCGT 
CAGACTTGGG 
ACATAGAGCA 
AGGCACAGAT 
TCGGTTTGGT 
CTCAGCTCGG 
AGTTGCTCTC 
ATCCGGAAAT 
AGGCACGTCT 



CAATCGCGAA 
TGCAGAGTCG 
GTTCGGTCTT 
CGATGTGGAG 
CGTATCGCTT 
CGCCGTTCTC 
TGACGGTAGC 
TGATATCAAA 
ACAGATATGG 
CGACGTAGCA 
GCATACGACT 
AGCCAAGAAG 
CAACACCTTC 
GGGCTACGAC 
CGACAGCATC 
GGATGAGCTT 
TGGCAACTAC 
CTCATCAGCA 
AGAAGTTCTG 
CCCCTATTAC 



660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1012 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 12 

(J.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv> ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: nusc_f eature 
(3) LOCATION 1...97 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:12 



CTAACGTTGT 
TCCGTTTCGG 
ACATACGAGG 
AATATACCGG 
AC ACT GG CCA 
AAGCGCACCT 
CGCGAATCGT 
AAACTCGACA 
ACCAATCGGG 
TTGGATCGCG 
TCCCATCAAA 
GGACTGCTCT 
GACATGAGAG 
AAGGGCGATG 
CACGTAGTGC 
AAGAACCTCT 
CTGCGCTTGC 



TTTGTTGCAA 
TTCTCTTTTT 
CT TATGTGAA 
CAAGC AT CAC 
GCGTACACAA 
ATCGTACCGA 
ATGAAGATCA 
GAGAAGACTA 
GCTATGCCAA 
AGAAGTACCC 
CGACCAAGCA 
ACATCATAGC 
CCTCCAAACT 
TGAT CTATCT 
GTGTGGGCGA 
ACAAGCTCAA 
GC 



CTATTTCAAA 
CTTCGGGACA 
ACAGTACGCC 
CAT AG CAC AG 
CAATCACTTC 
CGATGCGCCG 
TTCCCGATTT 
TCGGGGCTGG 
TCTGCTGATC 
CTCATGGTTC 
GAAGCAGAGC 
CAAGCAAGGC 
GGCCAAATAC 
GGAGAAAAAG 
TTCGATGCAC 
CGACAAGGAT 



CAGATGAGAG 
GCGATCGGAC 
GACGAAGCTA 
GCTTTGGTGG 
GGGATCAAAT 
AACGAATG CT 
CTGCTCCAAC 
GCTACGGGGT 
AAGATGGTGG 
CACAAGTCTT 
GG ACT CAAGC 
GATACCTTCG 
AACGATGCTC 
CACGCAT GCT 
AGTATCTCCC 
GGCGAATATA 



CAAACATTTG 
AGGCTCAGAG 
TCCGACAGAT 
AGACAGGAGC 
GCCACAAATC 
TCCGCAGCTA 
CAC GCT AT CG 
TGCAACGCTG 
AGCTGTATGA 
ACCCCGGGTC 
ACGAAGCTTA 
ATTCTTTGGC 
CCGTGGATTT 
CCATCTCCAA 
AACGCTATGG 
TACCCCAAGA 



GCAGAT ACT T 
TCGAAACCGT 
GAGCCG CTAC 
CGGAGCCAGT 
GTGGACGGGC 
TTCGGCCGCT 
TCCCCTGTTC 
TGGCTATGCC 
GCTATATGCT 
CAACAAAAAA 
CTTCAGCTAC 
CG AAGAGTTC 
CCCGATCGAA 
ACACACACAG 
CATCCGGATG 
GGGCGATATA 



(2) INFORMATION FOR SEQ ID HO: 13 

U) SEQUENCE CHARACTERISTICS: 

(A| LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
[D\ TOPOLOGY: circular 

fii) MOLECULE TYPE: DMA (genomic) 



60 
120 
180 
240 
300 
360 
420 
460 
54 0 
600 
660 
720 
700 
840 
900 
960 
972 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

5 tvil ORIGINAL SOURCE; 

(A) ORGANISM : PORYPIIVROMOWAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eaturc 
10 (B ) LOCATION 1...1S99 

txl) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

AGGATCCCCG ACGAGCAGAC CCGACGTATC ATGGACGGAC GTCGATATTC GGATGGCCTC 60 

15 CAT CAGGCTA TCGAAGCCAA AGAGCATGTG AAAGTAGAGG CTGCGACACA GACATTTGCA 120 

ACT AT CACTT TGCAGAACTA TTTCCGCATG TAT CAT AAGC TGGCAGGGAT GACCGGTACT 180 

GCTGAAACTG AAGCGGGAGA GCTTTGGGAC ATCTACAAAC TGGACGTTGT AGTTATTCCG 24 0 

ACAAACAAGC CTATCGCCCG TAAGGATATG AATGATCGTA TCTATAAGAC GGCACGTGAA 300 

AAATATGCAG CAGTTATCGA AGAGATTGTA CGTCTTGTCG AAGAGGGCAG ACCTGTACTT 360 

20 GTCGGTACTA CTTCGGTGGA AATATCCGAA TTGTTGAGCC GTATGTTACG CTTGCGTGGC 420 

ATCCAACACA ATGTACTCAA TGCCAAATTG CATCAGAAGG AGGCCGAGAT TGTAGCTCAG 4 80 

GCCGGTCAGA AAGGAACTGT TACCATCGCA ACGAACATGG CCGGTCGTGG TACCGACATC 54 0 

AAGCTCTCl'G CCGAGGTTAA GAAAGCCGGG GGTTTGGCTA TCATTGGTAC GGAAAGGCAC 600 

GAATCCAGAC GAGTGGACAG ACAGCTTCGT GGTCGTTCCG GCCGTCAGGG TGATCCCGGT 660 

25 TCGTCCATAT TCTATGTTTC CCTT GAAGAT CAT CTGATGC GCCTCTTTGC CACAGAAAAG 720 

ATTGCATCAT TGATLGATCG TTTAGGTTTC AAGGAAGGAG AAGTGCTCGA AAACAACATG 780 

CTCAGTAAGT CCGTGGAGCG TGCTCAAAAG AAGGTGGAAG AGAACAACTT CGGTATCCGT 8d 0 

AAACATCTGC TTGAGTACGA TGATGTAATG AATTCGCAGC GTGAAGTCAT TTATACCCGT 900 

CGCCGTCATG CTTTGATGGG AGAGCGTATC GGTATGGATG TACTCAATAC CATATACGAC 960 

30 GTATGTAAGG CTCTGATTGA CAATTATGCA GAAGCCAATG ATTTCGAAGG CTTCAAGGAA 1020 

GATCTGATGC GTGCACTCGC GATAGAATCT CCTATCACGC AAGAAATATT CAGAGGTAAG 1080 

AAAGCAGAAG AGCTGACCGA TATGCTTTTC GATGAAGCTT ACAAGTCTTT CCAACGTAAG 114 0 

ATGGATCTGA TCGCAGAAGT GGCCCACCCT GTGCTTCATC AGGTATTCGA GACCCAAGCC 1200 

GCCGTGTACG AGCGCATTCT AATCCCCATT ACGGATGCTA AACGTGTCTA TAACATAGGA 1260 

35 TGCAATTTGC GTGAAGCGGA TGAAACTCAA GGGAAAAGCA TCATCAAAGA ATTTGAGAAA 1320 

GCTAT CGTAC TGCATACTAT CGATGAGTCT TGGAAAGAAC ATCTGCGTGA GATGGACGAG 1380 

CTTCGTAATT CCGTTCAGAA TGCCAGCT AC GAAAACAAAG ATCCACTACT TATCTATAAA 1440 

CTCGAATCTT ACGAACT GTT CCGCAAGATG GTAGAAGCCA TGAACCGTAA GACCGTAGCG 1S0O 

ATCCTAATGC GTGCTCGGAT ACCGGTACCG GAGGCTCCTT CCCAAGAAGA GCTGGAACAC 1560 

40 AGGCGGCAAA TAGAAATCCG AC AT GCAACC CAACAACGT 1599 



(2| INFORMATION FOR SEQ ID NO: 14 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 
{ B} TYPE: nucleic acid 
tC> STRANDEDNF.SS : double 
(D> TOPOLOGY: circular 

50 

{ ti 1 MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI-SENSC: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

60 FEATURE: 

(A) NAME/ KEY: mlsc_feature 

(B) LOCATION 1...2160 



65 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



AAATCTTGCC GTGTTATTGG CCAGACGAGG CGATACGGAT GCTGCCCTGT CGGACTACGA 60 

CCGTGCCATC AAAGCCTATC CGGAGTTTGC CGATGCCTAT TTCAATCGCG GCCTGCTGTT 120 

GCTTTCGCGC GGAAAGGCCA AAGAAGGCAT CGCCGATCTG AGTCGGGCAG GCGAATACGG 180 

GCTCTACAAG GCGTACAACA TCATCAAACG AATGAGCACG AAGTCATGAT CTCCGTCAAT 24 0 

70 AACCTGACTG TCGATTTCGG CACCCGTCTG CTCTTCGATC AGGTATCATT CGTCATCAAC 300 

AGGCGCGACC GTATCGCTCT TGTAGGGAAG AACGGTGCCG GCAAGAGTAC GCTGCTCAAG 360 

CTGATTGCCG GCATGGAAGA ACCGACATCC GGACACATAG CACGCCCCAA GGGGATCCGC 420 

ATAGGCTATC TGCCGCAGGT GATGCGTTTG CAGGACGGAC ACACGGTTTA CGAAGAGGTC 480 

GAGCAGGCTT TCAACGATAT TCGCCAAATA GAGGAAGAGA TACGGCGTCT GTCCGATGAA 64 0 

75 ATGGCCGGAC GTACGGACTA CGAATCGGAT GACTATATCC GACTGATAGA GCATTATACG 600 
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15 



20 



25 



AATATGAGCG 
ACATTGATCG 
GGAGGATGGC 
CTGCTCGACG 
ATCGCCACCA 
ACCACGACAC 
AGCCACTATG 
CAGCAGAAGA 
AAGTCCGTAC 
GACGAGCGGG 
TATCCGCTAA 
GCTACATACA 
AAAAGTACCA 
CTGGGGCACA 
GATCTCACGG 
AACGATTTGC 
CTGAGTGGAG 
TTCCTTATTC 
GAGGCGATCA 
GATGGGCTTG 
GGTATATACG 
ACTACGAT'JG 
TACCGTCGGC 
TGCGAGGAGC 
GATCCGAAAC 
CTCGAAAAGG 



AGACCCTCTC 
GTCTGGGCTT 
GTATGCGGAT 
AGCCGACCAA 
ATG CAGGAGC 
GCACTATCGA 
TGGAGCTGCG 
TGATCCGCGA 
AGGTACAGAG 
ATCGTTCGGC 
TAGTGGATGA 
CCATCGAAAG 
TGGTCAAGTG 
ACGTGCAGCT 
TATTCGACAC 
TCGGGGCTTT 
GAGAACGAGC 
TCGATGAGCC 
AGAACTTCGA 
TCAGCAAGGT 
ACTATCTCCG 
AAACAAAAAC 
AAAAGGAGGT 
GGATCGGAAA 
AT GCG ACT GA 
CCATGGAGGA 



TCTCATGCAG 
CGGCCGAGAG 
AGAGCTGGCC 
TCACCTCGAC 
CGTTATCCTG 
AATAGAACTG 
CGAAGAGCGG 
TACGGAGGAC 
CCGGATCAAA 
ATTTCACTTC 
TTTGGCCAAG 
AGGCGAAAAG 
TATCATGGGA 
GGGCTACTTT 
GATAGACCGT 
TCTCTTCGGG 
ACGATTGGCT 
GACCAATCAC 
TGGGACTGTC 
GTATGAATTT 
GACCCGCCGT 
CACACGGGAG 
AGCCAAACAG 
ATTGGAGTCG 
CGCGAATCTG 
CTGGGAACAG 



CAGGGCAACT 
GACTTCCACC 
AAACTTCTGC 
ATCGAATCCA 
GTGTCGCACG 
GGACATATAT 
CTGCGACAGC 
TTCATCGAAC 
CAGTTGGAGA 
CGCTTTATCC 
GCTTATGGCG 
GTGGCTTTCG 
GAGCTGACAG 
GCCCAAAACG 
GAGGCCGTGG 
GGCGAAGCAT 
ATT AT CAGGC 
CTCGATATGC 
ATCGTAGTAT 
GCAGATGGAC 
ATGCAGACGC 
GCTATACCTG 
CTGCGCACGT 
GAATTACAGG 
TTCGAGCGAT 
GCTTCCGAAG 



ATCATGCTGC 

GCCCCACAGC 

TCCAACGCCC 

TCGGCTGGCT 

ACAGGGCATT 

ACGACTACAA 

AGATGCGTGC 

GATTCAGATA 

AAGTAGAGCG 

CGGCACAGCC 

ATCACCAGGT 

TAGGCAAAAA 

ACTACACCGG 

AAGCCCAAGA 

GCGACATCCG 

OGGAAAAGAA 

TTTTGCTACA 

GCTCGAAGGA 

CTCACGACCG 

AGGTGAACGA 

TGACAGAGCT 

AAACGGAAGC 

TGGAGCGAAC 

CAATAGAGAT 

ACGCCGGCAT 

CTTTATCCGA 



GATCGAACAG 

CGATTTCAGC 

CGAAGTTTTG 

GGAGCAGTTC 

CATCGACAAT 

GACCAACTAC 

CTACGAGAAT 

CAAGGCCACG 

CGTGGAGATA 

TTCCGGCAGT 

GTTTTCCGGA 

CGGTGCCGGC 

CAAGCTCGAA 

GCTAAGAGGG 

TCTGCGCCTG 

AGTAAGTGTC 

GCCGGCTAAC 

TGTACTGAAA 

TGAGTTCCTC 

ACACCTCGGA 

GGAGCGAACC 

CAAAGCGGAC 

CGTAGCAACC 

GCTACTGCAA 

GAAACAAGAA 

AGCCCAAGGA 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16BO 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(2) INFORMATION FOR SEQ ID NO: 15 

(1) SEQUENCE CHARACTERISTICS : 

(A) LCNGTH : 1158 base pairc 
<B] TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D> TOPOLOGY: circular 

ill) MOLECULE TYPE: DMA (genomic) 

<iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMONAS GINGIVALIS 

( ix 1 FEATURE : 

(A) NAME/KEY: mlscjeature 
(B> LOCATION 1 1158 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: IS 



TATTTGAAGC 
AAAATATTCT 
CGCGCCGTGA 
AGCGTTTGTA 
CAT CTCGG AA 
TATCGAAGGA 
ATTTTTATCA 
TGCGAACATA 
TATTCAGGGC 
AT CTT ACGTG 
CAAAGAGATG 
GGTTTTGAAT 
CTTAT CAT AG 
AATATGGAGT 
TCATCTGCTA 
GCAGAAG CAT 
ATCGAAAACG 
AAGTTACGCT 
TTGATGGTCA 
GAAATCGGCT 



TATTAATACT 
CTATGATTTA 
TCAACTTGGC 
CAAAAGAAGG 
TTAGGCTTTA 
TAAAAGCCCT 
ATACAATTTT 
TCTCTTATGA 
'1TGATGCCGT 
GACGCTCTAA 
CTACTACTCA 
TCATGATAGA 
TCGGAGATGG 
CGCAAATAGA 
TTTAT CTAAT 
ATGCACTACC 
GTCGCAATGG 
T ATT GAT GG A 
AATCCTACTC 
ACATGAAT 



ACAAATAGCC 
TCTGCTGTTA 
TAACAACCTG 
AGAGCCTTCC 
TGGCAATGCA 
ATACAAGAAG 
GTCTCAGATC 
TATTGCCCGC 
TGTAGCACTG 
AGCATATGTC 
CAAACAAATG 
AGATGCATCA 
CGAAAATGAA 
AATACATCCA 
GACGTCCCGT 
TATAATCTCA 
TTTCCTTGTG 
TGATGAAACT 
TCCGGCAAAT 



TTGATGAATT 
GATACAATAA 
CATGCCAATG 
TTCCAAGTAG 
TTAGCCCGCA 
CGTGAACCGG 
AGTAACAGAG 
CCTATTACAA 
ACAAAAAGAG 
ATACCCAATC 
TTGGCGATTG 
CGAGTGCTGC 
TCGATGCTAC 
TCTACACCGG 
TTCGAAGGAC 
TACGATTGTC 
CCAATGGAAG 
CTTCGTAAGA 
ATCTATGAAT 



TCTTAAAAAA 
CAAACCGTGC 
GTCATCGCGT 
AAAAAGGAAT 
AAACAGTATA 
TTTTATTGAT 
GCAGAATATT 
AACGCATAAG 
ATCAGCAATC 
AAGT TTCATT 
GCAGGCTTAC 
GAGAAAGGCC 
GTAAAGAAAT 
AAATTCGCAA 
TACCAATGGT 
CGACCGGCCC 
CACATGAAGA 
AAATGGGACA 
GTTGGAAGAA 



AGAACCGTTT 
CGGTACAGAA 
ATCATTAGTC 
AGAAGTACAC 
TTTCAAGGCT 
AGGGACTAAT 
TACGATCGGA 
GGGGTTTCTG 
GTTCGAGGCA 
TACTACAGTC 
CTACCAGAAG 
TGATTGGAAG 
TGCATCTCGC 
ATACTACGAA 
ACTTCTCGAA 
GAGGGAACTG 
C1TCGCGGAT 
AGAATCAGAG 
ACT AT TCGTC 



60, 
120 * 
180 
240 
300 
360 
420 
480 
540 
600 
660 
7 20 
7 80 
840 
900 
960 

1020 

1080 

1140 

1158 



75 



(2) INFORMATION FOR SEQ ID NO: 16 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1965 base pairs 
<B) TYPE; nucleic acid 
<C> STRAW DEDNESS; double 
(D) TOPOLOGY: circular 

tii) MOLECULE TYPE; DMA (genomic) 

Uii) HYPOTHETICAL: NO 

Uv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 
(xx) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION 1...196S 

(xij SEQUENCE DESCRIPTION : SEQ ID NO;16 



ACAACGAAAG 
GGAATTGACT 
GTTATTACAA 
GGCGAGCGTA 
ATATACTCTA 
AGAGTGCCAT 
CGTCTCTATA 
GAAGACTACC 
GACGCTCAAC 
AT T GTGAACG 
ATGAAGAT CG 
GGCGACGGCG 
TTCGACCACG 
CTTCGCCAGG 
GAGCTCTCCA 
ATCCCCAAGC 
CTGATTCAGG 
GGCGATATCG 
ATTGTGGAGA 
GCTGTGGGTG 
TTGGACGTTA 
ATCGATGCCA 
AATCAACCTT 
AAGAGCATCG 
ATCGAAGTAA 
GCTACCGGCA 
ATCAAGCGCA 
CGTATCGACA 
GAGTTGGGAG 
CTGAAAGAAG 
CAAACCGCTC 
GGCGCACAAC 
TCTGACGACA 



AAAACGAGAC 
TAGGCACAAC 
ACAGTGAGGG 
AGGTGGGCGA 
TCAAACGCTT 
TCAAGGTAGT 
CGCCGCAGGA 
TCGGTCAGGA 
GTCAGGCAAC 
AGCCTACGGC 
CTGTCTTCGA 
TTTTCGAAGT 
TGATCATTGA 
ATCCTATGGC 
GCACTTCATC 
ACTTGGTGAT 
CATGTGTGGC 
ATGAAGT GAT 
AGATCTTCGG 
CCGCTATTCA 
CCCCCTTGTC 
AT AC CACT AT 
CGGTAGAGAT 
GCCGTTTCAA 
CGTTTGACAT 
AGAAGCAGAA 
TGAAGGAAGA 
AGATCAATCA 
ACAAATTCCC 
CACACAAAGC 
TTTCCGCAGC 
CCGGTCCGGA 
AGAACGTCAC 



AACAACAAAA 
GAACTCTTGT 
CAAGCGCACA 
TCCGGCCAAG 
CATGGGCGAA 
ACGTGGGGAC 
AATTTCGGCC 
AGTAACGGAG 
GAAAGAAGCA 
AGCTTCTCTG 
CTTGGGTGGC 
GAAATCGACC 
CTGGCTGGCA 
TAT GCAGCGT 
TACGGAGATC 
GACGCTTACA 
ACCCTGCGAA 
TCTCGTAGGT 
TAAGGCTCCG 
AGGCGGTGTT 
GCTCGGTATC 
CCCGACGAAG 
TCATGTACTT 
CTTGGACGGT 
CGATGCCAAC 
TATCCGCATC 
GGCGCAGGCC 
GGCCGACAGC 
GGCCGACAAG 
ACAGGATGTA 
GGGCGAAGAG 
CTTCGGCGGT 
AGACGTAGAC 



AACGAATATA 
GTCTCTGTAT 
ACGCCCTCGG 
CGTCAGGCCA 
ACTTACGATC 
AATAATACTC 
ATGATCCTTC 
GCCGTGATCA 
GGAGAGATCG 
GCCTACGGTC 
GGTACCTTCG 
AACGGT GAT A 
GAAGAGTTCA 
CTGAAAGAAG 
AACCTCCCCT 
AGGGCTAAGT 
ACGGCCTTGA 
GGTTCCACAC 
TCCAAGGGTG 
CTGACCGGTG 
GAGACTATGG 
AAGAGCGAAA 
CAGGGTGAGC 
ATTGCTCCGG 
GGTATCCTGA 
GAAGCCTCCA 
AATGCCGAAG 
ATGATCTTCC 
AAGGCTCCGA 
GCTGCTATCG 
CTTTACAAGA 
GCTCAAGGTC 
TTCGAGGAAG 



GAATTATGGG 
TGGAAGGTAA 
TAGTGGCTTT 
TCACCAATCC 
AGGfTTCCAG 
CGCGCGTAGA 
AGAAGATGAA 
CTGTGCCCGC 
CCGGCCT G AA 
TGGACAAGTC 
ATATCTCTAT 
CGCACCTCGG 
AGTCTCAGGA 
CTGCCGAAAA 
ATATCATGCC 
TCGAGCAGTT 
AAGATGCCGG 
GTATTCCTGC 
TGAATCCCGA 
AGGTAAAGGA 
GAGGCGTGAT 
TCTTTACCAC 
GTTCTTTGGC 
CGCCCCGTCA 
ATGTAACGGC 
GCGGTTTGTC 
CAGATAAGAA 
AGACGGAAAA 
TCGATACCGC 
ATACAGCCAT 
ATGCCGGAGC 
CCTCTGCCGG 
TGAAG 



AAAAATCATT 
CGAACCTATC 
TGTGGATGGT 
GACCAAGACG 
AGAAGTGGAG 
TATAGACGGT 
GAAGACGGCC 
ATACTTCAAC 
AGTTCGCCGT 
CAATAAGGAT 
CTTGGAATTG 
AGGAGACGAC 
AGGTGTGGAT 
AGCCAAGATA 
GGTGAACGGC 
GGCCGATCGT 
TATGTCACGT 
TATTCAGGAG 
CGAAGTGGTA 
TGTCTTGCTG 
GACTCGCTTG 
AGCAGTGGAC 
TAAGGACAAT 
GACACCGCAG 
TCATGACAAA 
CGATGATGAG 
AGAGAAAGAA 
GCAGTTGAAG 
TCT CGACAAA 
GGCCGAACTG 
AGCCCAAGGT 
TGATCAGCCC 



(2) INFORMATION FOR SEQ ID NO: 17 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
Uv) ANTI-SENSE: HO 

( -l > ORIGINAL SOURCE : 

(A) ORGANISM: PORY PHYROHONAS GINGIVAL IS 
Ujc) FEATURE: 

(A) NA1 IE/KEY : micc^feature 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
J 630 
1740 
1800 
1860 
1920 
1965 
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<B) LOCATION 1...14 01 

Ixl) SEQUENCE DESCRIPTION: SEQ ID NO; 1*7 

5 AAGTGGGCAC GTACAACTAC TTTAAGGATA TCGAACGCAA CAATCTATAA AACTATGCGC 60 

TACGACTTAG CTATCATCGG TGGAGGGCCG GCCGGTTATA CGGCTGCCGA ACGTGCTGCC 12 0 

AAAGGTGGCC TGAAAACCCT CCTAATTGAG AAGAATGCTC TCGGTGGTGT ATGCCTCAAC 180 

GAAGGATGTA TACCGACCAA GACGCTACTC TACTCGGCCA AAGTGCTACA TCAAATTGCT 240 

ACGGCATCTA AATATGCAGT AAGTGGAACG GCCGATGGAC TTGACCTCGG CAAGGTGATT 300 

10 GCCAGAAAAG GTAAAATCAT TCGCAAGCTG ACTGCAGGCA TCCGTTCACG CCTGACAGAG 360 

GCCGGAGTAG AGATGGTGAC GGCAGAAGCT ACCGTAACGG GATGCGATGC AGACGGCATC 420 

ATCGGCATTA CTGCGGGCGA AGCACAGTAC AAAGCTGCCA ACCTGCTACT ATCTACCGGT 480 

TCGGAGACGT TTATTCCACC CATCCCCGGA GTGGAGCAGA CAGAGTATTG GACAAACCGT 540 

GAAGCTCTAC AGAACAAAGA GATTCCGACC TCTCTCGTCA TCATCGGTGG TGGAGTGATC 600 

15 GGAATGGAGT TCGCTTCTTT CTTCAACGGT ATCGGTACGC AAGTGCACGT GGTGGAGATG 6 60 

CTGCCGGAAA TACTCAACGG TATCGATCCC GAACATGCAG CTATGCTACG CGCTCACTAT 720 

GAAAAAGAAG GAATCAAATT CTACCTCGGG CACAAAGTAA CATCGGTTCG CAACGGAGCT 730 

GTTACGGTAG AATACGAAGG AGAAAGCAAA GAGATCGAAG GAGAACGTAT CCTGATGAGT 84 0 

GTGGGACGTC GCCCCGTGCT GCAAGGATTC GAGTCGCTCG GATTGGTGCT TGCCGGCAAA 900 

20 GGTGTAAAGA CTAATGAGAG GATGCAAACT TCCCTGCCCA ATGTCTATGC TGCAGGTGAT 960 

ATTACAGGCT TCTCGCTTTT GGCACATACG GCTGTACGGG AAGCAGAGGT AGCAGTAGAT 1020 

CACATTTTGG GCAAAACAGA CGAAACGATG AGCTACCGTG CCGTACCAGG TGTGGTGTAC 1080 

ACCAATCCCG AGGTCGCCGG TGTGGGAGAG ACGGAAGAAT CGCTTCGCAA AGCAGGACGT 1140 

GCCTACACTG TTCGTCGCCT TCCTATGGCC TTCTCCGGTC GATTTGTAGC AGAAAACGAA 1200 

25 CAAGGCAATG GAGAGTGCAA ACTACTACTT GATGAAGAGA ACCGCTTGAT CGGAGCACAC 1260 

CTCATTGGCA ATCCGGCCGG CGAACTCATC GTAACCGCTG CCATGGCCAT CGAGACCGGC 1320 

ATGACGGATC GACAAATCGA ACGAATCATA TTCCCTCATC CGACTGTAGG CGAAATCCTA 1380 

AAAGAAACTC TCGCCGGAGG T 1401 



30 



40 



(2) INFORMATION FOR SEQ ID HO: 18 



ii) SEQUENCE CHARACTERISTICS: 

IA) LENGTH : 2835 base pairs 
35 (B) TYPE: nucleic acid 

{CI STRANDEDNESS: double 
(D) TOPOLOGY: circular 



til) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL*. MO 
(iv) ANTI -SENSE: NO 



45 {vt) ORIGINAL SOURCE: 

(A] ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : miscfeature 
50 (8) LOCATION 1...2835 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

CCTAAGATAC TTATGGAATT GAAAAGATTT TTATCACTTG GTCTTCTGCT TGTGGGATTC 60 

55 ATTCCGATGA AGCTTTCTGC CCAACAGGCT CAGCCACTCC CTACAGATCC GGCTGTTCGT 120 

GTCGGTAAGT TGGACAACGG ATTGACTTAT TTCATCCGTC ACAACGAGAA CCCGAAAGAT 180 

CGTGCGGATT TCTTTATCGC ACAAAAGGTA GGTTCTATTC TTGAAGAAGA TAGCCAGTCC 240 

GGTTTGGCTC ACTTCTTGGA ACACATGGCT TTCAACGGTA CGAAGAACTT CCCCGGTAAG 300 

AACTTGATCA ACTATCTCGA AACGATCGGT GTACGTTTCG GTCAGAACCT GAACGCTTCT 360 

60 ACCGGATTCG ACAAGACGGA ATATACGATA ATGGATGTGC CGACTACACG TCAGGGAATC 420 

ATCGACTCCT GCTTGCTTAT CCTGCATGAT TGGAGTAACA ATATTACCCT CGACGGGCAT 480 

GAGATCGACG AGGAGCGCGG TGTGATCCAG GAAGAGTGGC GTGCTCGTCG CGATGCCAAC 540 

CTTCGTATGT TCGAGGCTAT ACTTGCCAAG GCTATGCCGG GTAATAAATA TGCAGAACGC 600 

ATGCCCATCG GTCTGATGGA CGTCGTGCTC AACTTCAAGC ATGAT GA/GCT GCGCAACTAT 660 

65 TATAAGAAAT GGTATCGTCC CGACCTGCAA GGTCTGGTGA TCGTGGGAGA TATCGATGTG 720 

GACTATGTGG AGAACAAGAT CAAAGAACTC TTCAAGGACG TTCCTGCTCC CGTGAATCCA 780 

GCAGAGCGTA TCTATACGCC GGTAGAGGAC AACGATGAGC CTATCGTAGC CATTGCTACC 040 

GATGCTGAGG CTACTACCAC GCAGCTCTCC ATCAGCTTCA AGAGCGACCC CACTCCTCAA 900 

GAAGTGCGAG GA'i'CGATATT CGGACTTGTG GAAGACTATA TGAAACAGGT GATCACTACA 960 

70 GCCGTGAATG AGCGTCTGTC CGAGATTACT CACAAGCCTA ACGCTCCTTT CCTCAGTGCA 1020 

GGAGCTTTCT TCT CT AACTT CATGTACATC ACCCAGACTA AGGACGCATT CAATTTTGTT 1080 

GCCACGGTTC GTGAGGGTGA AGCGGAGAAA GCGATGAACG CATTGGTGGC AGAGATAGAA 1140 

AGCCTCCGTC AGTTCGGTAT CACCAAAGGC GAATACGATC GTGCACGCAC GAATGTGCTC 1200 

AAGCGATACG AGAATCAATA CAACGAAAGA GACAAGCGTA AGAACAATGC TTATGCCAAT 1260 

75 GAATACTCCA CCTACTTCAC CGATGGCGGC TATATCCCCG GTATTGAGGT GGAATATCAG 1320 
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ACGGTGAATG CTTTTGCTCC TCAGGTTCCT CTGGAAGCAT TCAATCAGGC TATTGCCCAA 13 BO 

ATGATCGATC CGGTGAAGAA TGCTGTCGTT ACCCTCACCG GTCCTTCAAA GGCTGAAGCC 1440 

AAGATTCCGA GCGAAGCAGA CTTCCTCGCT GCTTTCAAAG CTGCTCGTCA GCA.G AAAGT A 1500 

GAAGCCAAGA AAGACGAAGT CTCCGACCAA AAATTGATGG AGAAAGCTCC TAAGGCCGGA 1560 

5 AAGATCGTTT CCGAGAAGAA AGATCAGAAG TTCGGTACCA CGGAACTTAC CCTTAGCAAT 1620 

GGCATCAAAG TATACCTCAA GAAGACCGAT TTCAAATCAA ACGAAATCCT GATGAGTGCT 16SO 

CTCAGCCCGG GTGGTATCCT CTCCGGAAAG CATGCTCCCA ACCAATCTGT GATGAATTCG 1740 

TTCATGAACG TGGGTGGCTT GGGCAACTTC GATGCTATCC ACCTGGATAA GGTGCTGACA 1800 

GGTCGCTCTG CTTCCGTATC TCCCTCTTTG TCTCTGCTCA GTGAAGGTCT TTCGGGCAAA 1860 

10 ACGACTGTAG AAGATATGGA AACTTTCTTC CAGTTGATCT ATCTCCAAAT GACTGCTAAC 1920 

CGCAAGGATC CCGAAGCGTT CAAGGCCACA CAGGAAAAGT TGTACAATAA CTTGAAAAAT 1980 

CAGGAAGCCA ACCCGAT GGC TGCGCTTATG G ACT CT AT CC GTCATACCAT GTACGGCGAT 2040 

AATCCGATGA TGAAACCCAT GAAAGCTGCT GACGTGGAGA AAGTAAATTA CGATCAGGTA 2100 

ATGGCTTTCT ACAATGAGCG ATTCGCTGAT GCCGGCGACT TTATGTTCTT CTTTATCGGT 2160 

15 AATCTGGATG AAGCCAAGAT GAAGCCATTG ATCGAAACTT ATCTTGCTTC ATTGCCCAAC 2220 

CTCAAG CGTG GCGATAAGAT GAATAAGGCT CAGGTACCGG CTGCCCGTTC GGGAAAGATC 2280 

GATTGCAAGT TCGAGAAGGA AATGGATACT CCTTCGACTA CTATATTCGA TGTCGTGTCC 234 0 

GGAAATGTGG AATATACGCT CAAGAACAGT CTCCTGCTGG AAGTCTTCTC AGCCGTAATG 24 00 

GATCAGGTGT ACACGGCTAC CGTTCGCGAG AAGGAAGGCG GTGCATACAG TGTGGCTGCA 24 60 

20 TTCGGCGGTC TCGAGCAATA TCCTCAGCCC AAGGCTCTGA TGCAGATCTA TTTCCCCACG 2520 

GATCCTGCTC GTGCCGAGGA AATGAATGCT ATCGTTTTTG CTGAGTTGGA GAAGCTTGCC 2580 

AAGGAGGGCC CCAATGTGGA ATACTTTAAG AAGACTATCG AAAACCTGAA TAAGCAGCAC 264 0 

AAAGAAAGTC TGCG*i*GAGAA TCGTTTCTGG CTCGAAGCCA TGAAGGCGTC TTTCTTCGAA 27 00 

GGAAATGACT T CATC AC AG A CTACGAATCC GTACTGAACG GTCTTACTCC TGCTGAATTG 27 60 

25 CAAAAGTTTG CGGCAGACCT CTTGAAGCAG CAGAATCGGG TTGTTGTCAT GATGGCTCCT 2020 

GTTGCAAAGG CTCAA 2035 



30 



40 



(2) IHFORIIATION FOR SEQ ID HO: 19 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 
35 (DJ TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
CiiH HYPOTHETICAL*. HO 
(IV) AUTI- SENSE: NO 



[vij ORIGINAL SOURCE: 

{ A) ORGANISM: PORY PHYROMOU AS GINGIVAL1S 

45 

Uxl FEATURE: 

(A) NAME /KEY: miscjeaturs 
{ B ) LOCATION 1...2058 

50 <xi) SEQUENCE DESCRIPTION: SEQ ID HO: 19 



TACACTATGA GTAAGAAAGG AACAATCGGG 
AAAAAATTCC TGTACAGCGA C CAT GAG AT A 
GATGCTACGC AGAAGCTGAA AACGCTTACA 

55 GACCTCCGCG TAACGGTCAG CGTG GAT G AA 
GGCGTAGGGA TGACCGAAGA GGAGGTGGAG 
GCGGAAGAGT TTCTTGAAAA GTACAAAGAC 
CTCGGATTTT ACTCGGCTTT CATGGTGTCC 
CGAGAAGATG CTACGGCGGT GAAATGGAGC 

60 CCTGCGGACA AGGCTGACCG TGGCACCGAC 
GAGTTCCTCA AAAAAGAAAA GATAGAGGGG 
GTGCCGATCA TTTTCGGCAA GAAGCAGGAA 
GAGGACAATC AGATCAACGA CACACATCCT 
GACGAAGACT ATAAGGAATT TTACCGTTCG 

65 TGGATCCACC TCAATGTGGA CTATCCGTTC 
ATCAAAAACA ACTTGGATCT GCAGCGCAAC 
GTCACCGATG AAGTACAGGG TATCGTGCCG 
GATTCGCCGG ATATTCCCC1* CAACGTATCG 
AAGAAGATCT CGTCTCATAT CACCAAGAAG 

70 AACGACCGCC CCACATT CG A GGAGAAATGG 
ATGCTGACGG ATGAGAAGTT CTATGAGCGT 
GACGGACACA AGTACACGTT CGACGAATAC 
AAGGACGGAC AGGTAGTGTA TCTCTATGCT 
AAACGTGCAT CCGACAAAGG CTACAGCGTG 

75 ATCGTGAGCC TGCTGGAGCA AAAGTTGGAG 



GTAACGAGCG ACAATATATT CCCCGTCATC 60 

TTCCTGCGTG AGATCGTCTC CAATGCCGTG 120 

TCCGTCGGCG AATTCAAAGG CGAGACGGGT 180 

GTGGCACGCA CGATCACGGT CAGCGACCGC 24 0 

AAGTACATCA ATCAGATTGC TTTCTCCAGT 300 

GACAAGGCCG CCATTATCGG CCACTTCGGA 360 

GAGCGAGTGG ACGTGATCAC GCGCTCTTTC 420 

TGCGACGGAT CGCCCGAATA CACGCTCGAA 4 80 

ATCGTGATGC ACATCGATGA GGAGAATAGC 54 0 

CTCCTCGGCA AATACTGTAA GTTCCTTACC 600 

TGGAAAGACG GCAAGATGCA AGATACGGAC 660 

GCCTGGACCA AAAAGCCTGC CG AC CTCAAG 720 

CTCTATCCCA TGTCCGAAGA GCCTCTCTTC 780 

AATCTGACAG GTATCCTCTA TTTCCCGAAG 84 0 

AAGATTCAGC TCTACTGCAA TCAGGTTTAC 900 

GACTTCCTCA CCCTCCTGCA CGGGGTCATC 960 

CGCTCCTATC TGCAGAGCGA TGCCAATGTG 1020 

GTGGCAGACC GTCTGGAAGA AATTTTCAAA 108O 

GATAGTCTGA AGCTCTTCGT CGAATACGGT 1140 

GCAGCCAAAT TCTTCCTTTT CACCGATATG 1200 

CGAACGCTCG TCGAAGGTGT ACAGACGGAT 1260 

ACGGACAAGC ATG GACAGT A CAGCCACGTG 1320 

ATGCTGTTGG ATGGTCAGTT GGATCCGCAT 1380 

AAGACACACT TTGTCCGTGT CGATAGCGAT 144 0 
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ACGATCAACA 
CGCGCCACTC 
AATGTAGCTT 
TTCATGCGCC 
CTCCCCGATT 
TCCGGTGAGA 
CTGAAAGCGG 
CCTGTTGCCA 
AGTATCAACG 
GACTTGGCTC 
CGCAGCCAGC 



ATCTGATCCG 
TCGTGAAGCT 
TCGAATCGCT 
GTATGCGCGA 
CGTACAATCT 
AAGAATCGGT 
AAGAGGCCAA 
CGAAGGAAGC 
ATCAACTGAC 
TGCTCGGAAG 
GTCTTCTC 



CAAGGAGGAA 
GTTCGAAGCA 
CGGAGCCGAA 
TATGGCACAG 
GGTACTTAAT 
AGAGCCTTCG 
GCTGCTCGAT 
CAAGGAGAAC 
CAAATATGCT 
CGGATTGCTG 



AGAGCCGAAG 
CGCCTGCCAC 
GGTGAAGCCA 
CTGCAGCCGG 
ACCGATCATC 
CTCACAGAGC 
GAGGAAAAAG 
AACGCCGTCG 
CAGGACAACG 
ACGGGAGAGG 



TGAAACTGTC 
GGGACGAGAA 
TCCTTATCAC 
GAATGAGCTT 
CGCTCATCGA 
TTAGAGCGAA 
GGAAGAAACC 
AACAGGCCAA 
AGCTGATAGG 
CTTTGGCCGA 



CGATACGGAG 
GAAGCACTTC 
ACAAGCCGAA 
CTACGGCGAA 
CAGGGTACTC 
AATCGCCGAG 
GGAGGAAATC 
AACCGAAGGC 
TCAGCTCATC 
ATTCATTCGT 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1900 
2040 
2058 



15 



20 



25 



(2) INFORMATION FOR SEQ ID NO : 20 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEHGTH: 14 4 6 base paira 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: doubls 
( D J TOPOLOGY : circular 

(11) MOLECULE TYPE: DNA (genomic) 

(ixi) HYPOTHETICAL: NO 

<iv) A1ITI-SENSE: NO 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMONAS GINGIVALIS 

( ix ) FEATURE : 

(At NAME/ KEY : miscjeature 

(B) LOCATION 1. . .1446 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 20 



GACATTCGAC 
AAACTGATCG 
GTGCTCAAGC 
ATAATGCGGA 
TCCCTTTCGA 
TCCTGCCCTG 
AGGCTT G GAG 
ATAGATACGG 
GACAGAGATA 
ATCATAGGCC 
ACCCATGCCT 
GCTCTGCACC 
ATTCCCGAAA 
GCTTTTTCCG 
GAAATAGGCG 
ACATTCGAGC 
ACGAATGCTT 
CGAATCAACC 
CTGCTGGTGG 
TGGAGTGCGG 
CTGAAAAACG 
GATGTGGCTG 
ACCGACATGG 
CGTGCTGCAC 
GGACTG 



AGAAAAGACC 
AT ATTTT GGT 
GAGCGGGCTA 
ATCCTGATGG 
CCTCCGGCAG 
TCATACTGAT 
CTTTCGACTT 
CCTTGCATCT 
CAGCCCGTCA 
GGAGCGATGC 
CTG TGCTGAT 
GTGGGAGCAA 
GTTTGTTCGA 
ACAGGAAAGG 
AACTACCGGT 
CGTTGGGCGA 
CCTTGGAGCG 
TGATACATCT 
AAGCCTTCAG 
AAGCTATGCG 
TAGTGGAGCG 
ACTTCGGTTC 
AGGAAGCTGC 
GAGCCTTGGG 



GTGCTTCAAC 
CGTAGACGAT 
TAAT CCCGTT 
CGGCTGTAAG 
GGAAGGATTG 
GACGGCTTGG 
CATAGGCAAG 
GGCTGCTCCC 
GCCGAAAGCT 
CATCTGTAAG 
CACGGGCGAG 
ACGAGCCTCA 
AAGTGAGCTG 
ACGGTTCGAG 
CGGCAACCAA 
GAGCGTCTCC 
AATGGTAGCC 
GCATCTGCCT 
TGAAGCCTTT 
ACGTATCTGT 
TACGCTATTG 
GCAGGTGACG 
TATCCGAGAG 
ATTGAGCCGG 



GCCAATTTGT 
GATGTGGCAG 
ATAGCCAACA 
CCGGCTGTGA 
GAACTACTGG 
GCTTCGATTC 
CCATGGGACA 
TCAGCTGTGG 
ACAGTCCAAG 
ATCAAGGAAC 
AGCGGTACGG 
GCCCCATTCG 
TTCGGACATA 
CTGGCTGATG 
GTAAAACTGC 
CACCGAGTGG 
GAAGGACGTT 
CCGCTGCGTG 
GCCCAATCGA 
GCCATGCCCC 
CTCTCGGGAT 
GCAGCAGACC 
ACGCTGACTA 
GCAGCTCTTT 



ACTTTTACCG 
TCTGTGCCGC 
GTCCCGACGA 
TTCTGATGGA 
AGAAGATGCA 
CACTGGCAGT 
ACGATCGGCT 
CGAATCCATC 
AG AAT GACCC 
GGATACGCCG 
GCAAAGAGTT 
TCAAGGTCAA 
AGAAAGGAGC 
GCGGCACGAT 
TGCGAGTGCT 
ACATCCGTGT 
TCAGAGAGGA 
AGCGTCAGGA 
ACGGATTGCC 
TACCGGGCAA 
CGAGAGAAAT 
ACT CCGACGA 
AATACAACGG 
ACCGGCGAAT 



TGCAATGGAA 
ACTGCGTCTG 
AGCTTTGTCC 
TATGAATTTC 
GATATTCACT 
GGAGGGAATG 
CCTTCGTACC 
GGAACAGTCT 
CTGTGCCCAT 
CATAGCTCCC 
GATAGCCGAA 
TTTGGGTGGG 
TTTTACCAAT 
CTTTCTGGAC 
ACAGGAACAG 
GGTATCGGCT 
C CT CT ACT AT 
GGATATACAG 
CCATGCCGTT 
TGTACGCGAA 
CAGTGCCCGG 
ACGGGCTTTG 
CAACGTTAGT 
GGAGAAATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1446 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 21 

11) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 78 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
( 0) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

Uii) HYPOTHETICAL: NO 
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(ivj ANTI-SEMSE: NO 



(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROMON AS G1NGIVALIS 



10 



15 



20 



25 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 
(B> LOCATION 1. . ,789 

<xX) SEQUENCE DESCRIPTION; SEQ ID NO:21 



AGATCTCTGC 
CACGCCACAG 
GAGATTCATG 
GTGGGACATC 
CTCGAACTCG 
GTCGAGATCC 
AAAGCGATCG 
CGTGCCATTG 
GGTGGAGAAA 
ATTTTGGACG 
GTAAACCGAC 
CTCGAGTACA 
GGAGGAGCCG 
ATAGGAGAA 



AATCATTTCA 
TACAGGGCAA 
CTATCATGGG 
CCTCCTTTGA 
AACCGGAAGA 
CGGGCGTCAG 
GAGCAGAACC 
TGGAGCTGGA 
AAAAGAGGAA 
AAACCGATAG 
TCCCCTCTCC 
TCAAGCCGGA 
AGCTGGCTCT 



AAATAAAAAA 
AGAGATATTG 
GCCGAACGGA 
AGTCACGGAA 
ACGTGCACAC 
CATGGTGAAT 
CGTATCGGCA 
CAACAAATTG 
CGAAATCTTC 
CGGGCTCGAT 
GGAGAATGCT 
CTTCGTACAC 
CACGCTCGAA 



CACTCAAGTA 
AAAGGAATCA 
TCGGGGAAAA 
GGAGAGGTGA 
CTCGGACTCT 
TTCATGAGGG 
AGCGACTTCC 
GCCAGCCGTT 
CAAATGGCTA 
ATCGACGCTC 
GCTATTGTGA 
GTCCTTTACA 
GAAAAAGGCT 



TGCTTAAGAT 
ATCTGGAGAT 
GTACGCTCTC 
CATTCAATGG 
TTCTCAGTTT 
CAGCTGTCAA 
TCAAGATGAT 
CTGTGAACGA 
TGCTCGAACC 
TCCGCAT CGT 
TCACACACTA 
AGGGGCGCAT 
ACGACTGGAT 



AAAGAACCTC 
CAATGCCGGA 
TTCCGTTTTG 
AATCGACCTG 
CCAATATCCG 
TGAACATAGG 
GCGAGAGAAG 
AGGCTTCTCC 
CAAGCTGGCT 
AGCAGGCGGG 
TCAGCGTTTG 
CGTCAAGTCG 
CAAGGAAGAG 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
709 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 13B6 base pairs 
<BJ TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
<Iii) HYPOTHETICAL: NO 
iiv) ANT I— SENSE : NO 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROM0NAS GTNGIVALIS 

(lx) FEATURE: 

tAl NAME/ KEY: misc_f eature 
<B} LOCATION 1...1386 



50 



55 



60 



65 



70 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



AGCATGGCTA 
TGGCAAGGCA 
GCACCGGCAT 
CCAAGACTCT 
GAGTTCGACC 
GAGCCGGGAA 
CGCACGCTCT 
CTGGGGCAAG 
TCCCG TGCAG 
ACCGAGGAAA 
CTGCTCAAAT 
GAAGGTAGCA 
GACGGGGATA 
ACTTCCGAGC 
AGCGAACATC 
ATGGAGGGCA 
GCCAATCCGC 
GTACTGGAGA 
GGAGGTATCA 
TCGAGTCTGG 
GGAGAGATAC 
TTCAAAGAGA 
ATTCGGCTCG 
AGAGAA 



AGGAGAAAAC 
ACT GCAATGC 
CGGGCAAGCA 
TACAGGATGT 
GCGTACTGGG 
TCGGTAAGTC 
ATGTGTCGGG 
CCATGAATGG 
AAGAACTCAC 
TGGAAAGCTC 
ACTGCAAGAC 
TAGCCGGACC 
AGCATCATCT 
TGGG GAT AT A 
TCAT CACACG 
TTCGCCCGAT 
AGCGTTCGGC 
AACGTGCCGG 
AAATAGCCGA 
ACATCGTTAT 
GTCCCGTGAG 
TATTGGTACC 
TGCCGGTCAG 



GATCTACGTC 
CTGTGGAGAG 
TGCAGCCAAG 
GGAGTCCGGC 
TGGAGGAATT 
CACGCTTATC 
CGAAGAAAGT 
GTGCTACGTA 
ACCCGATCTC 
GGCCGGCAGC 
TACGGGTATC 
GAAGGTGCTG 
CTACCGGATA 
CGAGATGCGG 
CAATAGGGAA 
ACTCATCGAA 
CACGGGCTTC 
CTTCAAGCTC 
TCCGGCTACG 
CCCGCCGGCC 
CCGCATCGAG 
GGCCGATAAT 
AAAGGTGGAG 



TGCCGTTCGT 
TGGAACTGCA 
AGTTTTATGC 
GATGAAGAGC 
GTCAAAGGAG 
CTCCAGACGG 
GCCCGACAAC 
TACT GC G AAA 
CTCGTGATAG 
GTGGGGCAGA 
CCCGTCATCG 
GAGCATATAG 
CTCCGAGGAC 
CAGGACGGTC 
GACCTCAGTG 
GCGCAGGCTT 
GATATTCGGC 
ATACAGAAGG 
GATCTGGCCG 
GTATGCATGA 
CAGCGCATAA 
TTCCGGCAGG 
GAAGCCTTCC 



GCGGAACCAA 
TTGATGAGGA 
CTCGGGAGCA 
GTATTCGCCT 
CATTTGTCCT 
TGCTGCGTCT 
TGAAGATGCG 
CGAATATAGA 
ACTCTATACA 
TCCGCGAATG 
TCATCGGACA 
TGGATACGGT 
AGAAGAACCG 
TGCGTGGCGT 
GCATAGCCAT 
TGGTCAGCTC 
GGATGAACAT 
ATGTGTTTCT 
TTATCTCGGC 
CGGGCGAGGT 
CGGAAGCGCG 
AGGATGCCGG 
GCCATCTGTT 



ATACGCCAAA 
GAAGGTGCCG 
GGACAACCGG 
CGGCGATGAA 
GCTTGGCGGC 
GCCGCAGTTG 
CGCCGAACGA 
GAGGATACTC 
GACGGTCTAT 
TGCCGCCTTA 
CATCACCAAA 
GCTTCTCTTC 
CTATGGCAGT 
GGAGAATCCG 
AGCCGTAGCG 
GGCCATTTAT 
GCTCTTAGCC 
GAACATTGCC 
AGTGCTGGCG 
CGGACTCTCC 
TCGCATAGGG 
CCGCTTCGGT 
CTCGAAAGGA 



60 
120 
180 
240 
300 
360 
420 
480 
S4 0 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1386 
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12 J INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: HI 9 base pairs 

(B) TYPE: nucleic acid 
<C) STRAHDEDNESS: double 
(D) TOPOLOGY: circular 

10 <ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 



15 



30 



45 



50 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PHY ROMOf I A 5 GINGIVALIS 



( ix ) FEATURE : 
20 (A) NAME/ KEY: misc_feature 

{ B} LOCATION 1. . .1119 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 23 

25 GGGTCTTGTG GAAGTAGCCC AOCAATTGGC CGATCAGGGT GTTCGCGTTG TGATCGCCGG 60 

ATTGGACATG GACTTTCGAC GTCACCCTTT CGGACCTATG CCGGGCTTGT GTGCCATAGC 120 

CGACTCCGTG ACCAAAGTTC ATGCCGTGTG TGTGGAATGC GGCCGATTGG CCAGCTATTC i 8 n 

TTTCCGTCGT GTCCAAGGCG ATCAGCAAGT GATGCTGCGC GAACTGAACG AATACAGTCC 240 

C'JTCTGCAGA ACCTGCTACA GGAAATGCAG TTCTCCCCCA CAAACAGAAG AAATCCATTC 3O0 

GACAATATGA ATAGCAGACA TCTGACAATC ACAATCATTG CCGGCCTCTC CCTCTTTGTA 360 

CTGACATTGG GCGGCTGCTC CGTAGCCCAA CAAGATACUC AGTGG ACT CT CGGCGGAAAG 420 

CTCTTTACTT CGGCGT GG AT ACAACGTTCG GCCGAATATC AAGCGCTTTG CATTCAGGCA 4 80 

TACAACATCG CTACGGAAAG ACT GG ACGCT CTACCGGCAG AACGTAAACA AGGAGATAGG 54 0 

CCTTATGCCA TCGTAACGGA CATAGACGAA ACCATTTTGG ACAATACGCC TAACTCCGTG 600 

J 5 TATCAGGCTC TCAGGGGCAA GGATTATGAT GAAGAGACTT GGGGGAAATG GTGTGCACAG 660 

GCCGATGCCG ACACACTGGC AGGAGCTTTG TCTTTCTTCC TCCATGCAGC GAACAAGGGG 720 

ATCGAGGTCT TTTACGTCAC CAACCGCAGA GACAATCTGC GCGAAGCAAC TCTTCAGAAC 760 

CTTCAGCGTT ACGGATTCCC CTTTGCCGAT GAAGAACATT TGCTTACGAC CCATGGGCCA B4 0 

TCCGACAAAG AACCCCGTCG GCTCAAAATA CAAGAACAGT ATGAAATAGT ATI* G CT CAT A 900 

W GGAGACAACT TGGGCGACTT CCACCACTTC TTCAATACGA AAGAAGAGTC CGGACGCAAA 960 

CAGGCTCTUG GCCTGACAGC CGGGGAGTTT GGCCGGCACT TCATCATGCT GCCCAATCCC 102 0 

AACTACGGAT CTTGGGAACC GG CAT GG TAC GGCGGGAAGT ATCCGCCACT GCCCGAAAGA 1 OflO 

GACAAAGCAC TTAAACAACT GCACTCACAG AACAGCAGA mg 

{2} INFORMATION FOR SEQ ID NO: 24 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic} 

(ill) HYPOTHETICAL: NO 

(iv) AtlTI -SENSE: NO 

60 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIIAS GTHGIVALIS 

(ix) FEATURE: 

(A) NAME /KEY : misc^feature 
b5 (B) LOCATION 1...1278 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

7fl ^SSSr TTTCTAATTT TGTGGCCA TG AGCACCAATA TAGATGTACA ACAGATCAAA 60 

7U CA^CGTTTCG GCATCATCGG TAGCAGTCCG CTGATGGAAC ATGCCATACG AG TGGCAGCA 12ft - 

CAGGTGGCTC CTACCGACAT GTCCGTCCTC GTGACGGGGG AGAGCGGTTC CGGGAAAGAG 180 

TT^^ CA " AGATAATCCA CTACTACTGC GCCCGGAAAC ATCATAGCTA CATTGCAGTC 240 

I^nSS?^? CCATCCCCGA AGGAACCATC GATTGCGAGC TGTT CGGACA CCGCAAAGGT 300 

7 r T^™ C ' GAGCCGTATC GGATCGCAAG GGGTACTTCG AAGAAGCATC CGGCGGCACG 360 

75 ATCTTTCTGG ACGAAGTGGG CGAACTGCCT TTGCCCACGC AGGCGAGGCT GCTGAGGGTG 420 
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CTGGAGACGG GCGAGTTCAT CCCCGTAGGA GCCAGCCAGT CGCAGAAGAC GGATGTCCGT 4 80 

ATCGTAGCGG CCACGAATGT GAACCTCAAG GAGGCGGTAG CGAACGGGAA GTTCCGGGAA 54 0 

GACCTCTTCT T CCGGCTC AA TACGGTACCG ATCGAGGT GC CTGCGCTGCG TATGCGACCG 600 

GACGACGTGC CCTTGCTTTT TCGCCGATTC GCCGCCGACA GCGCCGAGAA GTATCGGATG 66 0 

5 CCTCCGCTGC GCCTATCGGA CGAAGCCCGT ACCATATTAA TGCGTTACCG CTGGCCCGGC 720 

AATGTGCGAG AGCTGCGCAA TATAACCGAC AGGCTGAGCA TCCTGGAGGA GGAGCGGACG 7 BO 

GTATCGGCAG AGACCATCAC TCGCTACCTG GACGCTGAGG GGATGCAAGA CCTCCACCCC 840 

GTCGTGATCC GACGGAACGA AACGACCGAA GCGGACAAAC AAATCCCCCA TTACGAGCGC 900 

GAAATCATCT ACCAGGTGCT ATACGATATG A«\GAAAGAGA TAGCCGATTT GAAGGGGATG 960 

10 ATGAACCGCC TGGCGCACCA CGAACAGCCC TCATGGCCTG TAGGGTCGGA CGTCTGGGGC 1020 

AACGACGACA ACCGCACCGC AGATCCGAAG TGGGGCGTCA GCACGCACAA GGCCCCCATC 10 BO 

GCGAACGCGG C7\GAACCCGT GGAGCCGATA CAGGAAGCCA GCGAATACAC CGAGGATCCG 114 0 

GTTTCGCTGG AGGAGGTAGA GAAGAAAATG ATTTCCCTTG CATTGGAACG CCACCGCGGA 1200 

AGGCGCAAGC AGACAGCCGA GGAACTGAAG ATTTCGGAGC GGACACTATA CCGTAAAATCT 1260 

15 AAGGAGTATG GACTGGAA 1278 

(2) INFORMATION FOR SEQ ID NO: 25 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 
(D ) TYPE: nucleic acid 

(C) STRANDE DNESS : double 

(D) TOPOLOGY: circular 

25 

<ii> MOLECULE TYPE: DUA [genomic) 

(iii) HYPOTHETICAL: HO 

30 (IvJ ANT I -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: F-ORYPHYROHONAS GINGIVALIS 

35 tlx) FEATURE : 

(A) NAME /KEY; mi sc_f feature 
(BJ LOCATION 1...1959 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 



AAAAATCTTC CGAGGTACGC TCCGGCATTT GGCAGAAATC GTAAAAGGGA ACAACCTGAC 60 

CCTGACAACG ATGATAGTCG TGGGAGAAGC CATAGACAAT CGGGAAGGGC TGTCACGGTT 120 

GTATGCCGAC GAATTCAAAC ACCTCTTCCG CACATGATCC TACTCTTCGG CGGTACTACG 180 

GAAGGCCGTG CCGCAGCTCG CGTGCTGGAT GAAGCGGGAA GTCCGTTTTT CTACTCCACC 24 0 

45 AAAGGCAATC TGCAAGAGAT CCAGAGTAGC CACGGCCATC GTCTGACAGG AGCCATGACG 300 

GTTGCCGACA TGGTTTCGTT TTGTCGGAAA GAAGAGATCC GACTGATCGT GGACCCCGCT 360 

CATCCTTTCG CCGAAGAATT GCACGCTTCA GTGGCAGAAG CCTCTGAACA AACAGGTATC 420 

CCCGTAGTAA GATACGAGAG ACAATACCCT CCACGCGAAG AAGGTATCGT CTGGTGTGCA 4fl0 

AACTACGATA CGGCTGCCGA GCGGATGCTT GGCGATGGCG TGCAGCGTCT GCTGA.TGCTC 540 

50 ACAGGAGTGA ATACGATCCC CAAGCTGGCT GCTTTCTGGA AAGAGCGCAC CACCTTTTGC 600 

CGCATATTGA AGCGAGACGA ATCGGTTGCT TTGGCAGAGA AGAACGGCTT TCCTGCGGAG 66 O 

CGCATCGTTT TCTTCGAACC GCATGCGGAC GAGGAGCTGA TGCAAGCCGT TCGCCCCGAT 720 

GCCATTATCA CAAAAGAAAG CGGAGAGAGC GGTTACTTCC GAGAAAAGAT AGAAGCTGCC 780 

CGACGGATGG GCAT CCGTAT ATATGCCGTC GTACGTCCCC CTTTGCCTCC TTCATTCATT 84 0 

55 CCCGTAGGCG GGCCTGTCGG TTTGAGACGG GCGGTAGAAC GCCTCGTGCC GGGATTCTTT 900 

TCACTCCGAA GCGGATTCAC TACCGGCACC ACAGCTACCG CTGCAGTAGT AGCAGCCATG 960 

TACCGATTGA TGGGGCTTGG CTCTCTCGCC GAAGCTCCCG TAGAATTGCC TTCGGGCGAA 1020 

ATAGTCAGTC TGCCCATAGC GGAAATTCGA GAGGAAGAAG ATGCTGTCGT ATCCGCAGTC 1080 

CTGAAAGATG CAGGTGATGA TCCGGATGTG ACCAATGGCA TGGCGGTATG CGCTACGATC 114 0 

60 AGGCTCAATC CCGAACATGA GGAAGTCCGC TTCCTGCAGG GTGAAGGGGT GGGGGTAGTG 1200 

ACGCTCCCCG GCCTCGGTCT GGAGGTCGGA GGTCCGGCTA TCAACCTCGT ACCTCGACGA 1260 

ATGATGACAG CAGAGGTACG CCGACTCTAT GCGCAGGGAG GTGTGGATAT TACGATTAGC 1320 

GTACCCGAAG GCCGAGAGGC TGCTACCCAG ACATTCAATC CCCGACTCGG CATACGGGAC 1380 

GG CAT CT CT A TTATCGGAAC ATCGGGAGTC GTGAAACCTT TTTCGGCCGA AGCGTTCGTT 14 40 

65 GGTGCCATCC GTAAGCAAGT GGGTATTGCC ACCCCCTTGG GAG CCAATCA TATCGTCCTC IS 00 

AATTCGGGAG CCAAGAGTGA GCGTTATGTA AAAGGAGCCT ATCCGGCACT CATTCCACAG 1560 

GCCTTTGTGC AGTATGGCAA TTTCGTCGGC GAATCACTCA GTTGTGTAGC TTCCTTCCCT 1620 

TCTGTCCGTT CGGTAACGGT AGGAATCATG CT CGGCAAAG CAGTGAAACT CGCCGAAGGC 1680 

TATCTGGATA CGCACAGTAA AAAGGTAGTG ATGAATCGGG ATTTCCTGCA CGAACTGGCT 17 40 

70 CGTCAGGCAG GTTCTTCGGA AGA CATC CAT GCCATAATAG ACAGCCTGAA TTTGGCTCGT 1800 

GAGCTATGGA CTATGCCGAG TGCGGAGGAC AGCGATCGAC TGCTACGAAA GATTGCCGAA 1860 

CGATCTTGGG AAACTTGCCG CCCATCGGTA CCATCGGCCG AATTAGAACT CCTGCTGATC 1920 

GATGAGTCCG GAGCGATTCG TTTTCGTATC GGTGGAGAA 1959 

75 
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(2) INFORMATION FOR SEQ ID NO: 26 

(1) SEQUENCE CHARACTERISTICS: 

(A> LENGTH : 1353 base pairs 
3 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

1Q t i:L » t IOLECULE TYPE: DMA (genomic) 

(111) HYPOTHETICAL: NO 

(iv| ANT I -SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALTS 



20 



60 



tlx) FEATURE ; 

(A) NAME/KEY: raise feature 
{B> LOCATION 1...1353 



(ii) MOLECULE TYPE: DNA (genomic) 
<iii> HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO, 



(vl) ORIGINAL SOURCE: 
g 5 < A > ORGANISM: PORYPHYROMOHAS GINGIVALIS 

( ix J FEATURE : 

(A) NAME/ KEY : misc feature 

(B) LOCATION 1 . - .1467 

70 < x± ) SEQUENCE DESCRIPTION: SEQ ID NO:27 

^S^SZr TC AAAAGAAAGA CCCTTTTAAT ATGAAAAGAA TACAACTAA- TCTTATCG^T 



PCT/AU98/01023 



(Xij SEQUENCE DESCRIPTION: SEQ ID NO: 26 

25 ~l ™i iiii ~- — — ssssss ^ 

so ESS G G £=S iiEiiE 

SESS SS55SS ssssss sssss sssE iEi 

aSg£a tgatcgg^? S£?S^ 

seek iiE iEr sseez sssss 
SEES essss sessse ssss sssssssssss & 

40 SS S S S S = 1020 

^ r oi^l^WjAT GTGCCTGTliA CCAAAGGCAG CTCCGGAGTA 1080 
^GTGC-ScG 2££5?22 r£^ TCBC ^S^TGC GCGACTGTAT CCGATGCTCC 

TATAAGAGCT G^gIaaScc SSSSSI 0 TCCTTATGCG CGACACCTTA 1200 

45 TgS^S C^g1£££c AACGTGGTTG ACTGTATCGA ATGCGGTTCG 1260 

£££££ SSSS CTGCTGGATT ATATCCGCCA AGCCAAGAAG 1320 

„ (2) INFORMATION FOR SEQ ID NO: 27 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 67 base pairs 
<D) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
^ (D) TOPOLOGY: circular 



60 
120 
180 
240 
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GGTACCAGCC GGGTGGAATT TCCTTCGGTA GTCATTACAG GCCGCAATAG AACAAAAGCT 300 

CTCAAGCGTG AAATCG CATT TAGTT CGGCT TTGCCCCAAG CAAAACATGC AGCTCAATAC 360 

ATTCGCCGTC ATAATGGGAA GAGCGAGCAG TTTGCTTTTA CAGGAGAACA TGCTTATGCA 4 20 

TCATGGATGA TGGATGCCAA GTTTGTGGTT CGTGAGGAGG TACGAGGTTG TGCTAAATGC 4 00 

5 CCTGTAGGTC TCTCGAGTAA TATTGTTCCT TTTGATCCAC TCTTCAATCC GGCAGAGGCT 54 0 

CCTTATTTGT TGGCACACAT TACTCCGGCA GAAGAAGTGG AAAAACAGCG AGAGTCCAGC 600 

TTCGATG CTT ATATCAACTT CAAAGTCAAT AAGGCAGAT G TCCTTCCTGA GTATCGCAAC 660 

AATAAGGCGG AGTTAGAGAA AATCAAAGAA TTTGTAAGCA CCGTTAAGGC TAATCCAAAC 720 

TATTCGGTCA ATAAAATGAT CATCGAAGGG TTTGCTTCTC CCGAGGCTTC AATAGCCCAC 7 BO 

10 AATAAGGCTT TGTCG GAGCG CCGTGCTAAA AGACTCGCGG AAGAATTGGT GCGTAAGTAT 840 

GGCAAAACAT TGCCGAATAT AACCACTGAA TTCGGCGGTG AAGATT GGAA GGGGCTGAAA 900 

CTGGCTATCG AAAAGAGTGA TATAGCCGAT CGTGACCGCG TATTGGAGAT AATCAACTCC 960 

GATAAATATG CCGATGATGA TGCACGTGAA CAGGCTCTGA AGCAACTTTC GTCTTATCGT 1020 

TATATCTTGG ATCAGATCTA TCCGAATTTG CGTCGCAATA CGATAACCAT GGGGTATATC 1080 

15 GTTCGTGATT ATACCCTCGA AGAAGCTCGT GAAATCATTA AGACTGCTCC GAAAGAACTT 114 0 

AGTGAGGCCG AAATGTACCG TGTGGCAATG TCTTATCCTG AGGGGCACCA AGAGCGTTTG 1200 

TTTGCTCTGA ATACGACCCT TAAGTATTTC CCTGAAAGTG TAACGGGCCG AATCAATTTG 1260 

GCTGTAGCCG CTTTTAATGG TGGAGACGTT CAACAGGCAA TTGCTCTGTT GAGTCCGATT 1320 

CAGACAGAAA AGGGTGTAAG CAATATCCTT GGAGCTGCTT ATGCTCGTAC GGGAGATTTT 1360 

20 GCTCGTGCCG AAACCTTCTT CCGTAAGGCC GTTGCAGAAG GAGATGCAAA TGCGCAGCGC 144 0 

AACCTCGATA TGCTGCTTGG CAAAAAG 14 6/ 



25 



35 



{2) INFORllATION FOR SEQ ID NO: 28 



(1> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 

(C) STRAW DEDMESS ; double 
30 (D) TOPOLOGY: circular 



till I lOLECULE TYPE: DIIA (genomic) 
(ill) HYPOTHETICAL: HO 
tiv) A11TI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS G1NGIVALIS 

40 

fix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION l..,1152 

45 txi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



GACATGGCAG AAAAAAGAGA CTATTACGAA 
GATGAACTGA AAAAAGCATA TCGCAAGAAG 
GGTGACAAGG AGGCCGAAGA GCACTTCAAA 

50 GATCCGCAGA AGCGCAGTCA AT AT GACCAG 
GGTGGAGGTT TCAGCGGAGG CGGTATGTCC 
CTATTCGGTG GGTTCGGCGG TTTCGGCGGA 
CGTGTTCGCA GAGGGTCTGA CCTGCGAGTA 
AAAGGTG'VGG AGAAGAAAGT GAAGGTAAAA 

55 GATCGOACGG AAGAAGCCAA TGGCAAGACT 
GTTACACGTG TGAGCAACAC TTTCCTTGGG 
TGCCACGGAG AGGGTGAGAT CATCACGAAG 
GAGATCGGCG AAGAGGTGAT CTCATTCCAC 
ATGTCCGTGA ACGGCAAGGG AAATGCCGCG 

60 GTCGTGATCG CCGAGGAACC GGATCCGAAT 
AATCTGCTTA TATCCGTTCC GTTGGCTATA 
GACGGACGAG CCAAGATCCG CAT CGAGG CG 
CGCAATAAGG GGTTGCCCAG CGTAAACGGC 
AATGTCTATA TCCCCGAATC GATCGATGCC 

65 AACTCGGACA GCTTCAAACC TACCGATGCT 
GAGATGCTGG AT 



GTCCTCGGTG TATCGAAGAA TGCCACCGAC 60 

GCTATCCAAT ACCATCCTGA TAAGAACCCC 120 

GAGGTAGCTG AAGCCTACGA CGTATTGAGC 180 

TTCGGCCATG CCGGATTGGG CGGAGCTGCC 240 

ATGGAGGATA TTTTCAGTCG CTTCGGTGAT 300 

TTCTCCGATA TGGGCGGTGG CAGTCGCAGA 360 

CGAGTGAAGC TTTCTTTGGC CGATATAAGT 4 20 

AAGCAGGTAG TGTGCAGCAA ATGTCGTGGC 480 

ACCTGCCAGA CCTGCCATGG AACCGGCGTG 54 0 

GCCATGCAGA CCCAGAGCAC TTGTCCCACT 600 

CCATGCTCCA AGTGTAAGGG CGAAGGTGTG 660 

ATCCCTGCCG GTGTAGCCGA AGGAATGCAA 720 

CCCCGAGGAG GCGTGAATGG CGACTTGATA 7 80 

CTGATCCGCA ATGGCAACGA TCTGATATAC 84 0 

AAAGGAGGTA GTGTGGAAGT GCCGACGATA 900 

GGGACACAAC CCGGCAAGAT GCTGCGTTTG 960 

TATGCCATGG GAGACCAACT GGTGAATGTC 1020 

AAAGATGAGC AGGCTATCGC AGCGATGGAA 1080 

GCTCGTAAGG ATATAGACAA GAAATACAGA 1140 

1152 



(21 INFORMATION FOR SEQ ID NO; 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 baee pairs 
t B > TYPE: nucleic acid 
(C| STRANDEDHESS: double 
(D) TOPOLOGY : circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



19 / 490 



10 



(ii) MOLECULE TYPE: D1IA {genomic) 

(ill) HYPOTHETICAL: NO 

liv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PGRYPHYROMONAS GINGIVAL! S 

( ix ) FEATURE : 

(A) NAME/ KEY: misc_feat\jre 

( B) LOCATION 1 ... 92"? 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

CGATTCGATT CAACAACTAA TGTCTCACAA ATTAATTTAA GAACAGAGAT GAAAAAACTG 60 

ATTTTAGCGA CTTTGGGACT TATGGCCATT GCCATGCTCT CATGTTCAAG CAACAACAAG 120 

GATTTGGAGA ACAAAGGGGA GGCTACTCTT TTGGTAACGT TTGGTAGCTC CTATAAAGCT 180 

20 CCACGCGAAA CCTATGCGAA GATTGAGAAG ACTTTTGCCG CAGCTTATCC CGATCAAAGG 24 0 

ATAAGCTGGA CATACACGTC TTCTATTATC CGAAAGAAAC TGGCTCAGCA GGGTATTTAT 300 

ATCGATGCTC CGGATGAGGC TTTGGAGAAA TTGGCTCGTC TGGGTTATAA GAAGATCAAT 360 

GTACAGAGTC TTCATGTGAT TCCCGGCCGA GAATATGATG AGATGATCGA CTTTGTCAAT 42 0 

AAGTTTAAGG CAGCACATAG TGATATTACT GTGAAGGTAG GGGCTCCGCT TTTCGATACC 48 0 

25 GATGAAGATA TGCGCGAGGT GGCAGAGATC TTGCACAAGC GTTTTCAGCA AACGATAGAG 54 0 

AAAGGTGAAG CTATTGTATT CATGGGACAC GGCACCGAGC ATGCTGC.CAA TGACAGGTAT 600 

GCCCGTATCA ATAAGATCAT GAAGAACTAT AGCAAGTTCA TGATCGT CGG AACCGTCGAG 660 

TCCGATCCCT CTATCAATGA TGTTATTGCC GAACTGAAAG AAACCGGTGC CACGGCCGTA 720 

ACAATGATGC CGCTGATGAG TGTGGCAGGC GACCATGCTA CGAATGATAT GGCCGGAGAT 7 80 

30 GAGGACGATA GCTGGAAGAC GTTGCTGACC AATGCCGGCT ACACAGTTTC TATAGACAAG 84 0 

CTGGACAATG GCAATTTCTC AGCTCTTGGA GATATAGAAG AGATCCGGAA TATCTGGCTC 900 

AAGCATATGA AAGCCACCTC TGCTCGC 92*7 



35 (2 J INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 
<S) TYPE : nucleic acid 
40 (C) STRANDEDNESS: double 

|D) TOPOLOGY: circular 

(ii) IIOI.F.CULE TYPE: DMA (genomic) 

45 tiiii HYPOTHETICAL: MO 

(iv) ANT I -SENSE: HO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM; PORY PHYROMONAS GIWGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...1473 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



CGGAGAGATG CGCGGAGGCA ACTGGTACGC CGTAGGAGGA AAGAGCTATC TGGCACAGCA 60 

AATACGCGAT GCCGGAGGAG AGTATTTCCT GAAAGACGAT CAGCGATCCG GTGGTGTTTC 120 

60 CCTCGACTTC GAGACGGTCT ACAGCCGATC CGATT CGGCT CGCTACTGGC GAATCCTCAA 180 

CAGCTATCCC GGGGAGTTCG GTTACGAAGC TTTGAAAGCC GAAGACAGCC GATATGCCGA 24 0 

TTTCAGGGCA TTCAAGGAAA AAGGCGT GAT ATACTGCAAC CTGAGGGAAA CAGCCTTCTA 300 

CGAACTCATG CCCATGCATC CCGATTGGGT GCTGGCCGAT CTTATCGCTA TCTTGCACCC 360 

CGGACTACTT CCCGACCACC AACCGCATTT CTATTATTTG CTCCAATGAC ATCCGTCAGC 420 

65 CACTTACGTA CAATTTCTGT CGCAGGTATC CTGGCTGCGC TGGGAGGGGC TGTACTCATT 4 80 

CTCTTCGGGG TTAATCTCTT CCTCGGCTCG GTGGCTATTC CGATGAGCGA GATCTTCCGA 54 0 

CATCTTTTTT CAGATCGTCC CGAAGGAGGA GAAGCACTCG T G CACTACAA TATCCTATGG 600 

AAATCCCGCC TGCCCGAAGC CCTCACGGCT GCTTTTGCCG GCGCAGGTTT ATCCGTTAGT 660 

GGCTTGCAGA TGCAGACCGT CTTTCGCAAT CCTTTGGCCG GT CCGTCCGT TCTCGGCATC 720 

70 AGCTCCGGTG CCAGTTTGGG TGT TGCTTTG GTCGTTCTGC TGAGCGGCTC GCTGGGAGGA 7 80 

GTGGCATTGA GTAGCCTGGG TTATATGGGC GAGGTGCCCA TGAATATAGC CGCTGCCGTA 84 0 

GGCTCGCTGG CAGTAATGGG GCTGATCGTT TTTGTCAGCA CCAAGGTGCG CAGCCACGTT 900 

ACGCTGCTCA TTATCGGCGT TAT GATCGG A TATGTAGCCA CTGCCGTCAT CGGGGTATTC 96C 

AAGTTTTTCA GTATCGAAGA AGATATTCGG GCATACGTAA TTTGGGGGTT GGGCAGCTTT 1020 

75 TCCCGTGCCA CCGATTCGCA ACTGAGTTTC TTTGOCATTC TGATGTTGAT CTTTATTCCG 108 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



20 / 490 



GCCGGTATGC 
AATCTGGGAC 
ATCGCTACCG 
TTGGCACG-3G 
ATTGGAAGTG 
GCTTTGCCCG 
TTCCGGCGCA 



T CCT TGYCAA 
TGAATACTCG 
TCACGGCCTA 
TTATCTTTCA 
CTCTGGCTCT 
TCAATTCCGT 
GACGCTTCAA 



GCAGTTGAAT 
TCGGGCACGG 
TTGCGGTCCC 
CACATCGGAT 
TTTCTGCAAT 
AACGGCTTTG 
GGAAGAAACC 



CTCTTATTGC 
CTGCTCGTGA 
ATCGGCTTTT 
CATCGGATCC 
ATCATTGCTC 
GTGGGAGCAC 
GAC 



TGGGAGAAAG 
TCTCTTCCGC 
TGGGGATGGC 
T GAT G CCT GC 
GTATGCCGGG 
CTATTATCGT 



CTACGCACGT 
CGGTTTGCTC 
TGTGCCACAC 
TACCTGTTTG 
GTTTGAGGGG 
CACCGTTTTG 



1140 
1200 
1260 
1320 
1380 
1440 
1473 



10 (2) INFORMATION FOR SEQ ID NO: 31 

(zj SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228? base pairs 
(BJ TYPE: nucleic acid 
15 (C) STRANDEDNE5S: double 

{ D) TOPOLOGY: circular 

{ii> MOLECULE TYPE : DNA (genomic) 

20 ti^i) HYPOTHETICAL: NO 

(iv) ANTI-SENSL: NO 

(vi J ORIGINAL SOURCE: 
25 (A) ORtiANISH: FORY PHYROJ lONAS GINGIVALIS 



30 



35 



40 



45 



50 



55 



60 



65 



70 



( Lx ) FEATURE : 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1...2289 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 31 



CATTTTTTAG 
GCTCTATTGT 
GATGCCGAAA 
GTAGCCGGTG 
ACTATTATAT 
GGACAGACCA 
GTCGTTACCG 
CTTACCGCTA 
CTGAGTCCGT 
CTTAGCAGTA 
GGTCAGGCCG 
GGTGCTTCGA 
AAAAAGAATA 
CGGCAAACCA 
TTCTTCTACC 
TCCGGCGAAC 
GGTG'J'AAGCC 
CAGTACAATA 
GATTATCGTG 
ACGCTTTCTT 
GACAGCAGTG 
GGTCAGCTAC 
ACACTGCCTT 
TCTCCCTATA 
GATGAATGGA 
CAGGAGTT CG 
AACTTCCGCG 
CGGAACGAAC 
CCACAGATGT 
AGTGCAACGG 
TCACCCGAG C 
GCTCGCAGCC 
GGAGCCGGAT 
GCTCCACGTC 
CTTGGCGTGA 
CCGGACTATA 
ATCCTGGATG 
GGTGTCAATT 
TTCAACAAC 



TAACGATTAT 
CGTCTTCTCT 
CCGGAGAGCC 
CCGATGCCGG 
GTTCGTTGGG 
AAACGATCTC 
GTACCGGTAC 
AGGACATAGC 
CTTTTGACTT 
AGTATATCCT 
ATTTGAGTCG 
GTTCGCTCTA 
CGAATCGACT 
ATACTTCGCT 
ATACGGAJ'GG 
CGGTCTTGGA 
AATCGCTTTC 
AACGTCAGAT 
CrTTGACGGC 
TCGATGCCGT 
AGAGCCTGAT 
GCAATAAAAA 
ATGCGCAGAA 
ATTTGATTAC 
AACCGCTCGA 
GTACACGAAT 
CTACGTATGC 
TCACCACTAT 
CGGATTATTA 
TTTATGACAA 
ACGAAGCTCA 
GCGCCCTTGA 
ACAGCCTCGT 
ATCGTGCCAA 
GCC1TTTCGG 
CCTTGTGGCG 
GAACGCTCGG 
ATGCTACCGT 



GCGGACAAAA 
GTCGGCTCAG 
TCTTGCCGGT 
CGGACATTTC 
GGGGTATGGA 
TTTTGCATTG 
ACGTTACCGC 
CTCTTTCTCG 
CGGCCCCAAT 
CATCCTTATC 
TATTTCTCCT 
CGGATCCGAT 
GAGTGCATAT 
CGATATAAAC 
CTGGCAGAAT 
GGAAACGTAT 
CTATTATGCA 
CTTCACTCCG 
TTCACTCGGT 
CTACGATCGC 
CAACAACCAA 
CGATCAGATC 
ACT GACCGGC 
CGACAAGGCA 
TTGGTT CAAT 
GACGCCTAAG 
TAACGGCTAT 
GGGTTCGCAC 
TGCTTTGGGC 
TGAAOTTCGC 
GGGAATCAAG 
TGTCCTATGT 
GGAAGCTAAG 
TGTGCACGCC 
CCGTATT CAG 
ACTCGCCACA 
TATAGACAAC 
AACGCCGGGA 



ACTATCTTTT 
AGCAAAGCCG 
GCTCGAATCG 
GAGATCAAGA 
CAGAAAGAGG 
CGACTGCGAA 
TTGGTCGATG 
GCTCCTACTT 
CTGATGGGCT 
GATGGTAAGC 
GATCAGATCG 
GCCATCGCCG 
ACGTCACATC 
ATCGGTAAGT 
AGTCCGTTCG 
AAGAAAACTT 
ACTAACAATC 
ACTTTTTCCG 
ACGAACTATC 
TTCCGTTTCG 
GGTCAGACCG 
CGATACACGG 
GGTTTGGAGT 
GATGCTTCCA 
ATGACAGCCG 
GTATCCATAC 
AAGACTCCCA 
AATCTCTATC 
TTGGAGTACA 
AATCTGATCT 
AAAACCAAGC 
GATGCCTCTA 
AATCTCCAGA 
GATTGGGTTC 
AGCGAGCGTT 
TCGCATCGTT 
CTGTTTGACT 
CGTACTTTCT 



TTGCGATTAT 
TTTTAACCGG 
AAGTCAAACA 
ACCTGCCGGC 
AGGTGGTTGC 
CGAACAACTT 
CTCCTGT GGC 
CCGAGGCCTT 
CTTTCATGCA 
GTGTGTACGG 
AACGGATCGA 
GGGTAATCAA 
GCATATCGAA 
TCAGTAGCAA 
AAATAAAAAA 
TTCGTGCACA 
TTAGCTTCAG 
AAAAGAAGCC 
TTTTCCCCAA 
GATATTTGTA 
AGCAACCCAC 
CAGAGGCTCG 
ATTTCCGTGA 
CGCTCTCTGC 
GTTTCCGTCT 
TCGCCAAGTA 
CGCTGAAAGA 
TCGGCAATGC 
ATCAAGGCCC 
CCTTTATGGA 
AGTATGCCAA 
TCGGTTGGGG 
CGGATGAGTG 
ACTACTGGCG 
ACTACAAAGA 
TCGCTCATTT 
ACGTGGATGA 
TTGCTCAAAT 



CTCTTTTATT 
TAGTGTGTCG 
CACCAACATA 
AG GG CAG CAT 
CATCGAAGCC 
GGAGGAAGTC 
AACGGAAGTC 
ATTGCAGGGG 
GCTGAACGGC 
CGATGTAGGC 
ACTGGTGAAA 
TGT G A't'CACA 
GTACAACGAT 
TACCAACTAT 
GAAAAAAGGA 
GGAAAATCAG 
CGGAAATGTG 
CTATGACATG 
TGGTCTGCAT 
TCATGACAAG 
ATTCTTTCCG 
CGGTGTATTT 
GGAATTGATC 
TTATGTACAA 
GGTACACCAT 
TGGGCCGCTG 
GCTTTTTGCA 
GGATCTTAAG 
TATCTCGTTC 
TATACCGACC 
CATAGGAAAA 
TAT CAAGTT A 
GCTGGAAGGA 
TCAGTATAGA 
CGGCAATGCT 
CCGCCACATC 
TCGTCCTATG 
AGCGATTCGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2289 



75 



t2) INFORMATION FOR SEQ ID NO: 32 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



10 



15 



20 



WO 99/29870 PCT/AU98/01023 

21/490 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109S base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 

Ui> JIOLECULE TYPE : DNA (genomic) 

Uii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

{ ivt J FEATURE : 

(A) NAME/ KEY : raisc_f tsature 

(B) LOCATION 1...10 95 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



45 



50 



55 



60 



65 



(2) INFORMATION FOR SEQ ID MO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
{D> TOPOLOGY: circular 

<ii> MOLECULE TYPE: DNA (genomic V 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHCIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : miscf eature 

(B) LOCATION 1. . .960 

(Xl) SEQUENCE DESCRIPTION: SEQ ID HO: 33 

CCGCAATCCT CTCCTGATCG AAGAAGCTTC CAAAACGTCA TGAATAAATA TCATTCTCAA 
AGCGTTTTAG AGGTCGGCAA AATTGGGATT GTGATTATCT TTGCGCCCAT AGTACGGAAT 
GTACATCAAC! AnrrrrrrrT TT'i-n^^.n, _ _* ACtaOAAT 



60 
120 



TGCT GCGCAA GGCTTCGGGA CACCTCTCCC CGGAAGAAGT CGTTTCGGAA TGATACCGTT 

J™?IT ATT TCGTTATTCA TCTGATCAAA CAT ATT AT CA TTATGACGGA CAACAAACAA 120 

2<S ?^1™Z*1^ TATTCCCGGC GTTTCTCCTC TTGCTGGGAG TCATCGCACT GGTGACGATC 180 

25 GTTGGTTTTT TCAT GCTCAG ACCGGCCGAG GAGATTATCC AAGGACAGAT AGAAGTGACC 240 

GAATACCGAG TGTCCAGCAA AGTGCCCGGG CGCATCAAGG AACTTAGGGT ATCCGAGGGA 300 

J A ?S A f 5TGC AGGCCGGCGA TACCCTCGCT GTCATCGAAG CCCCCGACGT AGCGGCTAAG 360 

5*™?^° CAAAGGCTGC CGAAGCAGCT GCACAGCCTC AGAACGCCAA GGCTCTCAAA 420 

GGA'jCACGCA GCGAACAGAT ACAGGCAGCC TATGAGATGT GGCAGAAAGC TCAGGCCGGC 4 bo 

30 GTAGCCATAG CGACCAAGAC ACACCAGCGC GTGCAGAACC TCTATGACCA GGGAGTGGTA 540 

^^^ GA AGTTGGACGA AGCCACTGCC CAGCGCGATG CGGCCATCGC TACGCAAAA^ 600 

CCCAGTACAA TATGGCTCGC AACGGTGCCG AACGCGAAGA CAAGCTGGCA 660 

° TCGTCG ^AG AGCGAGAGGA GCCGTCGCCG AGGTGGAGTC GTACATCAAC 720 

^ ^pSSS TCATCGCCCC ACGGGCAGGC GAAGTGTCGG AGATATTCCC CAAAGCCGGC 780 

n^S^ 2I A 2 CGGCGC ACCTATCATG AATATCGCCG AGATGGGCGA TATGTGGGCC 840 

AGCTTTGCCG TTCGTGAGGA TTTCCTCAGC AGCATGACCA TGGGAGCCGT TCTGGAGACT 900 

GTGGTGCCGG CTCTGAATGA AGAAAAAGTA CCCTTCAAGA TCACATTCAT CAAGAACATG 960 

GG IACCTATG CTCCCTGGAA AGCGACCAAG ACAACAGGGC AGTACGACCT GAAG^C^C llto 

40 GTG^TCATAC GCMG^^ AAGGCACAAA AGCTACGCCC GGGTATGTCC loBO 



1095 



60 

GTACATCAAC T^GCCAT A^TCA^ £SS£SS SSSSSS HI 

ttgttcgtct ctttttcggt tttgcttttt gcatcatgcc GTTCCCAGCG agaaa^ggtc Ito 

/U CTTTACCTGC AAG AT AT CCA AACTTTT&at 

360 



CTTTACCTGC AAG AT AT CCA AACTTTTAAT CGGGAGATTA TCGCTAAACC SSSSSS!?? 

AGGACGATGT GCTGAACATC CTTGTCAGCA GTAGAGACCC GG AGCT T T CA 360 

ACGCCCTACA ACCAAGTGTT GACCACTCGT GCACTGGCCC GCAACGGCTA TGGWVWC 420 

TCGAACGAAG GCTTCCTGGT CGATTCGAAA GGGTACATCA ATTAT CCTAT TTTAGGCCAG 4 80 

ATCTATGTAG AGGGCCTTAC TCGTACCGAA CTGGAGAAGG AGATACAGAA GAGGA™^ 540 

75 TCCAGTGGAT TTATCAAGGA T CCT ACGGT A ACGGTGCAGC TTCAAAATTT CAaS" too 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



22 / 490 



GTTTTGGGAG 
TTGGAAGCGA 
GTGATTAGAG 
TTGCTCGCAA 
GACAAGAAAA 
GTTACCTCCA 



AGGTGAATCA 
TCGGAATGGC 
AAACCGATGG 
GCCCCGTGTA 
CACAGATGAG 
CTTTGGTATC 



TCCGGGTTCG 
CGGAGACCTG 
GCATCGCGAG 
CTATCTGCAT 
CGAGATCAAC 
CATTTCCACG 



ATGTCGGTAA 
ACAATCTATG 
GTTTTCCAGA 
CAGAACGACG 
CAGAATAATA 
CTGACGATTA 



AAGGAGAGCG 
GTCGCCGCGA 
CGGATCTCAG 
TCATCTATGT 
ACGTAAACGT 
CGATAATAGA 



AATAACTCTT 
TCGGGTTTTT 
AAAGGCCGAC 
GGAGCCGAAC 
ATGGCTGAGT 
TAAGACCAAA 



too 

720 
780 
840 
900 
960 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) IN FORI IATTON FOR SEQ ID NO: 34 

<i) SEQUENCE CHARACTERISTICS: 

(A ) LENGTH: 174 6 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAN DEDN ESS : double 

( D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANT T- SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; PORY PH YROMONAS GINGIVALIS 

Ux) FEATURE: 

IA) NAME/ KEY: ml sc_f ea ture 
(BI LOCATION 1...174 6 

(xi) SEQUENCE DESCRI PTIOM : SEQ ID NO: 34 



TCAAAAATAG 
TTATCTCTGC 
AAGACGAACA 
GTGCTTAGCA 
CATATGACTC 
ATTCCTTACG 
GGAGCTATCA 
ATGCCCGCAG 
GACTTCCGCA 
GTTGCAAAGG 
CGTCAAAAAG 
TATATCCGCT 
GATCTTCGTG 
CG ACT GAT GC 
GTAACGACCA 
ATCGACACGA 
ATTGTAGCCG 
GGCAAAGGGC 
ACGGCCAAGT 
AATCGGACAG 
GCCGGAAGAC 
ACAGCTGCGA 
GGTTATGTGC 
GACTATGCAG 
GGCAAGATGC 
AACTCGGAGC 
CGATTCAAAA 
GAGCGAGGCA 
CTGCTGAAGG 
AAAGGG 



TACTACGAAA 
TGGTGATCTT 
AACTCACCGA 
ATATTATCGG 
GGCGTGGTAT 
AGGAAATGGA 
TATCGCAGCG 
ACGAAGCAGG 
AATCCACCAC 
TGACAGTAAT 
TGATTATGAA 
TGAACAACTT 
ACAAACAAGG 
AGGCTGCTAT 
AAGGTCGCAT 
AACTCCCGAT 
GAGCACTGCA 
TTGTACAAAC 
ACTACATCCC 
GTATGGCAAC 
GTGTAGAAGA 
CATTACTTTA 
TCAAGCATAA 
CTTTCTGCAA 
TTGACAAACT 
TTAAAGCACT 
AGGAGATAAC 
GTATCCGCCA 
ACCATCCGGA 



GTTTTGTACC 
TATCACCGGT 
AGAAGATCGG 
TAACGTCAGG 
AGATGCGATG 
TGAACTGAAA 
CCCGGATAGT 
ATT GATAGCA 
ACCGAAAGTA 
GCGCTATGGC 
TTCCGTCACT 
TACGGACAAA 
AGCGAAAGGT 
CGAGATAGTC 
TGCAGAGTCG 
AGTAGTCCTG 
GGATATGGAC 
GACTCGTCAG 
AAGCGGACGT 
GGCCATTCCT 
TGCAGGAGGA 
TTATATGGCC 
AACGATTGCC 
GATGATGGAA 
GGAGGAACTG 
ACGCGAAAAG 
AAACTATCTC 
GAGTTTGCCG 
ACAAATTCGA 



TTAGCGCGCA 
AGTTTTATGA 
AGCCGCAATG 
CTGTATTTCG 
TTGGGCGGGC 
TTGATGACTA 
GCTGTGATTA 
GGCGACCGCA 
AGCCAAGCAC 
GAAACCAAAC 
TACAGCGGAA 
AGTGCAGAAG 
CTCATTTTGG 
AATCTGTTCG 
GCGTCCGTAT 
ATCGATGGAC 
AGGGCTGTAC 
CTACCATACA 
TGTATTCAGC 
GACAGTCTGC 
AT CCT GCCTG 
ATCAATAATG 
AAGCCGGAGG 
GAAAAGAAAT 
GCTAAGATAG 
CTAAAACCCA 
AACAATGAGA 
GAAGATAAGG 
CAGATCCTTG 



TGAAAAAGAC 
CTGCCTGTGC 
ACTATGTACA 
TCGATACCAT 
TTGACCCCTA 
CGGGAGAGTA 
TCCAGAGACC 
TCCTGACTAT 
TGAAAGGGAT 
CTCGTACTTT 
TGCTCGATGG 
AGGTGCGCAC 
ATTTAAGAGG 
TCCCTAAGGG 
TTCGCACATT 
AATCGG CAT C 
TGATGGGACA 
ACGGCGTGAT 
GTTTGGACTA 
ACAAAATCTT 
ACATCGAGGT 
ACGTTTTCGA 
AT TTTTCCAT 
TTGACTATGA 
AAGGCTACCT 
ACCTGTCGCG 
TTGTCACT CG 
TAGTCAAAGA 
CAGCTCCGAA 



CAATCTGTTT 
ACAGAAGTCC 
GTCGATGGAT 
AAGTATCAAA 
TACCGAATAO 
TGCCGGAGTC 
TATGGAAGGT 
CG AT GGG AAA 
AG CCGGT ACT 
TTCCGTGAAA 
CTCGATAGGA 
GGCCTTGTTG 
CAATGGTGGC 
CAAAGAGGTG 
GACTGAACCG 
TTCCTCGGAG 
AAAGAG CT AT 
CAAATTGACT 
CAGCCGCACC 
TTACACTGCT 
CAAACAAGAT 
TTTCGTCACA 
AACGAACGAG 
TCGCCAGAGT 
CCCGGAAGCC 
TGATCTGCTA 
CTATTATTAT 
AGCTATTAAG 
AGCAGAGAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7Q0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1 740 
1746 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 35 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 29bS base patrP 

(B) TYPE: nucleic acid 
tC> ST HANDEDNESS : double 
<DJ TOPOLOGY: circular 

UD MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



23 / 490 



(XV) AIJTI -SENSE: NO 

|va) ORIGINAL SOURCE : 

(A) ORGANISM; PORYPHYROMOHAS GINGIVALIS 

5 

(is) FEATURE: 

(A) MAI IE/ KEY : misc_featura 

(B) LOCATION 1...29S5 

10 (xi) SEQUENCE DF.SCRIPTION: SEQ ID NO: 35 

GTAACAGACA AAATGCAAAA CAAAGGATTT GTGATTGTTA TCACATCGGC TCTGGCCATC 60 

ATCTGTGCGT TTTACCTGTC ATT CT CTTTC GTTACGAACC GTTACGAAAA GAAGGCTAAG 120 

GCGATGGGCG ATGTTGCCGG AATGGCCTAT CTTGATTCCA TGTCGAATGA GAAGGTCTGG 180 

15 TTCGGCTACA CGCTGAAAGA AGCTCAAGCC CAGCAAATTG GTCTTGGCCT TGACTTAAAG 24 0 

GGGGGTATGA ACGTTATCTT GAAACTTAAC GCAAGCGATC TGCTTCGTAA CCTCTCTAAC 300 

AAAAGTTTGG ATCCCAACTT CAACAAAGCT CTGGAGAATG CTGCCAAGAG CACGGAGCAA 360 

TCCGACTTCA TCGATATTTT CGTGAAGGAA TATCGCAAGC TCGATCCCAA CGGTCGCTTG 4 20 

GCCGTTATCT TCGGTTCGGG TGACCTTCGC GACCAGATTA CCGCAAAGTC TACGGATGCA 4 80 

20 GACGTAGTGC GTCTGCTCAA AGAAAAATAT AATAGTGCTG TAGAAGCTTC GTTCAATGTG 54 0 

CTCCGTGCTC GTATCGATGC TTTCGGTGTG GTTGCACCTA ATTTGCAGCG ATTGGAAGGG 600 

CAAGGGCGTA TCCTTGTCGA ACTCCCCGGA GTGAAAGACC CTGAGCGTGT TCGTACCCTT 660 

TTGCAACGCA GTGCCAACCT ACAGTTCTGG CGTACATACA AATTCGAAGA GGTCAGCGGA 720 

GACTTGATCG CTGCCAATGA TCGTCTGAGC GAATTGGCTA TGAACAACAC GGATG C7TACC 780 

25 CCGGAAACAG AGCCTGCAAC TACTGACTCT GTAGCTGCAA CAGCCGATTC TGCTGCTGTA 84 0 

CAAGCTGTAG CTGATTCTGC TAd GTAGCA CAAAAAGAGG CCAAGGATGC TACTCGTAAA 900 

GACGCACTCT TCTCTCTGCT TACTCCCGTG AA'i'CGT GGCG GTGCAGTAGT GGGTGTGGCT 960 

CGTCGTGCTA ATATGGCTCA GATATCTGAA ATGCTCCAGC AAGCTCACGA TCTGAAGGTT 1020 

ACACGTGAAG ATGTG CTTTT CCTCTGGGGT GCTAAAGCAA TCGAAGACCC CGAAACCAAA 1080 

30 AAGGAGACCG ACCTCTACGA ACTCTATGCT ATTCGTACCA ATCGTACGGG AGATCCTGAT 114 0 

TTGGGAGGTG ATGTAGTGAC TTCCGCCAAG AGTGATATCC AAAATGACTT CGGTCGTTCC 1200 

GAACCGATCG TTTCG AT GAC GATGAATGAA GAAGGTGCTC GTAAATGGGC GCGTATCACA 1260 

AAGGATAACG TGGGACGGGC AATCGCTATC GTTTTGGATG GTGTGGTTTA TTCTGCTCCG 1320 

AACGTGAATG AT G AG AT CAC GGGCGGTCGC TCTCAGATCT CCGGGCACTT CACCGTGGAG 1380 

35 GAGGCCGGTG ACCTTGCCAA CGTACTCAAC TCCGGTAAAA TGGATGCTAC GGTAAGCATC 144 0 

GAACAGGAAA ACGTGATTGG TCCTACGCTG GGTGCCGAGT CCATTAAAGC CGGATTCTTG 1500 

TCGTTCCTGC TCGCTTTGGT TATCCTGATG TGTTACATGT GTCTGGCTTA CGGTTTCTTG 1560 

CCGGGTCTTA TCGCAAACGG CGCATTGATT GTAAACAGCT TCTTCACATT GGGCGTATTG 1620 

GCTTCTTTCC ATGCCGTGCT GACCCTCTCG GGTATCGCAG GTTTGGTGCT GACGCTGGGT 1680 

40 ATGGCTGTGG ATGCCAACGT ACTTATCTTC GAGCGTATCA AAGAAGAGCT TCGTGCCGGT 174 0 

AAGACTCCGA TTCGTGCCGT TACGGATGGT TATGGCAACG CTTTCTCTGC CATCTTCGAC 1900 

TCGAACGTTA CGACTATTAT TACCGGTATC ATCCTATTCC TCTACGGGAC GGGGCCGATT 1860 

CGCGGTTTTG CCACTACGTT GATTATCGGT CTTATCGCTT CTTTCATTAC GGCTGTCTTC 1920 

TTGACTCGTA TCGTCTTCGA GAAACTGGCG AAAAAAGGTC GTTTGGATAA GATTACATTC 1980 

45 ACT AC GAG CA TTACTCGCAA TCTCCTTGTC AATCCCTCAT ACAACATCTT GGGTAAGCGC 2040 

AAGACCGGCT TTATCATTCC GGTGATTATC ATCGTTTTGG GACTTATAGC TTCATTTACA 2100 

ATCGGTCTCA ATAGGGGTAT TGAATTCT CC GGAGGACGTA ACTACGTAGT TAAATTCGAC 2160 

CAGCCTGTAT CYTCCGAAGC CGTTCGTTCG GCCTTGTCTT CTCCCCTGCA GGAAAAGGTA 2220 

TTGGTTACCT CCATCGGTAC TGAAGGGACA GAGGTGCGTA TATCTACGAA CTATAAGATC 2280 

50 CAGGAGGAAA GCGAAGAAAC TGAAGCAGAG ATTACTGACA AATTGTATCA GAG C CT G AAA 2340 

GGTTTCTACA CCCAGCAGCC TACT G CT GAT CAGTTCTTGG AC AAT AT CAT TAGCTCTCAG 24 00 

AAAGTAAGTC CCAGTATGTC GAGTGACATC ACGAGAGGTG CTATTTGGGC TGTGCTGTTA 24 60 

TCGATGATCT TCAT GGCCAT TTATATTCTG ATTCGCTTCC GTGACATTTC TTTCTCTGCC 2520 

GGGGTATTCG TATCTGTGGC CGCTACTACA TTCTGCATTA TTGCTCTGTA TGCGTTGCTG Z580 

55 TGGAAGATTC TGCCCTTCAC CATGGAGATC GATCAGAACT TCATCGCTGC TATTCTGGCT 2640 

ATCATCGGTT ACTCGCTCAA TGACACCGTG GTTGTATTTG ACCGTATCCG AGAGACGATG 27 00 

AAATTGTACC CCAACAGAGA TCGCTATCAG GTGATCAACG AT GCCCTTAA TTCAACATTG 27 60 

GGTCGAACAT TAAATACGTC TTTGACTACG TTTATCGTTA TGTTGGTAAT CTTGATCTTT 2820 

GGAGGTGCTA CGATGCGTAG TTTCACGTTC TCGATCCTGC TCGGTATCGT TATCGGTACA 2880 

60 TACTCTACGC TCTTTGTTGC TACACCCCTT GCCTACGAGA TCCAAAAGCG CAAGCTCAAC 294 0 

AAAGCAGCTA AGAAA 295S 



65 



75 



(2) 111 FORMAT! 01 1 Ft)R SEQ ID NO: 36 



i\) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3138 bas* pairs 

(B) TYPE!: nucleic acid 

(C) STRAMDEDNESS: double 
70 (DJ TOPOLOGY: circular 



til) MOLECULE TYPE: DMA (genomic) 
(iiil HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



24 / 490 



(iv) ANTI-SENSE: HQ 



(vi ) ORIGINAL SOURCE: 

{A) ORGANISM: PORYPHYROMONAS GINGTVALI5 



10 



fix} FEATURE: 

(A) NAHE/KEi: raiscjeature 
{ B ) LOCATION 1...3138 

\m) SEQUENCE DESCRIPTION: SEQ ID NO: 36 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



GACTTCCCTT 
TTTAAGTTTA 
CTGACCAGCA 
TCCGAGGATA 
GGTGCTGCAA 
TTGAGAGTGT 
ATCGTACTGG 
GGACAGAAAC 
GAAAAGCCCG 
ATGACTACAT 
TTGGGGGCAA 
GTGGCTACGA 
ACATCTATTT 
AAAATGAGCG 
AATACTAAGC 
GCAGGTTTTT 
GCTGAAGATT 
CCAGTGGATT 
AGTCAAGGTG 
GGCTACTTCG 
CGGCTCAACT 
GCGATAGCGA 
TTCGGTGTGT 
GATGTCTATT 
ATGAGACCGT 
CCGATCAAAG 
TCTTCTAAGA 
AGAGCTT AT C 
GATGAAAAAC 
GTTATTGGGG 
AAAACCGGAA 
TTCTTTAGTC 
AATGACCAAT 
GGAATGTTTG 
CTGAAAATGA 
CTCGTTACTG 
AATCCCGACC 
TTCAATAATC 
ATTGATGTCC 
ATGAAAAATA 
AATGTATATG 
GGTCTCAATA 
TCGTTCTATA 
GTTCCTGGTC 
TTGGAGACAC 
GCTTCTTGGA 
ATCAATAATG 
AAAATGCTAT 
CAGTCTCCTC 
CTCAAACTCA 
GCTCGTGTCT 
CCTGAAGCAG 
GGTATTCAGT 



GGGTTAGGGT 
ACGCAAAAGA 
TTGGGT GGGC 
ATGAGCCCCT 
CCGACTTGGA 
CCTATTCCGG 
ATCCGGACTC 
TCAGCACTGT 
TTGCCAATAT 
CCGGTGACCC 
GCTCTGCACC 
TGAATCCGAA 
ATGGAGCTCG 
AGAGAGGTCG 
CCCTTGATAA 
GGGGGAACAA 
TGTATGGCAA 
TTAATCATGA 
ATATTTCTTT 
ATCAGGAAGG 
TCGAAAGTCG 
ATAGACGATC 
TAACGATGCC 
ACATGTATGG 
TCAGTTCCGA 
GCCTTACTTT 
GAATGCCCAA 
GAGATGTTAG 
ATGATCTTAC 
CATCTTCTAA 
ATAGTTTGTC 
GTTTTAATTA 
CCTCT CGATT 
ACATATATAA 
GTTATGGTAC 
TGAACAATTA 
TCTCGTGGGA 
GCTTATCl'GC 
CGATGCCTTA 
CGGGTGTAGA 
CTTCl'GCGAA 
AGTACATGTT 
TGGCTGAATA 
AAGTCGATGC 
GAATTGATAA 
AAGG ACT TTC 
ACCGTTACTT 
TGAATGCCTG 
AGTTTGATAC 
CCTATGTACT 
ATTTGATGGC 
GGGGGAATGT 
TGTCTTTC 



AAAACCGGAG 
AAAGTCTATG 
TATGGCCCAG 
GATCGGCGCG 
TGGCAACTTC 
TATGACTACC 
TAAGGTTCTG 
TTCCGGTTCT 
CATGGATGCC 
TACTGCCGTC 
ATTGTATATC 
TGATTTTGAA 
TGCTGCAAAC 
TATTACCTTT 
TATGATGACT 
TCAAACCGTT 
TTATGATTCT 
TGCAGACTGG 
CTCCGGAGGG 
TATGGCTCGT 
TATCAATGAA 
TGCCGACTAT 
TCGTTATTAT 
AGCTACCAGA 
ATCACATCAG 
AAAGGCACAG 
TAATCCGTAT 
CAAGTCTTTT 
AGCATTGATG 
AGGATTTGAA 
TT TGCCT GAA 
CGGTTTTGAC 
CGGATCCAAT 
TAAATTCATT 
AACGGGTAAC 
TACTGAAGAT 
AAAGCAGTCT 
AGAGGTAGAT 
TATCAGTGGT 
CCTTTCTCTT 
TTTCAACTAC 
GCCTAATACC 
TGCTGGAATC 
GGATGGTAAT 
GTCTGTTACT 
TTTAGATGCT 
TACAGAGAAT 
GACAGAGGAT 
GCATTTGTTG 
CCCCAATAGT 
GCGCAATCTG 
GGGAAAAAAT 



AAGAAAAGAA 
AAAAGAATGA 
AATAGAACCG 
AATGTCGTGG 
ACGCTTAGCG 
AAAGAGGTCG 
GAGCAGGTAG 
GTGGCCAAAG 
CTCCAAGGTC 
GCTTCTGTGG 
GTGGATGGTA 
TCTATGTCCG 
GGAGTCGTTT 
AATGCCAGTT 
GGAGATGAAT 
CAGAAGGTTA 
TTGAAAGATG 
CTCAAGGCTT 
TCTCAGGGAA 
GAACCGGCAA 
TGGCTGAAAG 
TTTGGAAAGT 
AACCCTTTTG 
CCTTCTATGA 
GCCAATGTAA 
GCTGGTGTTG 
GATTCTACTC 
ACAAATACGG 
GCGCATGAAT 
AGTGATAAGT 
CACAGAGTCG 
AAATGGATGT 
AATAGAAGCG 
CAAGAAAGTA 
TCGGAGATTG 
GCTATGGGGC 
CAGTTCAACT 
TTCTATGTCC 
TTCTTCTCAC 
AAGGGGACGA 
AATAGACAGG 
GGTACTATAT 
GACAAAAAAA 
AAAGTTACAA 
CCTCCTATTA 
GATTTTGCCT 
GCAGGTGGAT 
AATAAAGAAA 
GAGAATGCTT 
CTTTTTGCTG 
TTAACTGTTA 
CAATATCCTA 



AACAACATAA 
CGCTATTCTT 
TGAAGGGTAC 
TTGTCGGAAA 
TGCCTGCCAA 
CCATCGCTAA 
TTGTATTGGG 
TGTCCAGCGA 
AGGTAGCCGG 
AGATCCATGG 
TGCAAACTTC 
TTTTGAAAGA 
TCATTCAAAC 
ACGGGATTTC 
TGCTGGATTT 
AAGATATGAT 
AGTATGGTAA 
TGTTTAAAAC 
CTTCATATTA 
ATTTTAAGCG 
TTGGTGCAAA 
ATTATATGGG 
ATGTGAATGG 
CAGAACCGTA 
ATGGTTTCGC 
ATATTACTAA 
CTCTTCGGGA 
CTGAATATAA 
ATATTGAATA 
TGATGTTACT 
CTGAATATGC 
ATATAGATTT 
CGTGGTTCTA 
ATTGGCTCAG 
GTAATTACAA 
TTAGCAT TTC 
TCGGTTTGGC 
GCACTACGAA 
AGTATCAGAA 
TCTACCAAAA 
AAATAACAAA 
GGGAAATTGG 
CCGGTAAGCA 
CAAGCCAGTA 
CAGGTGGTTT 
ACATCGTTGG 
TGATGCAATT 
CAGATGTTCC 
CTTTCCTGCG 
GGCAGAATGT 
CGAAGTATAA 
ATTCTAAGCA 



CAGTAATAAT 
CCTTTGCTTG 
AGTTATCTCC 
CACCACTATC 
TGCCAAAATG 
TGTGATGAAG 
TTACGGTACG 
AAAGCTCGCG 
TATGCAGGTT 
TACAGGGTCG 
TTTGGATGTT 
TGCTTCTGCA 
GAAGAAAGGT 
TCAAATCCTG 
TCAGGTGAAG 
CCTTGCCGGA 
GACATTGTTC 
AGCACCCACC 
TGCCT CTATA 
CTATAGTGGC 
TTTGTCTGGT 
GTCAGGTACT 
GGATTTAGCA 
CTTCGCAAAA 
CCAGATTACT 
TACTCGCACT 
AAGAAGAGAA 
GTTTTCAATT 
TGAAGGCGAT 
GAG CCAGGGA 
CTATTTGTCT 
CTCTGTTCGT 
TTCTGTCGGT 
TGATCTTCGA 
CCACCAAGCA 
TACAGCAGGC 
TGCAGGGGCT 
TGATATGTTG 
TGTAGGCTCT 
TAAGGACTGG 
GCTTTTCTTC 
GTACCCCAAT 
GTTGTGGTAT 
CTCAGCTGAC 
CTCCTTAGGT 
TAAATGGATG 
AAATAAAGAT 
AAAATTGGGA 
TTTGAAGAAT 
GATTGGTGGA 
AGGCTTTGAC 
GTACGTTGCG 



60 
120 
X80 
240 
300 
360 
420 
40Q 
54C 
600 
660 
720 
180 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
174 0 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3138 



(21 IN FORMAT TON FOR SEQ ID NO: 37 



(i) SEQUENCE CHARACTERISTICS : 
70 (A) LENGTH: 2607 buffs pairs 

IB) TYPE: nucleic acia 
(C) STRAMDEPNESS: double 
(U) TOPOLOGY: circular 

75 IIOLECULL TYPE; DNA (genomic} 



SUBSTITUTE SHEET (Rule 26) (RO/ AU) 



WO 99/29870 



25 / 490 



PCT/AU98/01023 



( i i i ) HY POTHETICAL : MO 
fiv] ANT I -SENSE : NO 

(vl> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALS 

Ux) FEATURE; 
10 (A) NAME/ KEY: raise feature 

(B) LOCATION 1 2607 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

15 TGGCATAGGA ATATTTTTAT CTTTGCGAGT ACATTTAGCC CGAAAAATAT GCTCCCACTG 
Snl^n^I ^ / ^ AA ° CGAGCACCTT TTTCTCGCAA AAGGATACTG CAAGAATCCA 
ATAACAAACA TAATTATCCT ATTTATCAAG AAAAAGAATT TTTTGCTTCT TGGCATTTTC 
SSSSSSf 2 TGACTTTCAT CGGCMCWre CAGGCACAAC AGGCCAAAGA S5£SS5£ 
TTToACGAAC GGGGCGAGGC CTACTTCfCA TTCAAAGTGC CTGATAGGGC CGTTCTACAA 300 

20 GAGCTGGCTC TGATCATGTC CATCGACGAG TTTGACCCCG TAACCAATGA AGCCA^^GC^ 

^? AGGAGTT CGAGG <^C CTGCGCTATG GGCTCAAGCC TACA^G 
ACTCCTCCAT rrATHPanfc /-,-^T>.->n»j-,-r *v~x ± j 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 38 

<i) SEQUENCE CHARACTER 1ST ICS: 

(AJ LENGTH: 318 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDUESS: double 
(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DMA (yenomic) 

Uii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v1 ) ORIGINAL SOURCE: 



60 
120 
lfiO 
240 



360 
420 
480 
540 



" " ~ " unuuvA1 1 ^■ibUbLTV\T( J GGCTCAAGCC TACATTCTTG 

ACTCCTCCAT CCATGCAGCG CGCTGTCGAG ATGTTCGACT ACCGCTCAGG AGAAAAATAC 

<™CCCCAC CTATGAAGCC TATATCAGCA TGATGGAAGA GTTCCAAACA 54 0 

AAGTATCCAT CACTTTGTAC TACTTCCGTC ATTGGCAAGT CCGTAAAGCA TCGTAAACTG GOn 

^n^'IS CA AGCTGACGTC CTCT GCCAAT ACAGGGAAAA AGCCTCGCGT GC^Sa^ACT 660 

C™5cGA ^?S ACCGGA ™GTGGTAC TGCTCCGACT CHESSES 7^0 

™C?GCC ™^ SS£E5« TI™^ AACGGAAGTA 7 80 



840 



~^wvj« rti irtrt'jMrtLA TTC-TGGATAA AACGtiAAGTA 

TGGATCTGCC CTTTGACCAA TCCGGACGGA GCATACAGAG CCGGAAACCA <£s££S£ 
GGAGCTACU'C GCTACAATGC CAACAATGTC GATTTGAACC GTAACTTCAA TCATGAtStA 900 
r™^» TC ACCCCCATGG AAAACCTTGG CAGCCGGAGG CAACTGCATT CATGGATTTG 
EESKS SISEST SS^S ^T52M5 GAGGAACAGA GGTGGTGaIc 



960 
1020 
1080 



TATCCATGGG ATAATAAAAA AGAAAGACAT GCAGACGATG 

CGCAACTACG CAGCCGCTTG T GAGAGT AT T TCCGCCAGCT ACATGACCTC CGAA^^ 114 0 

TCGGGAATCA TCAACGGTTC AGACTGGTAT GTAATTCGCG GAAGTCGTCA G^^T^A ^200 

35 AATTATTT CC ATCGTCTGCG AGAAATTACC CTTGAAATCA GCAACACGAA GTTGGTGCCG lllo 

GCCTCTCAAC TTCCAAAGTA TTGGAATCTG AACAAAGAAT CTCTGCTTGC TCTGATCGAA 1320 

GAATCCTTAT ACGGCATCCA TGGTACAGTG ACTTCCGCTG CGAACGGACA GCCTCTCAAA llto 

JESSES T GATAGAAAA CCATGACAAG CGCAACTCCG ATGTTTACTC CG^T^ 1440 

ACAGGCTACT ACGTACGTCC TAT CAAAGCC GGCACTTATA CGGTGAAATA CAAAGCCGAG 150O 

ATTGCATTGG GCA^SS i'£S ATTAGG ATCAAGGACA AAGAAACCGT ££Sc ^560 

ATCT^flGTAT r^ISS TCCTCTGCCT GTACCCGATT TCACAGCTTC TCCTATGACC 1620 

ATCTCAGTAG GCGAAAGCGT CCAATTCCAA GATCAAACGA CAAATAACCC CACGAATTGG ififlo 
TCGAAGGCGG ACAGCCTGCC ATGAGTACAG AGCAGAATCC 

4^ CCGGTCAGTA CGACGTTACG CTCAAAGTGT GGAATGCAAG TGGTTCCAAC 1800 

A ^CCGA ^r^^I ^CACTGTC AATGCCGT TA TGCCTGTAGC JESSES? lleo 

GGTACCCCGA CuGAAATAGA AGAGGGCCAG ACGGTAT CTT TCCAAAACCA ATCCA^CAAT 19?0 

CCgSgc ATTCGATGGC GGCACTCCCG OTAC^?^ AGACGAAAAC [Ho 

CCGACTGTGC TTTACAGCAA AGCCGGCCAA TACGATGTCA CGCTCAAGGC GATCAGTGCT 204 L' 

TCCGGT GAAA CGGT GAAGAC GAAAGAAAAA TACATCACTG TCAAGAAAG- TCOG^TCCCT IVoo 

50 GCTCCGGTAG CCGACTTCGA AGGAACACCT CGAAAAGTAA AGAAAGGCGA GAMTTaS 2^60 

TTCAAAGACT TGTCTACGAA CAAT CCGACT TCATGGCTTT GGGTGTT CG A AGG^GOTCT llll 

SSrSEE J*™™ G G ^ACCTACA ATGAAACAGG 2280 

GTCCAGCTGA CTGCCACCAA CGAGGGCGGA AGCAATGTGA AGAAAGCAGA AGACTACATT J3 «n 

55 ™I A I^ ™GACAG TGTCGAGGAC ATAGTGGGAC !S "oO 

SSS^ H 

sssss ssess sssssr TflCACCATC * ATATcS »ss 

2 607 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



it ."B «3 'S3 H™ iCp Cji hi 




WO 99/29870 



PCT/AU98/01023 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(A) ORGANISM: PORYPHYROUONAS GINGXVALIS 

(ix) FEATURE : 

(A) NAME /KEY: misc_f ea tur& 

(B) LOCATION 1. . . 318 

|xi) SEQUENCE DESCRIPTION: SEQ TD NO: 38 

CTTTCGTACA GTGGAGAGAG CGATGCAAAA GAGTCTGATC AGAATTGCCG GAAATGTACC 60 

TTCATCGGCT TTGAAAAACG AGTAAATACG ATGCGACTGA TCAAGGCTTT TCTCGTGCAA 120 

CTCTTACTGC TCCCCATTTT CTTCTACAAG CGGTTTATAT CGCCGCTTAC ACCGCCTTCA 18 0 

TGCCGGTTTA CCCCCTCATG TTCGTCCTAT GCCATCGAAG CCTTACGTAA ATAT6GCCCG 210 

GGCAAAGGAC TATTGCTGAG CAT CAAGCGT ATTCTCCGCT GTCACCCGTG GGGTGGAAGT 300 

GGCTATGACC CCGTTCCG 318 



(2) INFORMATION FOR SEQ ID NO: 39 

U] SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 Dase pairs 

(B) TYPE: nucleic acid 
tC> STRAN REDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULP. TYPE: DMA (genoiivic) 

[lii> HYPOTHETICAL: NO 

liv) ANT I -SENSE: NO 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROUONAS GINGIVALIS 

(ix> FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 . . - 2583 

iKl) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

TTAGAGATGG CATACGACTT TACACAAACA TTCCGCAACA GCCTGGAGTA CAGCTATCAG 60 

GAAGCAACCC GTCTCGGCGT CGTAGCCGTG ACGCAAGATA TGCT CGTACT CGGTATCATT 120 

CGCGACGGAG ACAAT GGCGC GATCGACATC ATGCGGCACT ATGGGATCAA CTTGTACGAA 180 

CTCAAACGGT TGATCGAGTT GGAAGCCATC GCCGAGAGTT TGCCTGCTTC GCCTGAGGGA 240 

TCGCCCATCT TCACCCCTTC GGCTCGGGAG GCTATCGATG ATGCCACAGA CATCTGTGCC 300 

GACATGGAGG ACGAGGCCGT CAGCCCGGTC CATCTGTTGC TGAGTATCCT CAACTCGACA 360 

CAGGAGAGCT TAGTACAAAA GATATTTATG AAACAAGGTA TAAAATACGA CACCATCCTG 420 

TCGGATTACT TCGGACAGCG CAACCCCTCC GAAGGGAAGT CTCCCTCCGA AATGGAGATC 4 80 

CTCGACGGGT ACCAAGACAA CGACTTCGAC GACGAAGAGG ACGAATCCTC TCCGCCTTCC 540 

GGGAATAGCG GGACAGGCGG AGGCTCCGGC GACGCCCCCG AACAGAATAC CGGCGGAGGC 600 

GATACTACCA CCACGACACG GAGTGGAGGC GACACGCCTG CACTGGACAC CTTCGGCACC €60 

GACATCACTG C CAT GGCGC C AGCAGGCAAG CTCGACCCGG TAGTGGGTCG GGAGCAGGAG 720 

ATCGAAAGGG TGATACAGAT ACTCAGCCGG CGCAAAAAGA ACAAT CCGGT GCTCATCGGC 780 

GAACCCGGTG TAGGCAAGAG TGCCATCGTG GAAGGACTGG CCGAACGCAT CGTGAACAGG 840 

AAGGTGAGCC GTATTCTTT'i* CGACAAGCGG ATCATCAGCC TCGATTTGGC TCAGATGGTA 900 

GCCGGCACCA AATATCGCGG ACAGTTCGAA GAGCGGTTGA AAGCCGTGCT CGATGAGCTG 960 

AAGAAGAATC CGCAGATCAT CCTCTTCATC GACGAGATAC ATACCAT CGT GGGAGCAGGC 1020 

TCTGCAGCCG GATCGATGGA, TACGGCCAAT ATGCTCAAAC CCGCTCTTGC CCGTGGACAC 1080 

GTACAGTGCA TCGGAGCCAC TACGCTGGAT GAGTATCGTA AGAACATAGA AAAGGACGGA 1140 

GCACTCGAAC GCCGCTTCCA GAAGGTGCCG ATAGCCCCCT CGACTGCAGA AG AAACGCT G 1200 

ACCATCCTGC AAAACATCAA AG AGAAAT AC GAGGACTATC ACGGTGTACC CTATACGGAC 1260 

GAAGCGATCA AAGCGGCAGT GGAACTGACC GATCGCTATG TATCCGATCG TTTCTTCCCA 1320 

GATAAGGCGA TAGATGCCAT GGACGAGGCC GGCGCGAGCG TCCATATCAC CAATGTGGTG 1380 

GCTCCGAAAG AAATCGAGAT ACi'GGAGGCC GAATTGGCAT CGGTGCGAGA GAACAAGCTC 14 40 

TCGGCCGTAA AGGCTCAGAA CTACGAACTG GCTGCCTCCT TCCGCGATCA GGAGCGGCGC 1500 

ACTCAGCAGC AGATAGCGGA AG AG AAG AAA AAATGGGAAG AGCAGATGTC CAAGCACCGC 1560 

GAGACGGTGG ACGAGAATGT AGTGGCGCAT GTAGTGGCGT TGATGACAGG CGTTCCGGCT 1620 

GAGCGGCTGA GCACGGGCGA AGGCGAACGT CTGCGCACGA TGGCAGATGA TCTCAAGACC 1680 

AAAGTAGTAG GTCAGGACAC AGCCATCGAA AAGATGGTGC ATGCCATCCA GCGCAATCGT 17 4 0 

CTGGGACTTC GCAATGAAAA GAAACCG AT C GGTTCTTTCC TTTTCCTCGG CCCCACGGGG 1800 

GTAGGCAAGA CCTATTTGGC CAAGAAGCTC GCCGAATACC TGTTCGAGGA TGAGAATGCC 1860 

ATGATCAGGG TGGATATGAG CGAGTATATG GAGAAGTTCT CCGTTTCG CG TCTCGTGGGT 192 0 

GCCCCTCCGG GATATGTGGG CTATGAAGAA GGCGGCCAAC TCACGGAGCG CGTAAGACGC 1980 

AAACCCTATT CCGTGGTTCT CTTGGATGAG ATCGAAAAGG CGCATGCCGA TGTCTTCAAT 204 0 

CTGCTCTTAC AGGTGATGGA CGAAGGTCAG CTGACCGACA GTCTGGGACG GCGCGTGAAT 2100 

TTCAAGAACA CCGTGATCAT CATCACCTCC AACGTGGGTA CACGCCAGCT CAAAGACTTC 2160 
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GGGCAGGGTA TCGGGTTCCG TTCGGAAAAA GACGAGGAAG CGAACAAGGA GCATAGCCGT 2220 

TCCGTGATCC AAAAAGCTCT GAACAAGACG TTCAGCCCCG AATTTCTCAA CCGTTTGGAC 2280 

GATATCATCC TCTTCGACCA ACTGGGCAAG ACGGAGATTC GCCGGATGGT GGACATAGAG 234 0 

CTTAAAGCCG TCTTGGCGCG CATCCATCGT GCCGGATACG ACCTCGTCCT TACCGATGAA 2 4 00 

CCCAAGGATG 'l'GATAGCGAC GAAGGGATAC GACCTCCaAT ACGGAGCACG ACCGCTCAAG 24 60 

CGCACACTCC AGAACGAAGT GGAGGATCGC CTCACGGATC TTATCCTCTC CGGACAGATC 2520 

GAGAAAGGGC AGACG CTT AC GCTCTCTGCT CGCGATGGCG AGATCATCGT ACAAGAACAA 2580 

GCA 2503 

<2) INFORMATION FOR SF.Q. ID NO: 40 



(i) SEQUENCE CHARACTERISTICS: 

(A] LENGTH: 1368 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 
ID) TOPOLOGY: circular 



(li) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 
(iv) AI ITT -SENSE: I JO 



25 ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS Git NIVALIS 

(ix) FEATURE: 

(AJ NAMF./KEY: miscjeature 
30 (B) LOCATION 1...1368 

(xj.) SEQUENCE DESCRIPTION: SEQ ID NO:40 

AGAGCAGGAC AAATCCGCCG ACACCATACC GATTCGTCAA GGGGATCGGA CAGCAAAGCC 60 

35 TGCCGCTTCT CCTTTCATGC CGAGACCATC GGTTTCTCCA ACCACCAAAG AGCAAAAATG 120 

AATTACCTGT ACATACTGAT TACACTTTTA CTCTCCGGCT TTTTTTCCGG TGCT GAGATT 180 

GCTTTCCTTT CTTCGGACAA ACTGCGTCTT GAGTTGGACA GGAATAGAGG CGATCTCACA 24 0 

GGCAGAGCGT TAAACTTGCT GTATCGACAT CCGGACCAAC TGGTGACTAC TCTCCTTGTG 3O0 

GGTAATAATA TCGTTTTGGT AGTCTATGGT CTGCTGATGG CGGGATTGCT GGCCGCACCT 360 

40 TTGGCGCAAT GGATTGATAA CGATGCTATG ATCGTCGTTC TCCAATCTGT CTTATCCACT 420 

ATCATCATAC TGTTTACCGG GGAATTTCTA CCCAAAGCCA TTTTCAAGAC CAATGCCAAT 4 80 

ATGATGATGA GGGTATTCGC CCTCCCTATC GTAGCGATCT ATTATCTGCT TTATCCTCTG 54 0 

TCTAAACTCT TCACC«GTTT ATCTCGCTCT TTTATTCGTC TGGTGGACAA GAATTATGTG 600 

CCTACAACAG TAGGGTTGGG GCGCGTAGAT CTCGATCATT ATTTGGCAGA AAATATGTCC 660 

45 GGAGAAAACG AACAGAACGA CTT G ACT ACC GAAGTGAAAA TCATCCAGAA TGCGCTGGAT 120 

TTTTCCGGTA TTCAGGTGCG AGACTCCATG ATCCCACGCA ATGAGATGAT AGCATGTGAG 7Q0 

TTGCAAACGG ATATTGAAGT ACTCAAAACG ACTTTTATCG ATACCGGTTT GTCCAAGATC ©40 

ATTATCTACA GACAGAACAT AGATGACGTA GTAGGATATA TCCATTCGAG CGAAATGTTT 900 

CGTGGGCAAG ACTGGCAAAA ACGTA'l'CAAT ACTACTGTAT TCGTACCCGA AAGCATGTAT 960 

50 GCCAATAAAC TGATGCGACT ACTCATGCAG CGCAAGAAAA GCATTGCGAT CGTCATCGAT 1020 

GAACTTGGAG GTACGGCCGG AATGGTCACA TTAGAGGATT TGGTAGAAGA GATTTTCGGT 1080 

GACATTGAGG ACCAACACGA CACTCGCAAG AT CAT AGCCA AACAGCTCGG CCCTCATACC 114 0 

TATCTGGTCA GTGGTCGTAT GGAAATAGAT GATGTGAACG AACGTTTTGG GTTGTCCTTG 1200 

CCTGAGTCTG ACGACTACCT TACCGTGGCC GGATTTATCC TGAATAGCCA TCAAAATATC 1260 

55 CCACAGGCCA ATGAGGTCGT GGAGATTGCT CCTTATACTT TTACCATTCT CAGATCTTCT 1320 

TCCACCAAGA T CGAACTGGT GAAAATGTCC ATCGACGACC AATCGAAC 136 B 



(21 INFORMATION FOR SF.Q ID NO: 41 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: doubl- 
es ( D) TOPOLOGY : circular 



[iij MOLECULE TYPE: DNA (genomic) 

(III) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

|vi) ORIGINAL SOURCE: 

(A) ORGAN1SI1: PORYPHYROMOIIAS GINGIVALIS 
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(ix} FEATURE: 

(A) NAME/KEY: miscjeature 
{ B) LOCATION 1 . . . 897 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TTAAAAAAGG AGATAACTAT GAAACAGAAC TACTTCAAAA GAGTCTGCTC ACTGCTTTGG 60 

CTGGTTTTAC CCATGCTTAT TATGCCATTG GAAGTAGCAG CTCAAGAGAT TATTCCGAAC 120 

GAAGAGGTGT TGGAATCATT GACTTTCGTT GCACCGGTTG AGGAGACAGA CGCAATAGAG 180 

10 GCAGAGGTAG AAGCTCTGCA GGAGATAGTC GCTACTGAGG AG ATT GCGGA GCAGGCTGTT 240 

CGTTCTTATA CCTACACGGT CTATCGTGAT GGCGTGAAGA TTGCTTCAGG ATTGACTGAG 300 

CCCACTTTTC TCGATGAAGA TGTTCCTGCC GGCGAACATA CCTACTGCGT AGAAGTACAG 360 

TATCAGGGAG GCGTATCCGA CAAAGTATCC GTGGACGTAG AGGTGAAGGA CTTCAAACCG 42 0 

GTTACCAATC TCACCGGAAC TGCTTCCAAT GACGAAGTTT CTTTGGACTG GG ACGGTGTG 480 

15 GAAGAGAAAG CTGAAGAGCC GGCAAGTGAT AAAGCAGTCA GCTACAACGT CTACAAGAAT 54 0 

GGAACCTTGA TCGGTAATAC AGCTGAAACT CATTATGTGG AGACCGGTGT AG CCAATGGT 600 

ACATACATCT ACGAAGTGGA AGTAAAGTAT CCTGACGGTG TATCTCCGAA GGTGGCTGTA 660 

ACCGTGACCG TGACCAACAG CTCATTGAGC AATGTAGATG GACAGGCTCC TTACACATTG 720 

CGAGTAGAAG GCAAGAAGAT TATTGCGGAA GCCCATGGTA TGAT CACGCT CTACGACATC 7 80 

20 AACGGACGTA CCGTGGCCGT AGCCCCGAAT CGATTGGAAT ACATGGCGCA AACCGGTTTC 84 0 

TATGCAGTGC GCTTCGATGT GGGGAATAAA CACCATGTAT CGAAAATACA AGTAAGA 8 97 



25 



35 



(2) INFORMATION FOR SEQ ID HO; 42 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1131 base pairs 

(B) TYPE: nucleic acid 

t C ) STRAW DEDNESS : double 
30 (D) TOPOLOGY: circular 



(ii) MOLECULR TYPE: DNA (genomic) 
Mil) HYPOTHETICAL: NO 
liv) ANTI-SF.NSE: HO 



(Vit ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIIAS GINGIVALIS 

40 

(ix) FEATURE: 

(A) NAJtE/KEY : roisc_feature 

(B) LOCATION 1...1131 

45 {xij SEQUENCE DESCRIPTION: SEQ ID NO: 42 



TTGTTTAATC ATAAAAAATC ATGGTATGAA 
TCTGGCTTTA TTCTTTTGTG TGAAAATACC 
CCTGTGTCGG ATTTACGTGC AGAAGCGTAC 

50 CCGTATGACA ATCCGATGAT TCCTCTAAGC 
TGCAAGACCA TTGACGCAGA TGGCGATGGC 
GGACAGAGTG GTCTCTGTGT CTCTTCGGCT 
CCGGACAATT ATCTGATAAC ACCCGAATTA 
ATCTATTGGG TATGTACTCA AGAT CTCACT 

55 TCTTCTACAG GCAATAATGC TGCTGACTTT 
GCCAAACGGA TACAATCCCC CGAGTTGATC 
CAAAGAAAGG TGGTACTCCC TAACGATACT 
ACGGATAATT TCTGGCTCAA TTTGGATGAA 
AGAGCTCCGT GTCCGCATCC GGG TGGTTAC 

60 ATAGCGAGTG GATTGTCGGC ATTGGCATAT 
GACTATTGTG TCCAAGTCAA TTATCTGCAA 
GTGGTGGCAA ATTCTGCAAA CATCTATGGG 
GGCAAGACCA TTGTAGCGAG TGCTTTCAAA 
CGGCTGATAG CTTCCGGCTG CGATACGCTT 

65 ATTAAAATAC AGGTAAACGG AACTGTCTAT 



ACATTTCAAT TTTATCTCGT TGTTTTCCGC 60 

CTTGCACAAC AAAAAACAGA GGAGTTTGCA 120 

GGCTCTACCG TTTTCCTCCA CTGGACTCCG 180 

GAGAGTTTTG AATCAGGTAT TCCAGCTATA 24 0 

TATAATTGGA TGCATTTGAC CAATTTCACG 300 

TCATACATAG GCGGCGTCGG AGCTTTG ACT 360 

AAACTACCCA CAGACGCGTT GGTGGAAATA 420 

GCTCCATCGG AGCACTATGC CGTTTATTCC 400 

GTTAATCTCT TATATGAAGA GACTTTGACT 54 0 

CGCGGAAATC GGACACAAGG TGTTTGGTAT 600 

AAATATGTTG CTTTCCGCCA TTTTAATTCC 660 

GTATCTATCC TGTATACCCC TCTTCCCCGA 720 

ACTTATTCTG TATTCCGTGA TGGACAAAAG 780 

ATCGATACGG ATGTACCGTA TGGGACTCAA 84 0 

GGAGACTCGT ATA AAGTCT G CAAAAATATA 900 

GCGGATAAGC CTTTTGCGTT GACCGTGGTT 960 

GGAGAGATCA CTCTTTATGA CATTCGTGGC 1020 

AGGTACAAAG CGGAAAATGG TTTTTACCTC 1080 

ACTGAGAAAA TCCAAATCCA A 1131 



(2) INFORMATION FOR SEQ ID HO: 4 3 

70 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(11} MOLECULE TYPE: DMA (genomic) 

Uii] HYPOTHETICAL: NO 

5 (iv) ANT I -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y P H Y ROMON AS GINGIVALIS 

10 (ix) FEATURE: 

(A) NAME/KXY: nasc_feature 

(B) LOCATION 1 . , .2547 



15 



50 



55 



65 



70 



(xi) SEQUENCE DESCRIPTION: SEQ TD NO:43 



TTCGGAATAT CACCTTCAAT GAAAAAAAGT TTTCTTTTAG CCATAGTAAT GCTCTTTGGC 60 

ATTGCCATGC AGGGACATTC TGCTCCGGTT ACGAAAGAGC GAGCTTTGAG TCTGGCTCGG 120 

CTGGCTTTGC GACAGGTATC CTTGCGAATG GGACAAACAG CAGTATCTGA CAAGATTTCC 180 

ATCGATTACG TTTATCCGCA AG GAG AT G CT GAGAGGGGTA TCACAT CAC A AGAGGAAGGC 24 0 

ZU TCTCCTGCAT ATTTTTATGT AGCTAATCGT GGAAATAATG AGGGCTATGC TCTTGTAGCA 300 

GCAGATGACA GAATACCGAC AATTTTAGCC TATTCACCCA TTGGCCGTTT CGACATGGAC 360 

AGTATGCCGG ACAATCTTCG CATGTGGCTA CAAATTTACG AT CAGGAAAT AGGCCTGATA 4^0 

CTTTCCGGAA AAGCTCAGCT CAATGAAGAG ATATTACGTA CCGAGGGCGT ACCGGCTGAA 4 00 

GTACATGCTC TGATGGATAA CGGTCATTTT GCCAACGATC CCATGCGATG GAATCAAGGT 540 

^ TACCCATGGA ACAATAAGGA ACCACTGCTT CCTAATGGCA AT CATGCCT A TACCGGCTGT 600 

GTTGCTACTG CTGCAGCACA AATCATGCGC T ACC AT AG CT GGCCGCTTCA AGGTGAAGGC 660 

TCTTTCGATT ATCATGCAGG TTCATTAGTT GGOAACTGGT CCGGCACATT TGGTGAAATG 720 

TACGACTGGA TCAATATCCC CGGAAATCCC GACCTTGATA ATCTGACTCA ATCTCAAGTG 7 60 

GATGCCTACG CCACACTGAT GCGTGATGTG AGT GCATCTG TTTCGATGAG TTTTTATGAA 84 0 

30 AATGGAAGTG GTACGTACAG CGTTTATGTA GTAGGAGCCT TGCGAAACAA CTTTCGCTAC 900 

AAGCGTTCAC TGCAGCTACA TGTACGCGCC TTATATACCT CACAGGAGTG GCACGATATG 960 

ATCCGCGGGG AACTTGCCTC CGGAAGGCCG GTCTATTATG CAGGGAATAA CCAGAGCATA 1020 

GGACATGCTT TCGTTTGCGA TGGTT AT GCT TCGGATGGTA CTTTCCATTT CAACTGGGGT 1080 

TGGGGAGGTG TTTCCAACGG CTTCTACAAA CTAACACTCC TCTCGCCGAC TTCGTTGGGT 114 0 

35 ATCGGAGGTG AGGGAATAGG TTTTACCATT TATCAAGAGA TCATCACCGG TAT CGAACCG 1200 

GCTAAGACTC CCGCTGAAGC CGGTACAGAT GCCTTGCCGA TCTTGGCACT GAAAGACATA 1260 

GAAGCCGAGT ATAAAAGTGA ATCCGGATTG AACGTAGGGT ATTCG AT ATA TAATACAGGT 1320 

GAAGAGCAAT CAAATCTTGA CCTCGGATAC AGATTGAACA AGGCTGACGG AGAAGTCATA 1380 

GAGGTGAAAA CTT CATCT AT CAATATCTCT TGGTACGGAT ACGGAGAGCA TCCCGAGAGT 14 4 0 

4U TTCTCATTGG CACCTAATCA GTTGTCACAA GGAATCAACA CCATCACCCT ACTTTATCGT 1500 

CGCACAGGCA CCGAACAGTG GGAGCCGGTA CGGCATGCAC AGGGAGGATA TGTCAATAGC 1560 

ATTAAAGTAA ATACGACAGA CCCGAACAAT GTCGTAGTCA CGGTAGATAA TAACGAAGGC 1620 

AAGCT CAGTA TCGTCCCCAA CAGCTTTGTC GCAGATCTGA ATTCTTATGA ACATAGTACG 1680 

AT T AC AG T AC AGTTCAATAG CGACAGCCCT GATGAGATCC GTACACCCGT AGCCTTTGCT 1740 

45 CTATCTACAG GAGCTACTGC GGACGATGTA ATATCTTT GG GCTGGGTAAT GGCTGAAGTT 1800 

CCGGGCGGTA GCAGCAACTA TCCGGTGGTT TGGTCTAAAG ACGTTCTCAC TCTCTCGGAA 1860 

? G Si^ TA CATTGTGGTA TAGATTTTCC ATCAACAACC AAAAGGATGA ATGGAAAAAG 1920 

A 1 CGGAAGCG TGTCAGTAAA AACACCGACA GAGTATACGC A'JCCCTTATT CGAAGTGGCC 1^80 

CATAATCAAA CTTCTACCTA TACGCTGGAT ATGGCACACA ACAGAGTATT GCCCGACTTT 704 0 

ACACTCAAAA ATCTCGGATT GCCTTTCAAT GGTGAGTTGG TTGTTGTTTT CCGCCAAACA 2100 

5^I5 C T5 AT CG ^ GGTCTTT ATGGGCAGCT CAAGAAACAG TACATATCAA GCAAGGAGAA 2160 

ACTTTCGTAT ATAAACCTGT TGTCGAAGGC CCTATACCTG ATGGATCCTA TCGTGCGACC 2220 

CTCCATGCAT TCGTAAACGG ACAACAACAG TTGTACCTCA AGGGGAAAAG GAACTACACG 2280 

GTGAAGATCG TCAATGGTAC AGCGGTAGAA GCAATAGAAT CGTCAGAAGA GATCAGAGTA 2340 

TTCCCTAATC CGGCACGCGA TTATGTGGAA ATATCGGCAC CTTGCATTCC CCAAGAAACA 24 00 

TCTATCATTC TTTTCGATCT GTCAGGCAAG ATTGTCATGA AGAATAGTTT ATCAGCGGGG 24 60 

CATGGCAGAA TGGATGTCAG CCGACTTCCT AATGGGGCCT ACATCCTTAA GGTGGATGGA 2520 

TATACGACGA AAATAAATAT AGTGCAC ™~ 

60 

(2) INFORMATION FOR SEQ ID NO: 4 4 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 85 bdss pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDN ESS: double 

(D) TOPOLOGY: circular 

(il> MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE; NO 
75 (vi) ORIGINAL SOURCE : 
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(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : mlscjeature 
5 t3) LOCATION 1...885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 4 

ACGAAGAAAA CACTGATGAA AAAGCTATTT CTCTCGCTCA CGAGTCTTGT AATGGTCTTC 60 

10 GCTGTTGCAA GTTGCGATAT AATCGACAAG GATCAAACCC TCTTGCCGGC TCCGACCAAT 120 

GTGACACCCG ATAATCCGGA TGACAATCCT TCGGAGATCG ACATT ACG CA GACGCACACA 130 

GAAAAATATG TTTTGGCTGA AGAATTTACC GGCCAAAAAT GTCTCAACTG TCCGAAAGGT 24 0 

CATCGCAAAC TGGCGGCTCT CAAGGAGCAA TACGGTAAGA GATTGACTGT TGTCGGTATA 300 

CATGCCGGCC CTGGATCTCT CG'i'GCCACCT CTTTTCCGTA CAGAAGCCGG AGACGCATAT 360 

15 TATAGCAAGT TCGCCAATAA TACCCCTCTC CCTGCGCTGA TGGTTTCGCG CAAAAAGTTC 420 

GGCTCTTCCT ACGTTTATGA TAAGAGCTAC AAAACGTGGG ACGTGCCTAT TGCCGAGCAG 4 80 

ATGGAGCAAA AGGCGAAGAT CAATATCTTT GCCGTGGCCC AATACACCGA TACCCAAAAG 54 0 

ATCAAGGTGA CTGTAAAGGG T AAAAT ACT G GAGGGGAATA CACTCCCGAA GTCCATGGTT 600 

CAGGTGTATC TGTTGGAGGA TAAGCrGATC GCTCCGCAGG TGGATGGCAA TACGACAGTC 660 

20 GAGAATTACG AGCACAATCA CGTGTTGCGT GGAGCCGTTA ATGGTATTTG GGGCGAAGAA 720 

TTTGTGAATC TCAAAGATTA TTTGTATACT TACGCCGTTG AACCGCTCTC GGGTATGTCC 78 0 

TTCGTAGCCG AGAATTATTC GATTGTGGCT TTTGTATACG ATGTGCAGAC GTTCGAAGTG 84 0 

TATGACGTTG TGCATGTAAA GATCAATCCG CAATCCGATG GCAAA 885 



25 



35 



70 



(2) INFORMATION TOR SEQ ID HO: 45 



ti) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 690 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
( DV TOPOLOGY: circular 



[Ii) MOLECULE TYPE: UNA (genomic) 
(iix) HYPOTHETICAL: NO 
Uv> A> ITT— SENSE : NO 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROMONAS GINGIVALIS 

Iix) FEATURE: 

(A) NAME/ KEY: misc_f-ature 
45 (B) LOCATION 1...690 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

ACGAATAAAA AAGAAGAGAC AATGAAGAAA TCAAGTGTAG TAGCCTCAGT TTTGGCCGTG 60 

50 GCTCTCGTGT TCGCCGGTTG CGG ACT GAAC AATATGGCAA AAGGCGGCCT TATCGGCGCC 120 

GGAGTAGGAG GTGCCATTGG TGCCGGAGTA GGTAACGTAG CCGGAAATAC GGCTGTCGGT 180 

GCCATCGT CG GTACTGCAGT CGGTGGAGCA GCCGGTGCTC TCATCGGAAA GAAGATGGAC 24 0 

AAGCAGAAAA AAGAACTGGA GGCCGCAGTA CCCGATGCTA CGATTCAGAC AGTAAATGAC 300 

GGAGAGGCTA TTCTGGTTAC TTTCGATAGC GGTATCCTCT TTGCGACGAA CTCCAGCACT 360 

55 CTGAGTCCCA ACTCACGCAC TGCGCTGACG AAGTTTGCTG CAAACATGAA CAAAAACCCC 4 20 

GACACGGATA TTCGTATCGT AGGCCATACG GACAATACCG GCTCCGACAA GATCAACGAT 4 80 
CCTCTGTCTG AGAGACGTGC AGCCAGCGTA TATTCTTTCC TGAATTCTCA GGGTGTGAGT y 54 O 

ATGTCGCGCA TGGCAGCCGA AGGGCGTGGG AGCCAT GAAC CGGTTGCAGA CAATAGCACA 600 

GTTGCCCGAC GTTCGGCCAA CCGCCGTGTG GAGGTTTATA TCTTGCCGAA TGCCAAGATG 660 

60 ATCGAACAAG CACAGCAAGG TACGCTGAAG 690 

{2) INFORMATION FOR SEQ ID NO: 4 6 

65 (i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 



IB ) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D> TOPOLOGY: circular 



(il) MOLECULE TYPE: DMA (genomic) 
(111) HYPOTHETICAL: NO 
75 (ivl AUTI-SEUSE: NO 
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Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY P HYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1...1026 

(xx> SEQUENCE DESCRIPTION: SEQ ID NO: 4 6 



AACAGGAACA GAAATATGTC GAAAAAATCG ATCCTTCTGC TTTGCTGTTC GCTGTGCTTC 60 

ATTTCTGCTA CGAAGGCTGT GACCCCCGTC AGAAATGTGC GCAATAGCCA AGTGAACAGC 120 

AAAGCAAAGA CCGAACGTAC AAAGCCCTCG GACTCTGTAC GGTACATTAG CAACATGATT 180 

GCAGATCGGC TGGAGTTCCG CAACAAGATT TCTTCCGAAA AAGAGGTAAG AAAAGCCGAA 24 0 

lO TATGAAAATC GGCTGGCGAT GGAAGCACTC AATTACCCTG CCATAGATTT ATATGGTGAA 300 

GATTCTTGGA GCGAGTATGT AAACCCTTTC GTGGGTGCAG GAACCGATGT CGAAATTCCG 360 

AACTCCTATG ACATTGATTG CTCTTCGTTC GTGATGCCCG TCGAAGATAA GCAGGTCACC 4 20 

TCTCAATTTG GCTACCGTCG GCGTTTCGGA CGGATGCACT ATGGTATTGA TCTTTCAGTG 4 60 

AATCGTGGCG ATACGATACG AGCAGCCTTT GACGGGAAAG TTCGTGTACG CAGCTATGAA 540 

GCGCGTGGCT AT GGCT ACT A CATAGTCTTG CGCCATCCGA ACGGACTGGA GACTGTGTAC 600 

GGACACATGA GTCGCCAATT GGTAGACGAG AATCAGATCG TTCGAGCAGG ACAACCGATC 660 

GGATTAGGAG GCAGCACGGG TCGAAGCACC GGTCCTCATC TTCACTTCGA GACCCGCTTC 720 

ATGGGTATTC CCATCAATCC GAGTACCATT ATAGACTTCG ATAACGGAGT GCCGCTCCGA 780 

GACATTTACA CATTCAAACG AGGGAGCAAT TCTCGCTATG CAAAAGCCTC TAAGACTTCT 84 0 

TCTCGCTATG CAAAAAAAGG GAAGAAAGGC AGACAAGCTT CTTCTCCTAT GACCTATAGA 900 

ATCAAAAAAG GCGATACTTT GGAAACAATA GCCAAAAGGC ACGGCACTTC TGTT CAGAAA <>60 

CTCTGTGCTA CCAATGGCAT TGGCAAGAGT AAAATTTTGA CTCCGGGCAA AGCCTTGAGG 1020 
ATCAAA 



20 



25 



30 

(2) INFORMATION FOR SEQ ID NO: 47 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(lv} ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

<A) ORGANISM: PORY PHYROMONAS GIUGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : miscjaacure 
50 (BJ LOCATION 1...477 



35 



40 



55 



60 



65 



70 



75 



1026 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

CCGAGCAAAA CGATAATTAA GACAATGGCA AAAATCAATT TCTAT GCTGA AGGCGTCAGC 60 

CTTCCTCGGA TCAGAAGACG GATCGTCGGT AAGTGGATAG CCGAAGTATG CAGCCGATAT 120 

GGGAAAGCGG TGGGAGAAAT CT CCTATCTT TTCTGTGATG ACGAATATAT CCTGAAAGCC 180 

AATCAGGAAT TTCTCGATCA TGACTACTAC ACCGACATCA TCACCTTCGA TTCCTGCGAA 24 0 

GCGGATACGG TGAATGGCGA CCTGCTTATC AGTCTCGATA CCGTACGCTC GAATGCCCGT 300 

GCTCTTGATC TTCGATACGA AGACGAACTG CATCGTGTCA TTATCCACGG CATACTGCAT 360 

CTTTGCGGAT TGAAAGACAA GAGCAAAAAG GATGAAGCCC AAATGCGTGC AGCCGAAGAG 420 

AAAGCCCTTG TCATGCTGCG AGAAACCATC GGATCGGAGC TTTCCCTATT GCATACA 477 

(2) INFORMATION FOR SEQ ID NO: 48 

ti> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118S base pairs 

(B) TYPE: nucleic acid 
{C} STRANDEDNESS: double 
(D> TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (qenomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: NO 



(vi> ORIGINAL SOURCE: 

I A) ORGANISM: PORYPHYROMQNAS GINGIVALIS 



10 



( ix ) FEATURE : 

(AJ HAMS/ KEY ; miscf eature 
(B) LOCATION 1. ..1185 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 8 



15 



20 



25 



30 



AAAAGTAAAA 
AACGCAAGTG 
GCTTTTGCTC 

gcgcagttcc 
tct ctttctg 

GGTCAAGCCC 
GCAGCTCACT 

GAATGGAGCA 
AGATTAGGAA 
CTAAGTCGCG 
TACAATGGAT 
TTCAATGCCA 
CGTTTGCGCA 
GAAGTAACTC 
TTCGACAGCC 
GTAAAAGAAA 
CAATACAACG 
AAATATGGTG 
TTCAGCAAGA 



CTATGAAGGT 
CACAGGAGAA 
GCAATAAAGC 
TCAATGACAA 
TCGGAAAGTA 
ACACTTTCCT 
TCGACTTTAT 
ATTTAATTCC 
AAGACAATGT 
AGCGAGTAGA 
CATACAATGC 
TCCAGGGGAT 
TTGAHCCAAT 
GCGAGGTCGA 
CTGTTACTAA 
ACGTTGTGGA 
CTAACGAGCC 
AGAAATTGTC 
TGCCTTCCGA 
AAGCTTGGAA 



AAAGTACTTA 
TACTGTACCG 
AGGCAGCAAT 
CAACAACAAA 
TCACAGCCCT 
CGGAAAAAAT 
GTTTGATGTG 
ATGGGTAGGT 
GGAATCACTG 
CTTTGTGATC 
CAAGAAAACT 
GGCTACAGCA 
GGACTACGCA 
AGAACTCTCA 
GACAGAAAAT 
CAAAGATCAA 
GATTACCGTT 
TGAGCGTCGG 
ATTAATCTCT 
TCGTGTTGTA 



ATCCTCACAT 

GCAACGGGTC 

TGGTTTGTAA 

GACCTCATGG 
TTCTTTGCAA 

GGCGAACAAG 
GTTAACTACT 
GTTGGCTACC 
ACGGCGAATG 
GAAGCACAAG 
CCCGTATTCC; 
GGTCTTAATT 
CTTATCAATG 
AAACGTCCTG 
ATACTGACGG 
TTGATCAACC 
GTTGGTTATG 
GCTAAAGCCG 
GTAGAATGGA 
ATCGTTCGCT 



TGGTTGGAGC 
AGTTACCCGC 
CACTGCAAGG 
ACCGCTTAGG 
CTCGTTTGCA 
AAATCAACAC 
TTGCACCATA 
AACACAAATT 
TAGGAGTTAT 
CAGCTCACTC 
AAGATCCCGC 
TCCGCCTGGG 
ATCTGAATGG 
TATCATGCCC 
AAAAAGCTGT 
TGTATGACGT 
Cl'GATCCTAC 
TTGTTGATGT 
AGGGCGACTC 
CCAAG 



AATTGCACTG 
TAAGAATGTT 
CGGTGTTGCA 
AGCCATAGGT 
AATTAACGGA 
CAATTTTGGT 
TCGCGAAAAT 
CATCGGTAGC 
GATGGCTTTC 
CAATCTCAAT 
AGGACGl'TAT 
AGCCGTAGGC 
TCAGATTAAC 
CGAATGTCCT 
ACTGTTCCGT 
AGCTCAGTTT 
GGGTAATACT 
T CTGACAGGT 
TACGCAACCG 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
"720 
780 
840 
900 
960 
1020 
1080 
1140 
1185 



35 



40 



(21 INFORMATION FOR SEQ ID NO: 4 9 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 
IB) TYPE: nucleic acid 
(CI STRAW DEDHESS : double 
<D) TOPOLOGY: circular 



45 



(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) A11T1 -SENSE: HO 



50 



55 



60 



65 



70 



75 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIUGiVALIS 

Ox} FEATURE: 

(A) NAME/KEY: miscf eature 
{ B ) LOCATION 1 . . .1161 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 9 



TATAAAATGA 
GCATGCACAT 
ACCGCATTCC 
GGTATGGCTC 
CCTACTTTCG 
GGATTCGACA 
AACGCCCACC 
CGTGTCTTCC 
AACGCCAATG 
TTGATGCTGA 
GCCGGAAAGA 
ACAGCAGGTC 
TATGCTTTGG 
TTGAGCCGTC 
CGTGTAGTCG 
CAAGAAATCA 
GTAGTAGGTT 
CGTCGTGCAA 
ACAATTGAAT 



CATACAGAAT 
TCAGTGCAAC 
AACGTGATAA 
TCTCGGGATG 
GTATCGGTAA 
TCTATGGATT 
TTGACTTCAT 
AT AT CATCCC 
GTGAAAAAGT 
AAT TCCGCCT 
TGAACT'TTAT 
TAACGTTCAA 
TCAATGACCT 
GTCCTGTTTC 
TTGACAATGT 
ATGTTTACAA 
ACGCTGACGA 
AAGCGGTAGC 
CGAAGGGCTC 



TATGAAAGCT 
AGCCCAAGAA 
GGCCTCCGAT 
GAATAATGAT 
ATGGCATGAG 
CCCGCAAGGG 
GTTCGATCTG 
ATGGGCAGGT 
AGGAAGTAAA 
ATCAAGACTC 
CGGGACAAAG 
CCTTGGCAAG 
GAACAACCAA 
ATGCCCTGAA 
GGTTTACTTC 
TACAGCTGAA 
AAAAACCGGT 
CAAGATGCTT 
ATCAGAGCAA 



AAATCTTTAT 
GCTACTACAC 
CATTGGTTCA 
GTAGACTTTG 
CCTTATTTCG 
AGCAAGGAGC 
ACGAACTATT 
ATAGGATTTG 
GATGATATGA 
GTAGACTTCA 
AGAGGAAAAG 
ACAGAGTGGA 
ATCAACTCAC 
TGCCCTGAGC 
CGTATCAATA 
TATGCGAAGA 
ACTGCGGCCT 
GAAAAGTATG 
ATCTATGAAG 



TATTAGCACT 
AGAACAAAGC 
TTGACATTGC 
TAG AT CGTCT 
GTACTCGTCT 
GTAACCACAA 
TCGGTGTATA 
GTTATAAATT 
CCGGAACAGT 
ATATTGAAGG 
CAGACTTCCC 
CAGAAATTGT 
TTCGCGCTCA 
CTACACAGCC 
GTGCAAAGAT 
CCAACAACGC 
ATAACATGAA 
GTGTTTCTGC 
AG AACGCTT G 



TGCGGGTCTC 
AGGGATGCAC 
AGGTGGAGCA 
AAGTATCGTT 
CCAATTCACA 
TTACTTTGGA 
CCGTCCCAAT 
CCATAGCGAA 
TAATGTCGGT 
ACAAGCTTTT 
TGTAATGGCT 
TCCTATGGAC 
AGTGGAAGAG 
TACAGTTACT 
TGATCGTAAT 
ACCGATCAAG 
GCTTTCAGAG 
GGATCGCATT 
GAATCGTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
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GTAGTAATGA CTGCAGCGGA A 



(2) INFORMATION FOR SEQ ID NO: 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 

{ i i > HOLECULE TYPE: DMA (genomic) 

tiii» HYPOTHETICAL: NO 

tiv) AJIT1-SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GING1VALIS 

lix) FEATURE: 

(AJ NA) IF,/ KEY: misc_f eature 
<B) LOCATION 1 . . .585 

Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 50 

GTAATTGGCA TCATTATGGA ATTTTTCATG TTATTCATAG CGGCGGTTTT CGTTAATAAC 
GTCGTGCTGT CGCAGTTCCT CGGTATATGC CCATTCTTAG GCGTATCGAA GAAGGTAGAC 
ACCTCAATCG GTATGGGTGC AG CCGT GAC A TTCGTATTGG CACTGGCTAC CTTGGTTACC 
TTCCTGATTC AG AAGT TCGT TTTGGATCGT TTCGGATTGG GCTTTATGCA GACCATTGCA 
TTTATTTTGG TCATTGCCGC CTTGGTGCAG ATGGTGGAGA TCATACTCAA GAAAGTATCT 
CCTCCCCTCT ATCAGGCACT GGGTGTATTC TTGCCCTTGA TTACGACGAA CTGCTGTGTG 
CTCGGTGTCG CTATTTTGGT TATCCAGAAG GATTATACCC TGCTCCAGAG CTTCGTCTAT 
GCAATATCCA CGGCTATCGG TTTCACCTTG GCAATGGTTA CTTTCGCAGG TATTCGAGAG 
CAACT CG ATA TGACCAATCT CCCCAAAGCT ATGAAGGGAA TACCTTCGGC ACT CTTGGCT 
GCCGGTATAT TGGCTATGGC TTTCATGGGC TTCAGCGGTA TCGCC 



(2) INFORflATION FOR SEQ ID NO: 51 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2628 base pairs 
(BJ TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(lit) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( ix ) FEATURE : 

(A) NAME/ KEY : mlsc_f eature 

(B) LOCATION 1...2628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

TACCGATCTT AT CGT GGG AT AGGGAGTGGG ACACACTCTC CTAACCTCAA AAACCGACTA 
AAAAGGATCG GAATAAGGAT ACCGAACAGA CACTATATCC ATATCAAGCC AATCAAACCA 
AAAAATAAAA TGAAACAACT AAACATTATC AGCTTCATCA TTGCTTTCCT AT T CT TAG G A 
ACGAGCGCAT CGGCTCAGCA ATCGGGCGGA TCCGTTACAG GTACCGTAGT GGACAAAAGC 
TCAAAAGAAC CTATCGCATA CGTACAAGTA TTCGTCAAAG GAACCACTCT CGGAACTTCC 
ACGGATGCAA ACGGAAACTA CTCGATCAAG GGAATCCCTT CGGGTAATCA AACTATCGTA 
GCCCGACTCA TGGGTTACTC CACTTGCGAA GAAAAAGTAC ATATAGAAAA GGGTGGTTCC 
CGCCACGTAG ACCTCTATCT GACCGAAGAG ATTCTCTCTC TCGATGGGGT AG TG G T AT CT 
GCCAATAGAA ACGAGACTTT CCGCCGTCAA GCACCCTCGT TGGTAACGGT ACTGTCGCCG 
GAACTTTTCC TCAAAACCAA CTCTACCAAC CTGAGTCAGG GACTTAAGTT CCAGCCCGGT 
CTGCGCGTGG AGGACAACTG TCAGAACTGC GGTTTCAACC AAGTTCGTAT CAATGGACTC 
GAAGGAGCCT ATTCGCAAAT TCTTATCGAC AGCCATCCCA TCTTCAGTTC GCTTGCCGGT 
GTCTATGGCT TGGAGCAGAT G CCTGCCAAT ATGATCGAAC GTGTAGAAGT AATTCGCGGT 
GGAGGTTCGG CTCTGTTCGC CTCTAATGCT GTGGGAGGCG TTATCAACGT AATTACGAAA 
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10 



15 



20 



25 



30 



GAACCGCTTC 
GGGTGGGGGA 
AAAGCCGGTG 
GACAACTTTA 
AAGACCGGTC 
GGTGGCGACA 
TATATCAATG 
AGTCTGTATG 
ACCGAAAATC 
GATGCTTTCA 
GGTATSTACC 
TACGGCCAAC 
ACTTTTAGTC 
GCACGTATCG 
ATTTTCAGTC 
TCATACAGCG 
GCCGGTGGTA 
AGCATCAGTG 
GGAGAAGCCT 
ACGAGCGATG 
AAGGTATATG 
CTCGGCGGTA 
AAAACAACGG 
GGCTATTTCG 
TTCACGGGCA 
ATAGCTCCGG 
GAAGGTCATG 
CACGACTTCC 
TTCAACAGCT 
GGTCCTATGC 



GCAATTCGGC 
GCTTCCAAAA 
TCATGGTATT 
CCGAACTACC 
TCTACAGCAA 
GACTGGACAA 
GCGGAAGTTT 
CTTCAGCACA 
TGCTGAACGG 
CGGCTCTTAC 
GTCATACCTT 
TCGATGACAG 
AGTACGACCA 
ACTATGTTCT 
CTAGAGCCAA 
AAGGATTCCG 
CTCCTATCAG 
CTTCTTTCGA 
TCTCCACCTT 
GCAAAGAATG 
GTGTGAATCT 
CATGGCAGAG 
GACAAGCCGA 
TTGCTACGGT 
AAATGGATGT 
ACGGATCGTT 
CCAAGCTCGT 
ACCTTGCTTC 
ATCAGAAAGA 
AGCCCAGAAG 



CGAGATCAGC 
TACGACCCAG 
CGGCCAACAC 
CAATCTGCGC 
AGCAACCCTC 
TCCTCCTTTC 
CAAATTCGAT 
AGACGTTCAG 
AGCAGTTCAG 
TTCCTACGGG 
CGGAGAAAAC 
AAGCGGCTAC 
GCTCGAATAT 
CCTCAATCAG 
CGTACGATAC 
CGCTCCTCAG 
CCGTGTCCTT 
TTATTACCAC 
TATCAGCAAT 
GATCATTCGT 
GGAGGGAAGA 
AAGCCGCTAC 
GATCTCTGTG 
ACGTCCTACC 
AGTACACGAA 
CGACTTTGAA 
CAAGACTCCG 
CACTATGACC 
CACGGACAAG 
GATTTTCGTC 



CATTCTACGA 
TTCAACGGTT 
AACTACCGTC 
AACCGCTCGC 
GAATATCACA 
GAAGCCCAGA 
CAGGGCTTCA 
CGTCGTAGCT 
AGTGGAAGCA 
ACTACCAAGG 
TGGGACTTTA 
AGACCGAGCA 
AAGACGGAGA 
GATGGCAAAC 
AATCCCAATA 
TATTTCGATG 
TCCCCCAATC 
AGAGCCGACG 
CAGTT CAAAC 
ACCATCTACA 
AT CGCCTACA 
GGAAGCATCT 
AAAGACTATG 
GAGCACTTCG 
GCCTATGAAG 
ATGAATGGTC 
GCCTTCGCCG 
TTGGAATTGA 
GGACCGGGTA 
GGTACAAAGA 



TGACCTTCGA 
CTATGCTGAC 
CCGGACAGGA 
TCGGTTTCCG 
GCATGCAGGA 
TAGCGGAATA 
GCGGTGGCAA 
ACTACGGGGC 
CCGAATCGGA 
GATTCGATTT 
CCGGCGGACT 
AAATAGATCA 
AGTTAAGTGC 
GCTATATCGA 
AGAATCTCAG 
AAGATCTGCA 
TGAAAGAAGA 
AATGGCAATT 
CATCCGATAA 
ACGAGAAGGA 
ACAAATCGTT 
ATACCGCTGT 
TACGCACTCC 
CCATCAATCT 
GCGATATTCC 
AGCAATTCAA 
ATATAGACCT 
ATGCCGGAAT 
GAGCTTCTAC 
TCAATTTC 



CCACGCGAAA 
GGAAGACCGC 
TATAGACGGC 
CT CAT ACT AT 
GTACCGTCGT 
TCTCCAGCAC 
GGATTTCTTC 
TGGCGACTAT 
CGAATACAAC 
GCAAGGAGGA 
CGAATATATC 
GAATACCTCT 
CCTTATCGGA 
TCCGCTCTTC 
CTTCCGACTC 
CGTAGAGTTG 
ACGTTCACGA 
CAATATCATG 
GGTCGAAACC 
TGGAGTATCG 
CGACCTCCAG 
GGAAGCGGAC 
GAATCTGTAC 
CTCCGGTACA 
CGCAGAACAC 
AGGTTTGGCC 
CAAGCTGAGC 
ACAGAACATA 
TTACGTATAC 



90O 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1360 
1920 
1980 
2040 
2100 
2160 
2220 
22B0 
2340 
2400 
2460 
2520 
2580 
2628 



35 



40 



45 



50 



(2) INFORMATION FOR SEQ. ID NO: 52 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2697 base pairs 
IB) TYPE: nucleic add 

(C) ST HANDEDNESS : double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(.lv) ANTI— SENSE; NO 

tvi) ORIGINAL SOURCE: 

iiK) ORGANISM: PORY PH YRGMOM AS GINGIVALIS 

(ix) FEATURE: 

(A) NAME /KEY : misc feature 
<B) LOCATION 1. . .2697 



55 



60 



65 



70 



75 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 



ATATTGAATC 
TGCGCCAGCA 
TATAGCCCTG 
GAGGGAATGA 
GATGAAGTCC 
GGCTACT*rCT 
AAAATCATTG 
TCTGAGAGAG 
AATAATGAAG 
GATGCCAGCA 
CTTATCTCGA 
AAGGCCCTTA 
CTTAGAAACC 
GAGAGCTACC 
GATGCTGAAA 
TATCTCAACA 
TCACAATATC 
AATCAGAGAC 
TATAACAATG 
TCTGTTTCGC 



ATTTGAGAAA 
TAGGGATCCT 
AAATATCCTA 
GGTCTTTCGA 
TGATTCCTGG 
CAAATGTGCG 
TCACTGAACG 
AAGATCTTGA 
ACAAGGTCAG 
TACGGATAAC 
TTGAGAAGAA 
GCAATCATAA 
ATATTCGCTC 
GTGAAGATTT 
TACT GACCG A 
TCGAAGAGGG 
CATCGGAGTA 
GATTGGCTAA 
GCTATATTTT 
TTGATATTCG 



GACTATGTAC 
GAAAGGACAG 
TGCCAAGCCT 
TGACTTTGTC 
AGATGCCATG 
AATCATCGCG 
TCCTCGCATC 
AATGAAAATC 
GCAAATCGTA 
GCAGGAACCG 
AAGCAAAACC 
GCTAAGAATG 
ATCTTTCTTG 
GGTCCGATTG 
CAGTGTCGTG 
GCAGAAGTAT 
TTTGGAACGA 
GCGTCTCAAT 
TGCGTGGGTC 
TATAGCGGAG 



AAAAAGATTA 
TCCTCGGATC 
ATTCATAAGA 
TTGCGCAATC 
TCTGCTGCCG 
GATAAATATG 
AGTAAGGTTA 
GGTCTTCGCG 
CAGAAGTATT 
GATCTTTCCA 
AAGGTGAATG 
GCGATGAAGA 
AAACTTTTTA 
ATAGAGAAGT 
AAGGCTCCTG 
TATATTAAGG 
GTGCTCGGAA 
GAAGATGAAG 
GATCCCGTGG 
GGGAAGCAGG 



TTGCCGTAGC 
TGACCCCTCA 
CCATAGCATC 
TTTCAGGCTT 
TGAATAGAAT 
TCGGCAATAA 
CTTTTAGCGG 
AGGGGATTCA 
TTAGTGAGAA 
AAGATGGCTT 
AAATTTATTT 
ACACCAATGC 
GTACTCATAA 
ATCAGGAATA 
ACGGCAAAAG 
ATGTCAACTT 
TAAAATCCGG 
ATGCT GTGGG 
AAACAAATGT 
CCAATATCAA 



AGCTCTCTTC 
GGATACTATA 
TATTGAGATC 
GGCTGTAGGT 
TATGCGTCAG 
AGTCTATCTG 
GGTAAAGAAG 
GATGACCAGA 
AGGTTATCGC 
TGTCAATGTG 
TTCCGGCAAC 
CAAATTCAGT 
GTTTGTGGAA 
TGGATATCGT 
AGTGGATATT 
TGTGGGCAAT 
AGATGTGTAC 
GAACCTGTAC 
AGTGGGGGAT 
TAAGGTGATC 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
1200 
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ATCAAAGGAA ATACTGTCGT GTACGAAGAC 
GGCCAGCTCT TTAGTCGCGA GGATATCATT 
CATTTCGATG CCGAAAAATC TATTCCCCGT 
GATATAGAGT ATGATTTGGT GCCGCGTAGC 
5 AGTCAGTCCG GACTTCTGTT CCGAGGAGCC 
TTGCTCCATC CCTCGATGTA TAAGAAAGGG 
TCACTGAGTG CTCAGACCAA TGGAAAGTAC 
CCATGGTTTG GGGGCAAGCG GCCGGATATG 
ACGGCGATTG ACTCCAAGTT CTACAATAGC 

10 AATAGCTACT ACAACAACTA TAATAGTTAT 
CTCTATACTC AGGCCAGCGA TCCGGATCGT 
TACGGTAAGC GTTTGACTTG GCCGGACAAT 
ACCTACTATA GACTGCGAAA TTGGAGCTAC 
GCTAATGATC TCAACTTGGA GCTGCGTCTC 

15 ACCAGAAGCG GATCGGATTT CATGGTTTCT 

GACAATCATG ACTATGCCAG CCAGAACCTC 
TATCACAAGT GGAAGTTTAG AGGACGAGTT 
AAATATACAC CGGTGCTCAT GAGTCGAGTG 
AATAAGAAAT CTCCTTT CGG TACTTTCTAT 

20 GGTGGCTACA TGAATGAGAC TATAGGTTTG 
AATAACTACG ACTATGCATA TGCTTATATG 
CTGTTTGAAA ACTCATTCAA TGCGTGGCTC 
CGCAGTATCG ACAATTATAA TCOCTTTAAC 
GTAACGTTAC CGATGGTCGG AATGCTCGGT 

25 GACAATTCTC TACAGCGAGG AGGAAGCAAT 



GTAGTACGCC GAGAGCTTTA CACAAAGCCC 12 00 

AACTCTATTC GTCTCATCAA TCAGCTTGGG 1320 

CCGATTCCCA ATCCCGAAAC AGGAACAGTG 1380 

AGTGACCAAT TGGAGCTTTC TGTCGGTTGG 14 40 

ATTAAGTTCA CGAACTTCTC TGTCGGCAAC 1500 

ATCATTCCGC AAGGGGATGG GCAAACACTA 1560 

TATCAGCAGT ATAGTGTCAC ATTTATGGAT 1620 

TTCAGCTTCA GTGCATTCTA TTCCAAGACT 1680 

AATGCCGGCA ACTACTATAA TGCCTACTAT IV 4 0 

TACAACGGTA TGTCGAACTA TACCGGCGAC 1800 

TCGCTTCAGA TGTTAGGTAC TTCGATCGGT 1860 

TGGTTCCAGA TTTATACTTC TCTGAACTAC 1920 

AATACCTTCC AAAATTTCCA TCATGGCTCG 1980 

TCTCGTACTT CCATCGATAA TCCTATTTAT 2040 

GTTGCTGCTA CTCTTCCTTA TTCTTTGTGG 2100 

AGCGTAAGCG ATCGTTACAG ATTTATCGAG 2160 

TTTACTCCAT TGCTCAATCC TGCTACGCAT 2220 

GAAGGAGCAG TTCTTGGTTC GTATAATTCC 2280 

ATGGGAGGTG ATGGTATGTC CAGCTATTAT 2340 

CCTGGTTATA AGAACGGATC TATTGCCGGT 2400 

CGGCTTACGA TGGAACTACG TTTCCCGATT 24 60 

TTAGCTTTTG CCGAAGCAGG CAATGCGTGG 2520 

CTGAAGCGAT CGGCCGGTGT AGGATTGCGT 2580 

ATCCATTGGG GATATGGCTT TGACCGTCCG 2640 

GTCCACTTTG TGCTCGGACA GGAGTTC 2697 



(2) INFORMATION FOR SEQ ID NO: 53 

30 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH : 531 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DMA (genomic] 

(til) HY POT1 IETICAL : HO 

40 (iv) ANTI -SENSE: NO 

{vi J ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

45 (ix) FEATURE: 

(A) NAME /KEY : miscj eaturs 

(B) LOCATION 1...531 



50 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 53 



CAAATAAACA TGAATGGCGA TATGAAACGG TTTTTGATTT TGATCGGCTT TGCACTGGCG 60 

GTAGCTTTCT CCGGTTTTTC CCAAAAGTTC GCTTTGGTAG ATATGGAATA TATCCTCAGG 120 

AATATTCCTG ACTATGAGAT GATGAACGAA CAGCTGGAAC AGGTGTCCAA GAAATGGCAA 180 

AATGAAATCG AAGCTCTCGA AAATGAAGCC CAATCTATGT ATAAGAAGTA TCAGAGCGAT 240 

55 CTCGTATTCT TGTCTGCTGC ACAGAAGAAA ACCCAAGAAG AGGCTATCGT AAAGAAAGAG 300 

CAGCAAGCAT CCGAGCTCAA GCGGAAGTAT TTCGGCCCGG AGGGGGAGCT GTATAAGAAA 360 

CGCTCCGATC TGATGAAGCC TATTCAGGAT GAGATTTGGA ATGCTATCAA AGAGATTGCC 420 

AAGCGTAACA ACTATCAGAT GGTGCTTGAT AGAGGTACGT CCGGAATTAT CTTTGCCAGT 4 80 

CCGTCTATTG ACATTAGCGA CCTTGTACTG AGCAAGATGG GCTTTAGCAA G 531 



12) INFORMATION FOR SEQ ID NO: 54 



<i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

70 (ix) MOLECULE TYPE: DNA (genomic) 

(lit J HYPOTHETICAL: NO 

liv) ANTI-SENSE: NO 

75 
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<Vi» ORIGINAL. SOURCE: 

(A) ORGAHISM: PORY PHYROHON AS GINGIVALIS 

Ux» E-LATURE: 
5 (A) NAME/ KEY: misc_teature 

{ D> LOCATION I... 510 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S4 

10 CGAATAAATA AACAACACGA AATGAAGAAA TTTTTTCTCA TGCTTCTGAT GGCTCTTCCT SO 

TTGAGCCTCT TGGCACAAAA GGTGGCAGTG GTAAACACTG AGGAGATCAT TTCCAAAATG 120 

CCGGAACAAG TAGCTGCTAC CAAACAGCTC AACGAATTGG C CG AAA A GT A TCGCCTTGAT 100 

CTCAAGAGTA TGGACGATGA GTTTGCCAAA AAGACAGAAG AATTTGTAAA GGAAAAAGAC 24 0 

TCTCTACTGG AGAACATCCG CAATCGTCGT CAGCAGGAAC TTCAGGATAT TCAAACTCGT 300 

15 TATCAGCAGT CATACCAAAC GATGCAGGAG GATTTGCAAA AGCGCCAACA ACAGCTTTTT 360 

GCTCCTATCC AACAAAAGGT GGCTGATGCC ATCAAGAAAG TGGGTGACGA AGAAAACTGT 420 

GCCTACATCA TGGAGGCCGG TATGATGCTT TACACCGGAG CTACTGCTAT TGACTTGACC 4 80 

GCAAAGGTAA AAGCGAAACT CCGAATCAAG Si 0 



20 



30 



[2) INFORilATION FOR SEQ ID NO: 5L 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2484 base pairs 
25 (B) TYPE: nucleic acid 

<C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 



(11 ) MOLECULE TYPE: DNA (genomic] 

<iii> HYPOTHETICAL: NO 

(iv) AHTI- SENSE: NO 

35 tvi) ORIGINAL SOURCE: 

(A» ORGANISM: PORYPHYROMONAS GINGIVALIS 

tlx) FEATURE: 

fA) NAME /KEY : mlsc_fsature 
40 (B) LOCATION 1...2484 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

ATCATGAAGG AAGCTAT TCC CCGAAAGAAC AAGTATATAA AGCTCAACGG TATATACAGA 60 

45 TTGTCATTCA TTCTGCTATG CTGCCTGCTA TGCTCTCAGG CAGCTATGGC ACAAGGCGTC 120 

AGGGTATCGG GCTATGTGCT CGACCGTGGG GAAAAGCCGA TCCCGTTCGC CGGAGTCAAA 180 

GTGCGTGGTA CGGGGACAGG CGCAACGACG AATCT GAAAG GATACTACGA GTTTCGGATG 24 0 

AAGGCCACGA CGG ACAG CAT CACGATCGAG TTCAGCTCCA TGGGGTACCA AGGGGTAAGT 300 

CGCACCTTTC CGTCTCTGAC CAAGGACACT CGGCTGAATG TTCGTTTGGC AGAGGCCGAG 360 

50 ATGGAGCTTT CGAGCGTGAC GGTACAGGCC ACAAAACGCA GACTCAACAC GATGGAGCGC 420 

GTCAATACCC GAGACCTTCG TGTCAATGCA GGGCCTACGG GAGGGGTGGA ATCGCTCATC 4 80 

AGTACCTACG CAGGAGTAAC GCAGAACAAT GAACTAAGCT CGCAATACTC GGTTCGCGGA 54 0 

GGAAGCTACG ATGAGAATAT GGT CTATGT A AACGGAGTGG AGGTTTATCG CCCGCTGCTG 600 

GTTCGCTCTG CACAGCAGGA AGGTCTGAGC TTCGTCAATC CGGATCTGAC ACAATCCGTA 660 

55 CAGTTCTCCG CCGGAGGGTT CACGGCCGAC TAT GGCGACA AGATGTCCTC CGTACTGGAT 720 

ATTCGCTACA AGCAACCGCA GGAGAAGGAA GGAGCGGTAC TCCTCGGGAT GCTACAATCG 780 

AGTGCCTACT ATGGCAGCAG TGCCGGAGCC TTCAGCCAAA TCACGGGTGT ACGCTACAAG 840 

AGTGCCAAAT CGCTCTTGGG CACTACGGAC ACGAAAGCCG AATACGATCC GATCTATGCG 900 

GACGGACAGA CATTCATGAC GTACCGTTTC AGCCCCAAGC TGTCGGTTAG TTTCCTCGGC 960 

60 AATATTTCGC AAACTCGCTA CAAGTTTGTC CCTCAGACCC GTGAGACGAG CTTCGGTACA 1020 

CTGAGCGATG CCAAAAAGTT GAAGATCTTT TTCGACGGTC AGGAACAAGA TCGTTTCCTG 1080 

ACCTACTTCG GTGCCTTCAG CATGAACTTC GTGCCGGACG ACAAACAGCG GCATACGGTT 1140 

ACGCTTT CGG CCTTCAACAG TAACGAACGG GAGACCTACG ATATTCAGGG AGAATACTTT 1200 

CTGAACGATG TGCAGCTGGG GGCGGACGGA ACTGCTTCGA TGGCTT CGGG CTCAGAGAAC 1260 

65 TCCAACGGCT TGGGCATCGG GCG CAATCAC GAGCATGCGC GCAACAGGCT GAGCTACCGC 1320 

GTGCTGAACA TGGGTTACAG AGGGGAGATG AAGCTGAACG AGAAGCATCG CCTGCAAGCC 1380 

GGCGTATCGG CACAGAT GG A GAAAATAGCC GACCATATCA GCGAATGGGA ACGGAGGGAT 14 40 

TCGGTAGGAT ACAACCTACC TC ACT CGG AG ACCGTATTGC TGATGTACAA TAACCTATAT 1500 

GCCGATACGC AGATGAGGGG AACGCGCTTG TCGGCATTCG TACAGGATCG ATTCAACTTC 1560 

70 . AGCATGGGAG GAGGTACATT TTCTCTCATT CCGGGTATCA GAG CYTCGTG GTGGAGCTTC 1620 

AACAAGGAGT TGCTCGTCAG CCCACGTATC AGCGTGGGTT ATTCTCCCGA AAGCAACCCG 1680 

GCTTTGGTAC TGCGTGCAGC CGCCGGACTT TATTATCAGG CACCGTTTTA ' CAAAGAGCTA 17 4 0 

AGGCAGACGC ATAAGGATGC CGAAGGCAAT AACGTGGTTG TCCTCAACGA GAAGATCCGC 1800 

TCTCAGGGAG CTTTTCACAT TCTCGCAGGA GCAGACTATA CCTTCGAAAT GGGGGGGCGA 1860 

75 AAATACAAGT TTACGGCAGA G GCTTACT AC AAGAGCCTGT TCAACATCAA CCCGTATATA 1920 
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ATAGAGAACG TGAAGATCCG CTATCTGGGC G AAAAC AT CG GTTCGGGTTA TGCTGCGGGT I960 

ATCGATCTCA AGCTCTTCGG CGAACTGGTA CCCGGAGTGG ATTCGTGGCT GACGGCTTCC 204 0 

ATTATAAAAG CCCGTCAGAA ACTGGATGGC TACGGTTCTT TACCACTGAT GAACGCACCC 2100 

ACTTACAATT TCTCCTTCTT CCTTCAGGAG TACGTGCCGG GCAATAAACG CATCACAGCC 2160 

5 ACCCTGCGGG CTGCACTAAG CGGAGGATTC CCCCAGCTCA ATCCGAGCAA AGGGCTTAGC 2220 

TCGCCGGCCT TTACCGCACC GGCCTATAAG CGTGTCGATC TGGGGGTAAT GTACAAATGC 22 80 

CTCGACCCGG ATGACTCCTT TGCCGGCCGA AG CAAATG GC TAATGGGAGT AAAAGGGGCC 234 0 

TACATAGGGG CTGACCTCTT CAATCTGTTC GACATGACCA ACGTCAATTC TTACTACTGG 24 00 

GTGTCGGATG CCTACCAACA GCAATACGCC GTACCGAACT ACCTGACACG CCGCCAATTC 24 60 

10 AACCTGCGTC T CCTCGTCG A ATTC 2 4 84 



(2) I N FORMAT I ON FOR SEQ ID NO: 56 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: DNA f genomic) 

(liii HYPOTHETICAL: NO 

25 Hv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMON AS GIUGIVALIS 

30 (ix) FEATURE: 

(A) NAME/ KEY: miscf eature 

(B) LOCATION 1...2037 



35 



70 



75 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



CCTATCCCTT TTCTTCTTAA TATGTACAGC GGACATCATA AAATCCATTA TCCTTTTCTT 60 

ATCCTGTTGG TATGCCTTGC TTTTGCTGCC TGCAAGAGCG TGAAGTTGAA AGATGCGGAG 120 

AAGGCACATG ATCGCCAAGA GTATACCAAG GCTGCCGATA TGTACAATAC ATTATACAGG 180 

CGTACCCGAC GAAAGCAGGT GGAGATGAAA GCTTATACGG CTTTCCGATC CGGTGAAAAC 240 

40 TATCGTGCCG CCGGCAGACA AGCCAAAGCT TTGCGTGGCT ATCTGAATGC CAGACGCTAC 300 

GGGTATCCGG ATTCTGTGGT ACTGCTCCGT TTGGCACAGA CTTATCAGCA AGGAGGTAAC 360 

TATAAGGAAG CCGAG GT ACT CTTCCGTGGA TATCTGGAAG CTTATCCGAA AAGTTATTTT 420 

GCAGCTATCG GTTTGGAGGG GTGTCTCTTT GCCCGCCAGC AAAAGGAATA TCCTACACGT 4 80 

TACCGGATAC GGCGAGCTGC CGAGTGGAAT TCGGCACGGG GCGACTTCGG CCCGGCCTAT 54 0 

45 GCACCCGATG CTTCCGCTCT CTATTTCACA TCGAGCAGAA GCAAAGACGA CGGTTTGGAT 600 

AATAGCAGCA TAACGGGACT GAAACCCAAC GACATTTATA TCATCAAACG AGATGCACAA 660 

GGACGATGGG GACGTCCCGA TAGCGTGTCC GGAGGAATCA ACACTCCATG GGATGAAGGC 720 

GTCCCAACGA TCACGCCCGA TGGTAGTACC ATATATTATA CGTTGGCGCA GCAAGGAGCC 780 

GATTACGACC GTACGGTACA GATCTATTCC GCCGCTCGGA GCGGAGAAGG CGGTT GGAGC 840 

50 AACGGTTCGC TCGTGGACAT TATGCGCGAT TCGCTCCGTA TGGCTGCTCA TCCCTCTATG 900 

TCGGCATCCG GCGATTACCT GTATTTCGTC AGCAATATAG GCGGTAGCTA TGGCGGCAAG 960 

GATATTTATC GTGTCAAGGT GTCGGATCGT TCTTATGGTT CACCGGAGAA TTTGGGGCCT 1020 

GATATCAATA CGCCGG GGG A CGAAATGTTT CCCTTCATAG ATGGGGATAG TACCCTTTTC 1080 

TTCGCTTCGG ACGGACACGC CGGTCTGGGA G GACTGG AT A TTTTCAAAGC CACGCTGGAC 114 0 

55 TCTACCGGCC AATGGCATGT AGTCAATATG GGACAACCGG TCAATTCCTC TGCCGATGAT 1200 

TTCGGCTTGG CTGTGGAGCC TAAAGGCAAA AACAAAGAAG AAGCTTTGCC GGACAACGGA 1260 

GTCAAAGGTG TATTTTGTTC CAACCGAGGC GATGCACGCG GATGGCCGCA CCTCTTCCAT 1320 

TTCGAACTGC CGGCTATCTA CACCGAGATT CAAGGTTATG TGATGGACAG AGAAGAAAAT 1380 

CCCATAGCCG GAGCCACTGT CAGGATCGTA GGCGAACGCG GCCCCGTAGG ACAGGGATTC 14 4 0 

60 GTGACTACTC GTGACGATGG CTCCTATAAG ATGAGCGTGC AGGGCGATAC TCGCTATGTA 1500 

ATGCTTGCCG GAGCAT CGGG TTATTTGAAT CAGTACGTAG AACTCAAGAC CGATACCCCC 1560 

AAGCAGAGTG AGACCTACTA TGTGGACTTT TTCCTTGCAT CGCGTGAGAA AGCCGAGGGC 1620 

TTGCAAAATA TTTTCTATGA TTTCGATAAA GCTACTCTTC GCCCCGAAAG CATGAAGAGC 1680 

TTGGACGAAC TGATTCGTAT CCTCACGGAC AATCCGGATA TTCGGATCGA ATTGGGTTCG 174 0 

05 CATGCCGACA GGAAAGGCCC CGATGCTTAC AACCTCGGAC TATCTGACCG CAGAGCCAAA 1000 

TCCGTGGTGG ATTACCTCAC GAGTCGTGGC ATAGCGGCCG ACAGGCTTAC GTGGAAAGGC 18 60 

TACGGTAAGT CTGTCCCCAA GACGGTGACA GCCAAAATTG CCGAACGGCA CGATTTCCTG 1920 

AAGGAAGGCG ATGTGCTCAC CGAGGAATTC GTAGCACCTT TGACCGAGGA GCAGCAGTCA 198 0 

GTCTGCGACC AACTGAACCG TCGTACCGAG TTCCGTGTGA TCGAAGAAGA GTTGCGT 2037 



12) INFORMATION FOR SEQ ID HO: 57 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2316 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

Cii) HOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI -SENSE; MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(lx) FEATURE: 
15 (A) MANE/KEY: nusc feature 

(B) LOCATION 1...2316 



50 



60 



(2) INFORMATION FOR SEQ ID NO: 58 

(1) SEQUENCE CHARACTERISTICS: 



Rt - 1A) LENGTH: 1452 base pairs 

DO (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

7Q < ll > MOLECULE TYPE; DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : HO 

7*> (Vl) ORIGINAL SOURCE: 



PCT/AU98/01023 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57 

20 CCTGCACAGC CGAAAACATA TTGCATACGA TACTTTCGGA GAGAGGTATC TCCfArrArr 
CAAACAGAAA GGACATTGTA TGCGGACAAG ATGCGCCGGC ATATCGTCAA GTGCTCTCCC 

SSSSEE ^ CTCC *»»*™** SSSSSS 

TTATGAAAAA GTTTTTCTTC GCGCTACTAT CGATTGGTAT TTCAGrrrar 

25 ™^ GACA ™™ ™ 3oo 

S££SS5 CCTCTGAAAA AGATACCGGC CAACATGGAA J60 
i.iijiicic«t CGCGCAACAT CAAGCAGTCC GGCTTTAACA ACATGACCGA rMTfiviitr 

?rSS5£ A ? TT CGCTCGATGT CATACAATAC CCCGGCTTTA SSsSSSS CGGTATCC^C 4 80 

GGTTTCAAGC CCTCCGGCAA GTATGTAACC GTATTGGTAA ACGGCATCCC TGCKGGAAPi £!n 

30 SSSES ™ac™ SES^ atc ^<™* ™S «oo° 

TCCAArr-nna I^TS^S^ C** 1 '^**' 3 GGCGGTGTGG TGAACAT CAT CACCCACAAA 660 

GGAT^ATTCA £££££ r™^" CTCTTCGGC G GTAGCTACCA GACCATGGCC "S 

ScTa£ SSSSESS £™« CCTGAGCAAA ' 212 

35 SSSSSS 

AATCTGTATC AAAACGTATT CCT CGGAGAT GCGATCCCCG TAGGAGGATC TATATGGGG^ , 

S£22£5£ ^ CTGAAT c * tt <™ga cc^tIcga £££££ lolo 

t^Sc^S ^^ CTCC ccct acttca acatagagaa ATCGGAGAAC Il40 

ATI Ii™™ CCGATCCCAC CGGTTTCATC AACTACAAAA G CG ACT ACT A CACCTATKTT 7?nn 

SSSi^S ^f^"" 3 ** TTCCTTTGGA GGACAAAATA TCGTACtSS T^CGACA^C AH 

^ A ^S? TGA CGATGGAGTC AGAAAGATTC GAGCAGGCAG GAGTGAATAC AAAGCCATAC Win 

^^^ GAT ATGCCACGAA CAATATCGGT TTGTTCGGAC ATOCCwS?? C^cSSctS ^380 

AACGATGCTC TATCGATATC TGCCGGTGCA CGTGCCGACT TCATGTTCTT TCmSSIa \aak 

45 ~ — ~ ~ = i= i 

a?aSc SSSSSS acc^S s^sss ss^s 

?S?g?cSg CC^ CGAC G ™ CCT ™ ^TA™ SESSE 7^ 

S5£S££ ^gagSg £££££ SEES?? J^SSSS SSSS; if 00 

£££££ SSSS^ c^if G ^SsS J^EEE ssse 2 6 S 

ESSES iSSSSS SSSSSSS ESSE: SEES L 9 !o° 

£1^*™^ GGAraAGCA AAACTGGTAT GCTTACTACC CCGAAGTTCG C-«GA^C 2?00 

£S^£ SSSES SSSSS SSSSSE 



60 
120 
180 
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{A) ORG AM ISM: PORY PHYROMONAS GINGIVALIS 

(XX) FEATURE: 

(A) NAME/ KEY : miscf eature 
5 (B) LOCATION 1 1452 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

GGCCGGGCCT CTTCCCCTTA CAGGCAAATG GATGGCATAC TAAACGATGA ATATAGACAA 60 

1U GCATCTATGA ACAGGTTTTC AAATCATTGG CCCTGCATCC TCGTGGGGTT TCTACTCTGG 120 

TTTGTATCGG CGAGTCGGAC TGTGGCACAA AACGCCTCCG AAACGACGGT ATCGTACGAT ISO 

ACGGATACCG CCGTACTCTC CGAAGCCGAT GTGCTTCGGA TCGCTCTTAG TGAGAATGCC 24 0 

ACAGTGAAAG TGGCCGATAT GGATGTGCGC AAACAGGAAT ATGCACGTAG GG CAGCACGT 300 

GCCGATCTCT TCCCGAAAGT AGACCTCAAT GGCGTTTACA GCCATACGCT AAAGAAGCAG 360 

15 GTCTTATATA TAGATATGCC CCGTTTCAGC AGTAGCGAAG GTATCGAAAT GGGGCGTACA 4 20 

CACAATACGC AAGGAGGGGT GAACGTCTCC ATGCCATTGG TGTCGGCACA GCTTTGGAAA 4 80 

AGCATTGCCA TGACCGGAGA ACAG CTCG AT CTGGCTCTGG AGAAAGCTCG CAGCTCCCGA 54 0 

ATCGATTTGG TGGCAGAGGT GAAGAAGGCT TACCTCAGTG TATTGTTGGC CGACGACTCT 600 

TATGGCGTAT TCAAGCGCAG CTATGACAAT GCTCTGGCCA ATTATAAGAA CAT AT CCGAC 660 

AAGTTCGATC GTGGACTTGT GGCCGAGTAT GATAAGATTC GAGCCAATGY ACAGGTACGC 720 

AACATCGACC CTAACCTCTT GCAAGCGCAG AACTCCGTAG CCCTTGCTCT CTGGCAGCTC 780 

AAGGTCCTGA TGAGCATGGA AGTGGAAACT CCGATCAGAC TCTCCGGTTC ATTGT CCGAC 84 0 

TATAAAGAAC AAGTCTATAC CGGCTATTTT GCCGCCGATA CGCTTATTTC CAACAACTCC 900 

TCCCTGCGTC AGCTCGATAT ACAGCGTCGT CTGGCTGTCA GTGCAGACAA GCTGAACAAG 96n 

ZD TACAGCTTCC TGCCTACACT CAATCTGGGA GGGCAGTACA CCTATTCGCT CAACAGCAAC 10^0 

GACATCAAAT TCTGGGGCGA GGGACAACGC TGGACGCCTT TCTCCACCAT ATCG r "l'CAGC 1080 

CTGTACATTC CTATATTCAA TGGAGGCAAA CGTCTGTACA ACGTGAAGGA AAGT GCTTTA 114 0 

TCGATCCGTC AGATCGATCT GCAACGACGC CACATAGAGC AATCCATCCG AATGGGAATC 1200 

AAGAACCAAA ATGACCGTCT GCGTACCTGT ATGCAGAGAT TTGTGGCCTC GGAAGAGGCT J 260 

GTCCGAAGTG CAGAAAAGGG CTATCAGATA GCAGAGAAAC GCTATCAGAC AGGCGAAGGC 1320 

ACTCTCGTCG ACCTCAACGA TGCCGATGTG GCTCTTTTGC AGGCTCGACT CAATTATAAT 1380 

CAGGCCATAT TCGACTTTAT GACCGCAAAG GCCGAATTGG ACAAGATGAA CGGCATGGGG 1440 
ATTCCCGAAC AA 



30 



35 



45 



60 



65 



1452 



(2) INFORMATION FOR SEQ ID MO: 59 



(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 1620 base pairs 
40 (B) TYPE: nucleic acid 

(C} STRAW DEDNESS : double 
(Ti) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA {genomic J 
(iil) HYPOTHETICAL: NO 
(iv) AHTI -SENSE: NO 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PCRY PHYROMOHAS GINGIVALIS 

(ix) FEATURE : 

(A) NAME/ KEY : raiec_fe^turs 
55 (B) LOCATION 1...1620 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 59 



TTTCATAACT TTGACTTCCT AAACGCTATA AAATT GTTTT CGATGGCAAA TAATACTCTT 60 

TTGGCGAAGA CTCGACGTTA TGTCTGCCTT GTCGGTTTCT GTTGGCTCAT GGCGATGATG 120 

CACCTCTCTG GGCAGGAAGT CACTATGTGG GGGGACAGCC ATGGAGTGGC GCCGAACCAA 180 

GTGCGCCGAA CGCTGGTGAA GGTAGCCTTA AGTGAATCCC TTCCTCCGGG TGCAAAACAG 24 0 

ATTCGTATCG GATTCTCTCT TCCGAAAGAA ACGGAGGAAA AAGTCACCGC CCTATATCTC 300 

CTTGTGAGTG ATTCTTTAGC GGTGCGCGAC T TGCCGG ACT ACAAAGGGCG AGTCTCTTAC 360 

GATAGCTTCC CGATCTCAAA GGAAGATCG'i* ACCACAGCCC TTTCTGCGGA TTCGGTAGCC 420 

CGACGCCGCT TCTTTTATTT GGCTGCGGAT ATAGGGCCTG TTGCTTCTTT TTCCCGATCC 480 

GATACGCTGA CTGCCCGTGT GGAAGAGGTG GCTGTCGATG GCCGCCCTTT GCCGTTGAAA 54 0 

GAGCTGTCGC CTGCCTCCCG TCGTCTGTAT AGGGGGTATG AGGCCCT CTT TGTACCCGGT 600 

GATGGCGGAT CGCGGAACTA TCGTATCCCG GCCATTTTGA AAACGGCTAA TGGAACACTC 660 

/U ATAGCGATGG CCGACAGACG AAAATATAAT CAGACGGATC TGCCGGAGGA TATAGATATA 720 

GTCATGCGGC GCAGTACGGA CGGAGGGAAA TCGTGGAGCG ATCCCAGGAT TATCGTACAG 7 80 

GGAGAGGGGC GCAAT CAT GG CTTTGGCGAT GTACCCCTGG TGCAAACCCA AGCAGGAAAG 84 0 

CTCCTGATGA TCTTTGTCGG TGGAGTAGGC CTGTGGCAGT CTACCCCCGA TCGTCCTCAG 900 

CGCACTTATA TATCGGAAAG TCGGGACGAA GGACTGACTT GGTCGCCTCC TCGGGATATA 960 

/b ACCCATTTCA TCTTCGGCAA GGATTGTGCC GATCCGGGAC GCAGTCGCTG GTTGGCCTCC 1020 
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10 



TTTTGTGCTT 
ATCCGCGAAT 
GGCGATACAT 
TTGATGCCCG 
CAGCGTTTCT 
GAGGGCATCC 
GATCAAGTGC 
CTCTTCGATC 
GCCTACTCGG 
GATGAGATCT 



CGGGACAAGG 
CAGGGCAGGA 
GGCAGCTTTC 
ATGGCAGGGT 
TCGCTCTCTC 
ATGACCCCGG 
TGCACTCCCT 
ATGTCTCCGG 
ATATGACTCT 
CATTGGTTTT 



GCTTGTGCTG 
GTACGTCCTG 
CGACTGTGCA 
ACT GAT GAG C 
CTCCGACGAT 
CTGTAATGGA 
GCCTCTCGGC 
CCGCTGGTCC 
GCTGGCGGAT 
CATTCGGTTC 



CCATCCCGTC 
AACAACTATG 
TACCGCCGTG 
ATACGCAATC 
GGCCTTACTT 
GCTATGCTTC 
CCGGATGGGC 
GCTCCCGTTG 
GGAACGATCG 
GTCCTTGACG 



GTATCACGTT 
TCCTCTATAG 
GCGATGAGGC 
AGGGACGGCA 
GGGAGAGAGC 
AAGTGAAAAG 
GTCGCGATGG 
TTGTCAATTC 
GTTATTTCGT 
ATCTCTTCGA 



TGTGGCTGCC 
CGACGAT GAG 
AAAGCTTTCA 
GGAGAGCCGA 
CAAGCAGTTC 
GAACGGAAGG 
AGCTGTCTAT 
AGGATCGACT 
CGAAGAGGGC 
TGTCCGGCAA 



1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1 560 
1620 



15 



20 



25 



12} INFORMATION FOR SEQ ID HO: 60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 9 base pairs 
[BJ TYPE: nucleic acid 
fC) STRAW DEDNESS: double 
(D) TOPOLOGY: circular 

fii) MOLECULE TYPE: DNA {genomic) 

(ail) HYPOTHETICAL: NO 

liv) ANTI-SEMSE: IIO 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y F 1 IY ROHON AS GINGIVALS S 

t ix ) FEATURE : 

(A) NAME/KEY: mi scf eature 

(B) LOCATION 1. . .879 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 



AAGTCTCCGA 
GTCGGTGGGA 
ATAGAATCAA 
GGTTTGGCi'T 
TCTCAGCCTT 
GTTTGGACTG 
AAGGAAAAAC 
TATATCTTCT 
GAACAAAACG 
CAGGACATCG 
GTAGAGATTA 
GAGATGCTGC 
AGCGGTTCCT 
CTCAACTCGA 
ACGGCAAGCA 



GCGAT TCGGC 
AACGCATTAC 
TACAGTGCTT 
TCGTATTGGG 
CGGTGACTGT 
CTTCGTACCA 
AGATATTGGT 
CAAGCGTAGC 
TCAGGACCTT 
AACATGTGGA 
CCTCCGACCG 
GCAATGCCTC 
CCGTGGGTAA 
ACGAAGATTA 
TAACGGTTAA 



TTTGCCGTCG 
TTTGCTTATC 
TGTGATGAAA 
CCTTTATGCT 
GACCGGTATG 
GCTCCAG AT G 
AGCAGACTAT 
CATCTCTAAA 
TGCCGGGTAT 
GAAAATATCT 
TCCGGCCTAT 
CGAAGACGCT 
GATGCTATCT 
TAGCTGGGGA 
GGCTTCTTTC 



GTATGGAGCG 
CTGTATTCTA 
AAAGAAAAAC 
CTTGGCCGCA 
GCCGAGCGTA 
ATGGATCTCG 
TTGAAAAACA 
GAATACAACT 
CTGCTCAGCC 
CGCGATATAA 
TACTACACCA 
TTCAATCGTG 
TCTTCGATGG 
GGTTCGTTCA 
GCTTTGAAG 



CGAATACAAT 
TGGCGATAAG 
TTTGGATTGC 
GTGTCGCTCA 
ATTTCAAATC 
AATCGGCCTA 
AGCAGCTGCC 
ACTATTACGA 
AGACAGTTAC 
CGGAGCTGAT 
AGCTCAATGA 
CTTCGGTCAT 
GCGTGTTCCA 
ATACGTCTTC 



ATCTGGACTC 
AAGTAGTAAT 
GAT CGTCGCC 
GCTACGCCGC 
CGATCTGATC 
CAAGGCTTTG 
CGATTCGTCT 
TCCTCGGCAG 
GGTGACCTCA 
CAATCAGGGG 
TCTGAAGGTC 
TGCGGAGGGG 
GATAGTGGGG 
CAAGATGAAG 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
040 
87 9 



55 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 61 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDED! 1 ESS : double 
(0) TOPOIXKSY: circular 

Ui) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE; NO 



70 



75 



(Vi) ORIGINAL SOURCE; 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: tcixsc_featur^ 

(B) LOCATION 1. . .84 0 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 61 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



41 / 490 



GGGAAACTCC AAATGAAAAA AACAATTGCA ATTATCGCCT CAGCCCTCTT GGCTTTAGGA 60 

GCCGTCGGCT GTAAGAAAAA TGCTGACACT ACCGCTGTCA GTGAAAAGGA TAGCATAGCC 120 

TTGTCGATGG GTATTTTGTA CGGACAGGAT TTTGCCAATC AGTTCGAAAT GTCCCGCTTG 180 

CAAGGCCAGC CGATTGATTC GGTAGCTTTC TTGGACGGTT TCAAATATGG TAT CGATACG 240 

5 ACGCGCTTCT CGTACAATCT GGGAGCCATC TATGCTTCCA ' ATATAGCTCG TCAGCTGGCT 300 

CATGATTCCA TCGATATCGA CAAGTTCTAT GCAGCCATGC GTGCGGCTCT TCTTAAAGAC 360 

ACCGTATCTA TCGCCATGAA GCCTGCAGAT GCACAGGCTT TCATGCAACG AAT CCAAGCC 4 20 

AAAAAGCAGC GAGAAAACAA TATGAAGCAG TTTGGCCAGA ACATCGAAAA GGGTAATGAA 4 BO 

TACATCGATA CCTTTAAAAA AGAAGATGCT GTAACTGTTA CGACAACTGG TCTGGCATAC 54 0 

10 AAGACTCTTC AGGAAGGTAC GGGAGCT ACT CCCTCTT TGG CCGATACTGT ACGTGTCAAG 600 

TATGTGGGTA CTCTGGTCGA TGGTAAAGAG TTCGACAAAA ACGAAGAAGG AATCGAATTT 660 

GCCGTTACCG GTGTGATTAA AGGCTGGACG GAGATGCTCC AACTCATGAA GGTCGGTCAG 720 

AAAGTTCGCG TGGTAATCCC ACAGGAGCTG GCTTATGGGG AGACCGGCAA CTATACCATC 7 60 

GAACCGTTCT CTACCCTGAC GTTCGAGATG GAACTTATCG GGATCAAGCC CGGGAAAAAG 840 



15 



30 



<2) IN FORI IAT IOt J FOR SEQ ID HO: 62 



|i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2409 base pairs 

(8) TYPE: nucleic acid 
tC) STRAHDEDNF.SS: double 
(D) TOPOLOGY : circular 

25 (ii) IIOLECULE TYPF. : DHA (genomic) 

(ill) HYPOTHETICAL; HO 

liv) ANTI-SENSE: HO 



(VI ) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GIHGIVA1.IS 



t ix > FEATURE : 
35 (A) NAME/ KEY: misc_feature 

(B) LOCATION 1...2409 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

40 TGTCGAAAGC AGAAGCTCTC AAACCCGAAG AGGAGCCGGT ATCCGTGCAG ACGGATATCA 60 

TTCCGACAAA GCGATAAGAA TCCGATGAAA GTATTACGGC AAGTATTCCT CCCCATCCTT 120 

TTTGT CCTAC TGACAGGTGC CTGCTCCACC ACAAAGAATC TGCCGGAAGG CGAACAGCTG 100 

TATATCGGAA TGGGCAAGAC ACAGATACTC CGGCAGGACA AGAGCCACGC CGGCCAACAG 24 0 

GCTCTGACCG AAGTGGAGAG TACACTGAAA GTTACACCCA ATGGAGCTAT TTTCGGCAGT 300 

45 GCAAGTGCCT CCTTACCCAA GATACCATTC GGGCTATGGC TATACAACAG CTTCGTGGGG 360 

GATTCCACTG TCATTTCGAA ATGGATATTC GACAAGTTTG CAG CCAAGCC GGTTTTCATC 420 

AGTCAGGTCA AATCCGATAG CCGGGCTAAG GTGGCGACGA ACATCCTCCG CGAACACGGG 4 80 

TACTTCGATG CTAAAGTAAA AAGCAGTGTG ACCACTCTGA AAAAGGACTC GCTCAAAGCC 54 0 

AAAATCTCCT ATACGGTGGA TATGGCCTCT CCTTATCATT ACGACAGCAT CATTCCCTTA 600 

50 CCGATCAGCA CTTTCCCCGA CAGCATTCTG GCTTACAGGC AGACTCCGTC TTTGATCAGG 660 

AAAGGAGACC AGTTCAATTT GGCAAAGCTG CACGAAGAGC GTCAGACCAT CAGTGCCCTG 720 

CTGAGAGACA ATGGTTACTA CTACTTCCGC CCACAGGATA TTATCTACGA AGCCGATACC 7 80 

CTCCTCGTAA GAGGTGCCGT ATGCCTGCGA GCCAAGCTCT CGGAAGATAC TCCACCCCAA 84 0 

GCCATGCGCC CGTGGAGGAT AGGGAAACGG ACAGCAGTCC TGCTCGGAAT GAACGGAGAA 900 

55 AGCCCGACAG ACTCGCTCGA AGTGGAGGAT ATGAAAGTCC TTTACTATCG TAAAATGCCG 960 

GTTCGCCCCA AGATTTTGGC CAAACGCTTT CGTTTCTTCT CCGGCAATCT GTATCGGCAG 1O20 

AAAGACCATG AGACGACACG CAAATCCTTG GCTCGTTTGG GAGCCVTCTC CGTTATCGAT 1060 

CTCAATTTTT TGCAACGCGA TT CCATTTCC GGCCTTTTGG ATGTGCGACT GCTAACCACC 1140 

CTCGACAAAC CTTGGGATGC AT CAT TAG AG ACCTTGTTCA CGAGCAAAAG CAATGACTTC 1200 

60 ATCGGTCCCG GACTGAATTT TGCTCTTG CT CGGCGCAATG TATTCGGCGG AGGAGAAAAT 1260 

CTTTCTTGGA ATATCGGTGG ATCGTATGAG TGGGAGACCG GCAATCGTCC CGAAAATAGC 1320 

AGCAATCGGC TGATCGATAT AAATTCGTAC AACAT G AAT A CGGCCGTGAA CCTCTCGTTT 1380 

CCCTCGATTG TATTTCCCGG TCTGCTGGAT AAATACTATT ACTACCCCAC GACTACGACT 14 40 

TTTCAGGCTT CTGCCACCGC GCTGAACAGG GCACACTACT TTAGCATGTA CTCTTTCGGC IS 00 

65 TTTTCGACCA CCTACGAATT TCAGCCCTCC AAGGAACACC GGCATGCTAT TTTCCCGCTC 1560 

AAGCTCAACT ACAACCTCCT GGGGCAT CAG ACAGAAACTT TCCAGGCCAT TACGGOGAAC 1620 

AATCCGCCCC TGCTGCTCAG CCTTCAGAGT CAGTT CCTTG CTCAAATGGG GTATATCTAT 1680 

ACGTTCAACA AATCCGTTTC AGAGAAAAGT CCTCATCATC TTTGGATGCA ATTCGGACTA 174 0 

TCCGAGGCAG GCAATCTCCT GAATCTGATC TATCTGGCAG CCGGCAAGAA GTACAGCGAC 1800 

70 ACCAAGAATT TCGTCGGCGT CCCCTTCTCT CAGTTCATCA AAGCCACGGG AGAACTGCGC 1860 

TATTCCTATA CCATACAOCG CAAT CAGTCA CTGGCAACCC GTTTCGGGAC AGGCGTGATA 1920 

TATAGCTATG GCAATATGCG AGTGGCACCC TATAGCGAGC AGTTCTATGT AGGCGGTGCC 1980 

AATAGTATCA GAGCTTTCAC CGTCCGTAGC ATCGGCCCCG GACGGTTCAA TCCGGATTCC 204 0 

G AG AAT CAG T ATTCCTATTT GGATCAGGTG GGCGAATTCA AACTCGAAGC CAACGTGGAA 2100 

75 TATAGAGGCA AGCTTTTCGG GGATCT CCAC GCAG CCGTTT TCCTCGATGC GGGCAACGTT 2160 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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TGGCTCTTGA (iGGAGGATTC TTCCCGTCCG GCCGCTGCTC TGTCCGAAGT GGGATCGGTG 2220 

AGCAATTTCC TGAATAGCAT CGCTCTCGGC ACCGGTGTCG GCCTTCGCTA CGATCTGGCA 2280 

TTTCTCGTGG TTCGTGTCGA TGTCGGCTTC GGTCTCCACC TTCCTTACAA TACGGGTAAG 2 34 0 

AAAGGTTACT ACAATATCCC ACGCTTTAAG GATGCCATCG GTTTCCATTT GGCTGTCGGC 24 00 

TATCCCTTC 24 09 



10 



15 



20 



(2) INFORMATION FOR SEQ ID HO: 63 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 2 34 9 base pairs 

(B) TYPE: nucleic acid 
(C> STRA11DEDNESS: double 
(D) TOPOLOGY: circular 

(ixl MOLECULE TYPE: DMA (genomic) 

(lit) HY POTK E Tl CAL : NO 

Uv> ANTI-SENSR: HO 



25 



<vii ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROI tONAS GINGIVALIS 

Uxi FEATURE; 

(A) NAME/ KEY: m1?c_feature 

(B) LOCATION 1...2349 



30 



35 



40 



45 



50 



55 



60 



65 



70 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



TCTCTCTCCC 
ATTGCCGGCT 
GACGGTAGCA 
CCGGAAGATA 
CGCTGGCTAO 
TCGCTCCGGA 
GCCAACCGTC 
GTGGTAGACA 
CGTTATTATA 
CTTGGCAATT 
CCCATCGTAC 
TGGAAGTTCT 
GGTACGAAAT 
ATCGGCACGG 
GAGCTGCCAC 
GGACGTTATA 
TTCTGCGAGG 
AACGTGAATA 
TCTCGCCTTG 
GAAGTCCTCG 
GATCGCAATT 
GCCATTCGCA 
CCTCGTCTCC 
GAATGGAAGA 
GCTCAACTCA 
CTGCTCGATG 
CCGCCTACGA 
CTGAACTATA 
TTCAGTATCC 
AAAGACGAAC 
GATCTCGATC 
G , 3TTT CGGAC 
TTTGCCGGAG 
ATGAAGATGA 
AATGTCGAAT 
GGCAATGTCT 
CGCTTCTACA 
TTCCTTGTGC 
AAATGGGCTA 
TATCCGTTC 



CGTATATCCG 
GCTTGCTCCT 
GACTATTAGA 
TTCGGGACTA 
TGCGCGTCTA 
AAATGGGCGA 
TGGCAAAGGC 
CCGGCTTGTA 
TACGCAATAT 
CGCTTCCTTC 
TCGATGAAGA 
CCGCCGAGGA 
CTGCCGATCT 
T ATT CTTTCA 
GTATCGATTC 
TCCGGGCATC 
ATGATGTGGA 
TCCGATTTGT 
TGGACTGCTA 
GCACCAATTC 
TGTTTCGTGG 
AGGGTTCGCA 
TCTTCCCATT 
TCGGGTATAA 
ATTATTCATG 
TCGATTATCT 
CTGCACTGTA 
CCACCGCTTC 
AGACAGCCGG 
ACGGGTTGTA 
TGGCTAAAAC 
TGGCTTTCCC 
GATCGAACAG 
CTCCGGACAA 
ACAGGACAAA 
GGACGATAAA 
AGGAAATAGC 
GGCTGGATGC 
TCACACGCCC 



TTTTCCTATG 
CATGCTTGCT 
CAGGGTAACG 
TACCCTCCAG 
TAGCAGCTCG 
ACCGCCTGTC 
GATGGCCGGC 
CAAGAAAGCT 
GGCTTTGGAT 
GGCATACAAG 
GAGAAAGGCG 
TGTTTATTAT 
GAAATTAGTG 
TGCCGATTAT 
GATTTCGCGT 
GGCTCTCACG 
ACGCTCTTAT 
GGAGCACAAT 
TATTCTTACC 
CGCTGGAGAC 
GGCGGAGATG 
CAGCTTCATG 
CATTTCTGAC 
TTACCAGACA 
GCAGACCTAC 
CCATCTCCCG 
TAACTACACG 
GTCCATGGAG 
CAACCTGCTG 
TAAAATGTTC 
CGTTCTTCTC 
TTATGGCAAT 
CGTTCGCGGC 
GACCTTCTTC 
GCTGTTCTGG 
GSAGTATGAG 
TTTGG CCTAC 
CGGACTGAAA 
AAACCTTTCT 



TCCTCGCATT 
TCCTCCTGCT 
ATCGCAAGCG 
CAACCCAATT 
AAT CCGAACA 
CTCATCGATT 
GATGGCTTTC 
CGCATTACTT 
GTGAAGAATC 
GTCGGGATCA 
ATAGCTCGTC 
GAAGCAGATA 
GTCAATGGCA 
GATCCTCTCG 
GGCGATTACA 
CGGTCGGTGT 
ATCAAGCTGA 
GGTAAGGATG 
GTTCCGGCCA 
TTCGGGGCGG 
TTCAATATCA 
GAATATGGGG 
GAAACGCGCC 
CGTCCGGAGT 
CTGCACAATC 
TACATCGATC 
GAGCAGTTTA 
CGTACCGTAT 
CAAGCCATTT 
GGTCTGCACT 
GAAAAGGACA 
GCTCGCCATA 
TGGAGTGTCC 
GATCAGATGG 
AAGTTTCGCG 
AATCAGGAGG 
GGTCTGGGGC 
GCCTACGATC 
TCCAATTTCG 



CCGTTCGGTA 
CGGTCACCCG 
AAACGGGCAG 
ACAGACTGTT 
GCAACAGCTG 
CTGTCCTCAC 
TCGATGCTAC 
ATCTGATTCA 
CACTCCTTCC 
GCGAGGGTTC 
ATATGCGCAA 
CTACCGTTTC 
TCGGGCGTTA 
AATCGGACTT 
CTGTT TACT A 
CCGTTACACC 
ATGCGCTCCC 
AGATTGCTCT 
AGAGCAAATC 
CTTTGTCTCT 
AACTCAAGGG 
TGGAAAGCTC 
GGCGGCTACG 
TTGATCGGGT 
GTCTGCGTCA 
CCGACTTCGC 
TCCTCGGCTC 
CCAATCCTTT 
CTTATCTGAC 
ATGCTCAGTT 
ATACTTTGGC 
TACCCTTTGA 
GTACCCTCGG 
GTGATATTCG 
CAGCAGCTTT 
ACGGTCTCTT 
TTCGTCTCGA 
Cl'CAGCAGAC 
CTTGGCACAT 



TCTAATCGGC 
TTATGTGCCG 
TATCGCTCTG 
CGGGATGACT 
GTG5AACCGT 
CGATCGTACT 
TGCTCGTGCC 
GCCCGGAAGC 
TCCCGTTGCG 
TCCCTTGTCG 
CAACGGCTTC 
AGGAGGATCG 
TCCATATCGG 
CAGAGTTCAG 
TGGGAGTAGG 
GGGAGCTTTT 
TATCGTTCGG 
GGCGGATAGC 
GTTCGAAGCC 
CGGTTTCACC 
TGCTTACGAA 
GCTCCGTTTC 
GGCATCCACG 
GATTCTCTCC 
TACGATCOGC 
CCAATCCCTT 
GGCATATATA 
TACGGCACGG 
CGATTCTCCG 
CGTCAAGCTC 
ACTGCATCTG 
GTTACGTTAC 
CCC3GGGAGT 
TCTGGATCTG 
TGTCGATGCC 
TCGTTTCGAT 
CTTCGATTAT 
AGGGCGTTAC 
TGCAGTAGGC 



60 
12Q 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1 740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2310 
2349 



75 



(2) INFORMATION FOR SEQ ID NO: 64 

(ii SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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u..,ii » .u™ 
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10 



(A) LENGTH : 2625 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS: double 

(D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DMA (genomic) 
(Hi) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

lu) FEATURE: 

(A) NAME /KEY : mirc_feature 
{ B ) LOCATION 1. . .2625 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



GTCGAATCTA 
TTGCTGCTGG 
ATCGATTCAG 
TCCCTCCGCG 
TC7GTCTTGA 
TCTCAAGAAG 
CAGGTAATAG 
CGTGTtiGCTG 
ACTCCCTCTA 
CGTGG ATT CG 
GAAGACGGGA 
CAGATTCAGC 
AACATTATCA 
AATGATGGAT 
ATCACCATTG 
GCATTCTCTT 
ACCGGATTCG 
TATGACAAAT 
CCTACGGCTT 
AAGATGGATG 
GGTCGTCGCG 
CCCTATGAAC 
GCCAATGCTG 
AAGTGGTTCG 
GCCGGATACG 
GGCGGTAGCT 
CAACTGAAAG 
CTCTTCGCAC 
AACTACGAAC 
GGCGTATCGC 
TTCGCACAGG 
GGCAATATCT 
CTTACAGGAG 
GGATACTATA 
TATCTCAATG 
ATTCGTCAGA 
GTAAGTTACA 
AAGGGTCTTC 
CTGTTCAAGG 
TTCAACCCCG 
TCATGGAAGT 
GGTTCACTCA 
AGCGATG CCG 
TACGGTTTCG 



AATTGTTATG 
CATTAGGCTC 
AAACGT CCG A 
GTGTAACCAC 
CTTTCCGTTG 
ACCTCGGTAC 
CCTCTGTGGT 
ATATTCAGGC 
CCTATACGAC 
ACACTTACAA 
AAGTATATTG 
GCGGACTCGG 
CGAAGACTAC 
TCCACAAAGA 
CAGGCTCCCA 
ACTTCTTCAA 
GTGCACCACA 
ACGGCATCCG 
ATGCTTACAA 
AAAATACCTC 
CTTATGGAAA 
AAACAAAGGT 
CGGCGACCAA 
GTCTACTCTC 
ATGGGCGTTA 
ACTACATAGA 
TGGGTGACAT 
AGATGGAGCA 
TATACCGCAA 
CGTGGAAAAG 
GACACAATGT 
ATGCTGCGGG 
AGGTCGGCTA 
CGAAGTGGAT 
GCGTTGATGC 
TCGACCTTCG 
CTTCTTACGA 
ACGTCGGAGA 
GTTTCCATGT 
CAACGCGTAA 
TGCCCGATGT 
GCACCACATT 
ACGACAATAT 
GCCGCACTTG 



TCTTATGAGA 
CTCTCTCGCC 
ACCGCTCATC 
CGATATGGAT 
CGTAGGTTAT 
GATTCTCCTC 
GCCCAAAGAC 
AGCATCGTTG 
AAAAGGAAGC 
CTTCGGTGTA 
GAGCAATTCG 
AGCCTCCAAG 
GGACGCCAAC 
ATCGTTCTCC 
TATGACGGGT 
CGTTTCGAAG 
ATGGCACAAC 
TCACAATCAA 
TACGTACCAC 
TCTTTATACc 
GAACAGTAAG 
GACTCCCGAT 
TGGCTCGGAA 
TTCATTCAAG 
CTACCGTGGC 
GGATCCCAAG 
TGTAAATCGG 
TTCGTCCGAA 
TCACAACTAT 
CTTCCTTCCG 
ATTCGCCAAT 
GGCTATCATT 
TGGATTCACG 
GGATCGCGTG 
TGTTCACTGT 
CGGTATGTTC 
CGAAGCCGGC 
TGCAGCACAG 
CATAGGTAAG 
TGCACAGCAG 
AGGTCTGTTO 
CTATTTCAAC 
CATCGGTAAG 
GTCTACCGGT 



AAAAGAATTC 
ATAGCGCAAA 
GGTGTATCCG 
GGTGGCTTCC 
GCTACCGTAA 
GATCCCCAGG 
CGTATGACGC 
AATGTCGAAT 
GGAGGTTTCG 
CTCATCAACG 
AGTGGTCTGA 
CTCGGTATCA 
ACCGGAGGTT 
ATT TCTACGG 
CTGGGTTATG 
AAGTTCAATG 
CAACGTTCTT 
TCCTTCGGCT 
AAGCCCCAGT 
g CANCCT ACG 
TGGGT ATT GA 
GGACTTATCG 
GCAATTTTTG 
AAGAAACTTA 
GACCACTATG 
ACAAAGCTCG 
GACTACACAG 
TGGATCGATG 
GGCGGTAGCA 
TGGAGTGGCA 
GGCGGTTTCT 
CCCAATGACA 
AATCACAAAA 
ACCTCGAAGA 
GGGGTAGAGG 
TCTCTCGGTG 
AATGAGACAG 
ATGACGGCTG 
TACAACTTCC 
TACGAAGCGG 
GAT CTGTCTG 
ATGGACAACG 
AAACACGATG 
ATTCGTGTAA 



TACAACTTTT 
CAGTGGTGAC 
TAAGCACCGG 
GATTCGAAGT 
CTCGCTCTAT 
CCATCGGCTT 
CGGTACCCGT 
TTCCCGAACT 
GTGATGGTCG 
GAGTTCCTGT 
TGAATCAAGC 
GCTCGGTAGG 
CGGCTTATGT 
GTATGAACGA 
TGAAGGGGCT 
AACGT CAT AC 
CCAAATATTC 
ATCTGCGAGG 
TCTCGCTGAA 
CATCTTTGGC 
TCAACTACAA 
ACTACGATGC 
CCCTTGGCTC 
ATAGTTCGCT 
ACAAGATCAC 
CATACCATGC 
GCGAAATCAT 
CATTCGTATC 
AGTCCACCGG 
AGGCAGGTCT 
TCACACGTGC 
AAGCCAATAT 
ACTTCGAGTT 
GAATCGGAAA 
CTGACGTCAG 
ACTGGACTTG 
GGCAGGATAT 
CTGTATCGGC 
TTGGCAAGAA 
ATGGCAAAGA 
CATCCTACAA 
TAGCCGACAA 
AGGCTTCGGC 
ACTTC 



CCTGACCGCA 
CGGTAAGGTG 
TCAGGGAGCA 
ACCGGCCAAA 
AGGCAGAGGT 
CG AT GAG ATT 
TTCCAATATC 
GGTTAAATCC 
TACCAATGTG 
CAATGGTATG 
CAGTACCATT 
TGGTACGATG 
CGGTATGGGT 
CGGTTGGGCT 
GAAGGGACGT 
CCTCTCTCTT 
TGTAGCCGAC 
CGAACTGACT 
CCACTTCTGG 
TACCGGTGGA 
CACCGGACAA 
CGTACTGGCT 
CAACTCTCAC 
GACTTTGACA 
CGATCTGCTC 
GGAAGGTCAG 
GTGGCACGGC 
AGGATCTATC 
CTACCTGCCC 
GAGCTACAAG 
ACCACTCTTT 
GGAAAAGGTG 
CAATATCAAC 
CGAGTATGTT 
CTATCGTCCT 
GCAAAACAAT 
AACCTATATC 
AGACATAGAG 
CTATGCAGGA 
AATCGTGGAA 
TTTCAAGCTT 
GCGATATGTG 
TCTCGTATGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2500 
2625 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 65 

U) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1380 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(Hi MOLECULE TYPE: DMA (genomic) 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Uii) HYPOTHETICAL: NO 
(iv> ANTI-SENSE: NO 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

FEATURE : 

10 (A) NAME/KEY: misc_f eature 

( B) LOCATION 1...1380 

{Ki) SEQUENCE DESCRIPTION: SEQ ID MO: 65 

15 AACAGATGTA ATCCCTCGCG TCAATACTTC CACCTATCGC AGAATGACGT TGGAAGATCT 60 

CTCCGAGCTT TTCTAATCCA TCTATCTATG AAGTTTTCAA TCCGCCTTTT CCTCTGCATC 120 

ATCTTTCTCC TCTCTGCATT TATCCTGCCT GCTCTCGCAC AAAAATCCAA GCAGGTACAG 180 

CGACTTGAGA AGCAACCTAA GGAGGCCCTC AAAGCCATCG AAAAAACCGA TCGCGAACTA 240 

CGAAATACCA AGAAAGACAA GCAAGACAAA CAAAAGCATC TCAACCTCCT GAACAAGCAG 300 

20 GTTGCTCAAC GCAAGCAGAT GGTACAACTC TTGGACAATG AGGTCAAAGA GTTGCAATCC 360 

GACATTGATT CCATGACGGG TGTATGTCAT CAGCTCTCTG TACAAGAGAA AGCCCGATCC 420 

GATGAATATG CCCAAGCTCT ACAGTCTATU CAAAAGCGGA AACGCTCGTT GGATCGCATC 480 

CTTTTCATTT CATCGGCCAA GAGCTTTGAC GAAGGCATGC GACGGATGCG TTTCTTGGAA 54 0 

CAATACGCTT CTGCATACAA GCTGGCATCT GTCCGGCTGC GCGATACACG TAGCAAGTTG 600 

25 GAGACTGAAC GTGCGACTGT AGAAGACGCC AAAAAGGAGA AAGGACATCT CT TAG T CATC 660 

AGAGAAGAGG AAAAAAAGAA ACT CGAAGGA CAGCAAGCCG AGCAACGTCG GCAGGTGCAG 1 20 

GCTTTGGGAG CCAAACAAAA AGACTTGGAA GCGCAGCTGC GAAAGCAGAA AAAGCAAGCC 180 

GAAGCTCTGA ACAGAAAGAT CGAGAAACAG ATTGCCAAGG AAATAGAA6C TGCCGAACGT 840 

CGTGCTCGAG AAGAACGTGA ACGGTTGGCA CGCGAAGCCA AAGCCAAGGG TAAGCCGGTT 900 

30 CCTGCCGAAC CGGAACGGAA GGCGGAGACC AAAGGCGGCT ATGCTATGGA TGCCTCTGAG 960 

CGTGCTCTCT CGGGCAGCTT TGCACAGAAC AAAGGTCGCC TGCCCGGCCC CGTTCGCGGC 1020 

AGATACCGAA TCGTAAGCGA CTTTGGCGTG CATCAGCACA GTGAGCTGAA AAAAGTACAA 1080 

GTTAATAATG GAGGTATCGA CATCGCTGTA GCAACAGGAT CCGATGCTAC CAGCGTATTC 1140 

GATGGTGTAG TGTCCAGTGT ATTCGTGATA CCCGGTTATA ATTCGGCCGT AATGGTTCGT 1200 

35 CACGGTAACT ATATCACGGT TTATGCGAAT CTGAGCAAAG TGTATGTAAA TTCCGGCAGT 1260 

CGTGTTAAAA CGGGTCAGGC TCTTGGTCGT GC CTATACGG ATCCTTCCAA CAACCAGACC 1320 

ATT ATT C ACT TCGAAATCTG GAAAGAACGC AGCAAACAAA ACCCAAGACT ATGGT TACGA 1380 

40 (2) INFORMATION FOR SEQ ID HO: 66 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10? 6 base pairs 

(B) TYPE: miclsic acid 
45 (C) STRAW DEDNESS: double 

[D> TOPOLOGY: circular 

tii) MOLECULE TYPE: DWA (genomic) 

50 Uii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(IX) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1...1026 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HO; 66 



AGTTTTTATC AAGAAATACA CAGACTTATG AAAAAGTATT TGTTATATGC CTCGTTGCTA 60 

ACGAGTGTTT TGCTCTTTTC CTGTTCAAAG AACAATCCTA ACGAGCCGGT GGAAGACAGA 120 

65 TCCATCGAAA TTTCTATAAG GGTAGATCAT TTCACCAAAA CGGGTGAGGC AGTACGCTAT 180 

GAAAGGAATC AAGGAAGT GC TGCCGAAAGG CTCATTACCA ATCTTTACCT CTTGTTGTTC 240 

GATCAGTCAG GGGCGAATCC GGCGAAATAC TAT AT TACCG GTAACACTTT CACCGGAGGG 300 

ACCTGGCTTC CT GACGAT AT GAAGGTGAAG TTGGATATGA CACAATCCGA GGCCGGAGAG 360 

CGCAAAGTAT ATGTCGTAGC CAATGTTGAT AATGCGGTTA AAACGGCTCT TGATGCTGTC 4 20 

70 G CT AA CG AAA GCGATTTGCA GACTGTAAAG AGGACGACTG CAATGCCGTG GTCGACCGAT 4 80 

ATAGCCTCTC CTTTCCTGAT GTCCGGAAAC AAGACACACG ACTTCTTGGC CAATCGTCTT 54 0 

TTGGACAATG TGCCCCTTGT GCGTGCCATT GCCAAGGTGG AGCTGAATAT CTCGCTGAGT 600 

GAGAAATTTC AGATTGTGCC GATAATTGTC AATGGTAGTT TGAGTGAGTT CAAGTTCAGA 660 

TACGTAAACT TCGACAAGGA GACCTACGTA GTGAAGCCAA CGACCAAGCC GGACAATCTC 7 20 

75 ATTAGTTCTG CTAATGGTGT TTGGCCTCAG ATTACAGATT GGACTGTATG GGGTGCTTCC 7 80 
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TTAAATACTT CTCCTGCTCC GGATGCGGGC ACAGGTTATA CATTGGATGC AAATGGCAAG 34 0 

GTAACGGCAC TACGGATTGT TACCTATCTG AATGAGCGCG ATAGCAAAGG GGCTACGGTA 900 
GAGGTCGCAT TGCCTCGTGT GGATGATGGC ACCCTTCCTC CTCCGGAATT CGGTCCGGAG 960 

CTTTATCGTT TGCCTTTGCC GGACAAG AT C CTGCGCAATC ATTGGTACAA GTATCAAGTC 1020 

5 GAGAT'l" 10Z6 

(2) INFORtlATION FOR SEQ ID WO: 67 

10 m SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRAUDEDNESS: double 
{ D) TOPOLOGY: circular 

15 

(il) MOT.ECULC TYPE: DMA (genomic) 

(in) HYPOTHETICAL: HO 

20 liv) ANTI-SENSEt WO 

<\ i) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVAL IS 

25 (ix) FEATURE: 

(A) NANE/KEY: misc feature 
< B> LOCATION 1 . . . 997 



30 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID IIO:67 



AACCTTAGGA CACAGCCTTT CTTTTTGGTA GATTTGCAAT CTATGATCAG AACGATACTT 60 

TCACGATATG TATCCTCGAA CTTTTGGAGT CGGGGAGCTA CCTTTTTTTT CACGATTTTC 120 

CCGGCCTTCA TCCTCGCCGC TACTGCTTTG CCGGCTTGTG GAGGGGGTAC TGCTTCAGGC 180 

TCCGATCGTA CGCTGGCTGT GACCATCGAG CCACAGAAAT ACTTCATCGA GTCCATTGCG 240 

35 GATAAGTCGG TGCAGGTGGT GGCATTGGTA CCGGCCGGCA GCAATCCGGA GGAATACGAC 300 

CCTTCGCCTA CCGTGATGAA GCUTTTGTCC GAAGCAGATG CCTACTTCTA TATAGGAGGA 360 

CTGGGGTTCG AGCAAAGAAA TCTCGCTGCC ATTCGGGACA ATAACCCTAA GCTCCCTCTT 4 20 

TTCGAAATGG GCAAAGCCTT GGCCGATGCC GGAAGTGCAG ATCTCCACGG CTCCTGCACA 480 

GATCATTCTC ATACAGACCT GCATGCCCAT GATCCGCACT ATTGGAGCAG TGTGGTAGGG 54 0 

40 GCAAAGGCAC TCAGTCGTGC TGCATACGAC GCGCTTGTGG AGCTTTATCC GAACGAGAAA 600 

GACAAATGGG ACAAAGGGCA CGACCGTCTC AACGGACGTA TCGACAGCGT GAAGAGACTC 660 

GTCGATACCA TGTTTGCCAA TGGCAAAGCA GACAAAGCCT TCGTCATATA TCACCCATCG 7 20 

CTCAGCTTTT TCGCCCAAGA GTTCGGCCTG CGGCAGATCG TCATAGAGGA AGATGGGAAA 7 80 

GAGCCTACGG CTGCCCACCT TCGTCGTGTG ATCGATCAGG CACGT GCCGA TGGTGTCAGA 84 0 

45 ATCGTATTTA TCCAACCCGA ATTTGAAACG CGTCAGGCGG AGGACATCGC ACGCG AG AT C 900 

GGTGCTCGTC CGGTAAGGAT CAATCCTCTG CGCAGCTCGT GGGAGGAGGA AATTTTACAT 960 

ATTGCTCGCG CTTTGGCTCA TGAACGG S07 

50 (2). II1FORHATION FOR SF.Q ID NO: 68 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2634 base pairs 
tB> TYPE: nucleic acid 
55 (C) ST RAN DE DH E SS : double 

(D) TOPOLOGY: circular 

(ii) IIOLECULE TYPE: DMA (genomic) 

60 (ill) HYPOTHETICAL: NO 

Uv> AIITI-SF.HSE: NO 

(vl) ORlGIt4AL SOURCE: 
65 (A) ORGANISM: PORYPHYRCTIONAS GINGIVALIS 

(ix) FEATURE : 

(A) NAME/ KEY : mi3C_f eature 

( B) LOCATION 1...2634 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 



GCAGATTCTA TTCGATATCC TCTTTACTTT TTTGGGCGGA ATCGGAAGAA ATG CTTTAGG 60 
GAACCTATTC CCACCTTATA CAATAAAAAC ATGATCGGAA AAAAAATCTT TTTTATCCTG 120 
75 CTGGCGCTCA TTGCGTTCAG TGGGCTGAAC GCAGCGACAG ACACTGAGTT CAAGTACCCG 180 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



WO 99/29870 



46 / 490 



ACCGATGCCA ATATCATCGG TCACGTCAAA GACAGCAAGA CGGGTGAACA CCTTGTCGGT 
ATCACTATTG CTATCAAAGG CACTACCTTT GGTACATCTA CAGATGCAAC CGGGCACTAC 
TATCTTCGTA ACTTGCGTCC GGGTGAGATC ACTTTGATTA TGCGTGGCAT GGGCTATAAG 
AGCCAGGAGC GCGTAGTCCG CGTAGAAAAG GACAAGACTA TCGAGGTGAA TTTCGAAGCA 
GAAGAGGATG CCATCAATCT GGACGAAGTC GTGATTTCGG CCAACCGCGA ACTGACCCTT 
CGCCGTCTTG CTCCTACTCT GGTAAATGTA TTGAACGAAA AAGTCTTCTC GCAAGTCAAT 
GCTTCTAACC TGGCTCAAGG CTTGT CATTC CAGCCGGGAG TTCGTGTAGA GAACAACTGT 
CAGAACTGTG GTTTCAATCA AGTT CGT A'i'C AATGGACTGG ATGGTCGTTA TGCACAGATC 
CTCATCGACA GCCGTCCCAT CATGAGTGCC CTTGCCGGTG TTTACGGTCT GGAGCAGATC 
CCTGCCAATA TGATCGAACG TGTGGAGGTA GTACGTGGTG GAGGATCGGC CTTGTACGGT 
TCTTCTGCTA TTGCCGGAGT GGTGAATATC AT CACCAAGG AACCTTCTCA CAATTCTTTC 
ACATTCAATG AATCTCTGAG CTTTACCGGX TTCAGCAAGC TGGATAACAA CACGAACTTC 
AATGCCTCCA TCGTCAGCGA TGACAACCGT GCCGGTGCCA TGGTATTCGG GCAGGCTCGT 
TACCGCAACC ATTGGGATGC TAACAATGAC CGTTATTCCG AATTGGGTAA AATAGATGCC 
CGCTCGCTCG GAGCGCATTC TTATTTGCGC TT6AGCGACT ACAGCAAATT GACGGGAGAG 
TTTCACACGA TCAGTGAATT CCGCCGTGGT GGCGATCGTA TCGATTTGCC TCCTCACGTA 
GTGGGTGTAG CTGAACAAAC TGACCATAGC GTATTTAGCG GAAACTTGAA ATACGATCTC 
TTCTCTTOCA ACTATAAACA CCACTTCCAG GCTTATACTT CCGGACAGAT CGTAAATCGC 
AAGAGCTATT ACGGAGGTAT CGGAGAGATT GACGTCAATG GCCACCCCGG TGGTACGGAA 
GGCTACCCTA TCCCTCAAGA TCAATACGGC AATAATTATG GCGTGACCAA AGGCAAGACA 
TATATGGGCG GTATCCAGTA GAGCTACGAO TTGGACAAAT TCCTCCTCAT GCCTTCGCAA 
CTTTTGTTCG GAGCCGAATA TACGCGTGAT GAACTCAATG ACGTGATGCC CATCCTTTCA 
TGGCAGACCG GCGAGGATGC CAATGGGAAT ACCATTCCCC TCTATCCCGA ATTGGATCAG 
AATATCAACA ACTACAGCCT ATTCGGTCAG AACGAATGGA AAAATGACAG ATGGAGCATC 
CTTGTTGGCG CTCGCTTGGA CAAGCATAGC GAAG'fCAAGG ATATGATTCT GAGTCCTCGT 
AC CACACTGC GTTTCAACGV GAATCCGGAC ATCAACCTGC GCGCTACATA TGCAAAAGGG 
TTCCGCGCAC CGCAGGTATT CGATGAAGAC TTGCACGTAG CGGTTGTAGG CGGTGAGGCA 
CAGAAAGTAT TCAACGATCC GAACCTCAAG CCTGAAATTT CTCATGCATT CAGTTTGAGT 
GCCGATATGT ATCATCGTTT CGGTAACGTC CAGACCAACT TCCTTGTGGA AGGCTTCTAT 
ACTCOTTTGC TGGA'FGTATT CACCAACGAG GAGCAGCCTG ATCAGCACGA TGGCATCAAA 
CGCTACACGC GTATCAACGG TAGCGGAGCC AAAGTATTCG GTCTCAATCT GGAAGGTAAG 
GTCGCATACA AGTCCTTCCA GCTCCAAGCC GGTCTTACCC TGGCCAGCAA CAAATACGAC 
GAAGCACAGG AGTGGGGTCT GAATACGGTG AAAGACACCA ACGGAGCTTT TGTTACCGAG 
GCCAATGCAA ATGGACAACA GGAATACAAG AACGAATCCA TGACGGATAC GCAGATCACC 
CGTACCCCCA GCGTATACGG TTATTTTACT TTGGCCTACA ATCCTGCTCA CTCATGGAAC 
ATAGCCCTTA CGCCAGCATA TACCGGTCAG ATGTATGTAC CCCACGCTAT CGAATATGGT 
GTGAAGTCTG CCGAACTGGA TATTATGCAG AACAATCCTG AGATTACCGA CGAAACCGGA 
AAGGCTCCCC GTATTGATGA GCTGAAGAAG ACACCTGCAT TCTTCGATTT GGGCTTGAAA 
GTGGGTTATG ACTTCCACGT ATTCCAGGCT ACTGAGGTTC AACTCTATGT AGGTATGAAC 
AATATCTTCA ACTCTTTCCA GAAGGACTTC GAT CGTGGAG CTGCACGTGA CAGCGGATAT 
ATCT ATGGTC CTACGCAGCC GCGTACAGGC TACATGGGCT TGGTAGTGAA GTTC 



(2) INFORMATION EX3R SEQ ID HO; 69 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ili MOLECULE TYPE: DNA (genomic) 

Uii> HYPOTHETICAL: NO 

(JvJ AIITT -SENSE i NO 

(vi.) ORIGINAL SOURCE: 

I A) ORGANISM: PORYPHYROMOtlAS GIIIGIVALIS 

(ix) FEATURE: 

I A) NAHE/KEY-* miscjeatute 

(B) LOCATION 1. . .616 

Ui) SEQUENCE DESCR1 PTION : SEQ ID MO: 69 

AAACAGATAG TTATGACAGT AAAGCGCGCA GTGCGAATAG CACTTCTCAC GCTGATAGGC 
ATTCTTTTTT CCTCACCTTC TCTTGTTCGG GCGCAAAGTC TTTTCAGCAC CGAACATGTC 
TTGCAACT AT ACAACAAGAT ACTCTATGGA GAGTCGGCGG CGGATACCGT CGCACAGAAA 
ACGGCAGGTG AGTCGGCATT TCCTTTTATA GACAAACTCA TCAATCTCGG C CGC ACT TTC 
CTCGGCAAAC CATATCGCTA TCGCGGTCCT TCCCCATGGC CGATGGACTG CTCGGGCTAT 
GTGTCTTACC TCTACTCCAA ATTCGACATC AAACTCCCAC GTGGTGCGGC AGCACAGAGC 
CAATATACGA ATCCTATCGA GCGCGAGGAT GTTCGTCCGG GCGACCTCCT TTTTTTCAAA 
GGCCGCAATG CACGCAGCAA CCGTATCGGG CATGTAGCTT TGGTCGTATC TGTCGATGAA 
GATGATATTA CCATGATGCA CAGCCGCAAT TU6CGAGGGA T CGT GAT CG A AAAACTCAAT 
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SccSc g££££ CG TCGCTTGGTG ^™GGCA GGGTACCCGG ACCCAAGAGA 

5 (2) IK FORMATION FOR SEQ ID NO: 7 0 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 01 base pairs 

(B) TYPE: nucleic acid 
10 <C> STRAIJDEDNESS: double 

t D ) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (Ml) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



600 
618 



25 



30 



35 



40 



45 



50 



55 



65 



(ix) FEATURE: 

(A) NA1 IE/KEY: mis c _feature 

(B) LOCATION 1 1401 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:70 



TTCTAT CAGG GCAAAGACTT C 



(2) INFORMATION FOR SEQ ID HO: 71 



CCTCCTGACG 


60 


CGAAGCATCA 


120 


GGAGCGCAAC 


180 


CGAT GCAGCA 


240 


CTTCGGACGT 


30O 


TCTCAGCATC 


360 


GCAGCAAAAG 


420 


CAGCCTGCAA 


480 


GGCAGAAACA 


540 


CGTAGGTAAA 


600 


ACAACTTCTC 


660 


CCTCGAACTG 


720 


AGCAGAAAGG 


780 


GAAACCGGCA 


840 


GGCTCGTGCG 


900 


CCGCGACCTC 


960 


CAACGGCAGC 


1020 


GCAAGATCGC 


1080 


AGAGAAAAAA 


1140 


TAAGGCCATC 


1200 


TCGCGACAGC 


1260 


ATACGCCCTC 


1320 


AGTTTTGGAT 


1380 




1401 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 
IB) TYPE: nucleic acid 
Rn ( f -> STRAW DEDHESS : double 

DU t D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: NO 

(Iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 
7 q < A ) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : mi sc feature 

(B) LOCATION 1...1353 



75 



(vi) SEQUENCE DESCRIPTION: SEQ ID IIO:71 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



AAAACATCAT 
GCGGCTTTGG 
GACTATGCAC 
AAGGCCGATT 
GCTTCGTGGA 
AGCTTCAACA 
GTCTATCGAC 
CAGCAGGAGC 
CAAATGCAAG 
GCTCGAATGG 
ATGGCCGGTG 
CGGCTCAAAG 
GCTGACAGTC 
GCTGCCCATC 
CGTTTGCGAG 
ACGGGATTCT 
AACCGTCGGG 
CTTGTGAGCC 
GAAGCGGAGC 
CTGGCTTCCT 
GT CTT GCAGC 
CGGCTCCTGG 
AAACTCATAG 



ATCGGAATAA 
CTGCGAATCC 
GCCGGCACAG 
ACCTTCAGGC 
ATTTCGGACG 
ATTCGTACAG 
TGCGGATGGC 
TGGCAGCTCT 
AGCTGGCCAT 
AAGAGCTGGG 
ACCGTTTGGC 
AAAAAATGAA 
TCTCCGCCGA 
ATCATCCCGT 
CCGCGCGAGG 
CACGCTTTTT 
GGGAATACGT 
ATCTGCGTCA 
GCAGGCTCTA 
ACCGCCAGGC 
GTTATGAGGA 
ATGCCCGTGT 
CCTATTACGG 



TAT GCGTTTC 
CCTTACGGGC 
TTCGGCCGTG 
CGTCGGCAAT 
CGGATTGGAT 
CATACATGCC 
GCATCCACGC 
CGGCACCACG 
GCAGAAGTAC 
GATGAAGAGT 
CCTGACTCAA 
CTTCCCCATG 
CAT GGCCGAA 
CCTCCTCCGT 
TGCATTCCTG 
GAATGGATCG 
CAGTCTGAAT 
GGCGCGTGCC 
CAGCGAGATC 
GAAGGAGCAT 
GGGGCTGAAT 
GCAGCGACTG 
CTGCCTTTCG 



CAACATTATC 
CAATCGAATA 
GCGCTGTCCG 
TTTCTGCCCC 
GCCGAGACGA 
ACGAT GACCC 
CGGGAGGCTT 
GAGGCCTACT 
GAGGAGAGCA 
CGTCCCGATG 
GCGGACAATC 
GATGACGAAC 
TCGGACAGCT 
GCCAAACTCG 
CCGAGTGTGT 
GACTATACGC 
CTGAGTATCC 
GAACGCAGGG 
GCCCAAGCCA 
ACCGACGCCA 
ACGGCCATCG 
AGAGCGGCCA 
GAC 



TCATCTGTAC 
TGACCCTCGA 
CTGCGGAACT 
GTGTATCGGC 
ATACCTACAC 
TTTTCGACGG 
CGCGCCTCTC 
AC GAC CT C GT 
GCCGCCTCCA 
TCCT CGAGAT 
AGTGCATCAT 
TCGTCGTAGA 
CGGCCGGCGT 
ACGAGCAGGC 
CGGTATCCGG 
CCTTCAGCGA 
CCATCTTTTC 
CGGCAAT CGT 
TGGCCGACCG 
TGCAAACCGC 
ACCTGACCAC 
TGACCTACCG 



GGCTGCCGTA 
AGAGTGCATA 
GGAGCAGTCC 
CGGAACCGGT 
CGACATCAAC 
TTTGCAGAGT 
CGTTCGCGAG 
CTATGCGCGC 
CCGGCAGACG 
GCAGTCGCGA 
CGCTCTGATC 
CGATATGCCG 
CTTCGCCCGT 
TGCCACCGAC 
AGGATGGAAC 
GCAGTTTCGG 
GGGATTCAGC 
CCGACGGGGC 
GGATGCCGCT 
TTACGAAGCC 
TCAGGCCAAT 
GCTCAAATGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
700 
840 
900 
9S0 
1020 

loao 

1140 
1200 
1260 
1320 
1353 



(2) INFORMATION FOR SEQ ID HO: 72 

tl) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 20BG base pairs 

(B) TYPE: nucleic acid 
<C) STRAHDEDNESS: double 
(D) TOPOLOGY: circular 

35 (ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 



40 



(iv) ANTI-SENSE: NO 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROM ON AS GtWGIVALIS 



45 



(ix> FEATURE : 

(A) NAME/ KEY: mis cofeature 
[B> LOCATION 1...2886 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO:72 



50 



55 



60 



65 



70 



75 



GCCATTTTTG 
AAGAGCATGA 
ATGGCAACTG 
TCTTCAAGAT 
GATTTCAATC 
CCCGTCCGGA 
ATCTCTTCCA 
GCCATTACGC 
ACCCCCGACC 
GCTTTTGCCA 
GGAGGGACTT 
GGACGCCCCG 
GAGATCAACA 
CAAATGATCA 
ACGAAAGGAG 
AATTTCGGTG 
AGAGGATGCA 
ATGAAGGCTT 
GAAGTCCCTC 
TCGAGCGGTG 
AAGACTCCGG 
TTCTATTTCA 
AATCAGAATA 
GAGTTCTACG 
CGCATGGACA 
TATCGTGACG 



TCGTATCATT 
ACAAATTTTA 
CACTGACCGC 
CCGCCGGAAC 
CGGAAGACCT 
TAGGACAAGT 
TCGGAGACGT 
TTTATTACGA 
ATGAAATTGT 
CAGAGCCGGT 
TGCCTGACAT 
TAACGGATAA 
TCAATTGTCC 
TGGTAAAAGG 
ACTTTACTCC 
TAACGCAATC 
GCAAT GGTAC 
TCCTCCCGAT 
TGCGCTATCG 
CCGGTATTCA 
CTCTGAATAC 
AATACGATCA 
AGCACGTGGT 
GCAGACTGAA 
TCTATCTGGA 
GTTATAAGCC 



GCAAATTGAA 
CAAATCACTT 
TTCTGCACAG 
GCATTCATTC 
GAT CGCACAG 
AATACCGGTG 
AGATGTATAT 
TGCATTCAAT 
GTTGGGAGCA 
ACCGGGGAGT 
CAAGATCTCC 
CCATTACGGG 
TGAAGGTGCA 
ACAGTATATC 
GCTGATCATT 
CGAGTTGGAT 
ATTGGCCATC 
CAAAGGTAAA 
TGTCTATTAC 
TCATCCGGCC 
ATGGATCTCC 
AGGTGGTACG 
CGGCACACTG 
CAGTCATTGG 
TCCCCAAAAC 
TTTGCCCTCT 



AAAATAACAG 
TTGCAGTCAG 
ATTTCGTTCG 
GACGATGCAA 
AGCCGTTGGC 
GATGT GGACT 
CGCCT GCAAT 
ATTCCGGAGG 
TATACGAACG 
GAGCTTATTA 
GGTGCGGGTT 
ATCGGTGAGG 
GACTGGCAGG 
TCAATGTGCT 
TCTGCCGGAC 
AAGTGGATCT 
TTCCGTGGCA 
TCCGATGGTC 
AATGGATGGG 
GGAGATGCCA 
TCCAGTGGTT 
GAAGGAGGAT 
ACCGGAGGTG 
AACGAGTATG 
AATGGCCAGA 
GTGCCCCGGC 



AGAATAAGTA 
GACTGGCTGC 
GAGGGGAACC 
TGACTATCCG 
AATCGCAAAG 
TTGCATCCAA 
TCAAGTTGGA 
GCGGACGCCT 
CCACTCATCG 
TGGATTATGA 
ATATATTCGA 
ACGATTCCGA 
CAGAGAAGAA 
CAGGCAACCT 
ACTGTGCTTC 
TCACTTTCCA 
ACAGTATCAT 
TCTTGCTGCA 
ACAGTACGCC 
TGAAGATTTC 
CCGGAGGGAC 
CGTCCGGTTC 
CCGGCAATTG 
CATCCGATGG 
CGACCATCCT 
TATTGTTGCA 



TAATTCAGAC 
CTTCGTGTCG 
CTTGAGTTTC 
CCTTACTCCG 
AGATGGCCGG 
GG CTT CGCAC 
AGGAGCCAAA 
CTATATCTAT 
CCGCAACGGA 
AGTGTCT CGC 
CAAAGTCGGC 
TTCGGATTGC 
CGGTGTGGTG 
GCTCAATAAT 
CATAACAACC 
CTATGAAAAA 
CGGAGCTTCC 
ACTCAACGAT 
CGATATTCCC 
CATCCTAAAG 
TGACGATCAC 
TTCTCTCTTC 
TGGCGGGACG 
CAATACGAGC 
CAACGGAACG 
GTCTACAGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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GATCAGGTCG AATTGAATTG GACGGCTGTT CCTGCCGATC AATATCCATC ATCTTATCAG 1620 

GTCGAATACC ACATATTCCG AAATGGAAAG GAAATAGCTA CGACAAAGGA GTTGTCCTAT 1680 

TCGGATGCCA TCGACGAAAG TATTATCGGT AGCGGTATCA TTCGATACGA AGTAAGCGCA 1/40 

CGCTTCATTT ATCCCTCGCC GTTGGATGGA GTGGAATCTT ATAAGGATAC GGACAAGACT 1000 

5 TCTGOCGACC TTGCCATAGG AGACATTCAG ACCAAGCTGA AGCCGGACGT AACACCTCTC 1860 

CCCGGAGGAG GAGTATCATT AAGCTGGAAA GTTCCTTTCT TAAGCGAGTT GGTTTCCCGA 1920 

TTCGGAGAAA GCCCCAATCC TGTGTTCAAA ACCTTTGAAG TGCCCTATGT TTCTGCCGCA I960 

GCCCCACAAA CCCCCAATCC TCCCGTTGGC GTAGTCATTG CAGACAAGTT TATGGCCGGT 204 0 

ACATATCCCG AAAAGGCTGC TATCGCTGCC GTTTATGTAA TGCCATCCGC TCCGGACTCT 2100 

10 ACTTTCCACC TCTTCCTCAA GAGCAACACA AACAGAAGAT TGCAGAAGGT GACAACTCCC 2160 

TCCGATTGGC AGGCCGGAAC ATGGTTGAGG ATCAATTTGG ATAAGCCGTT CCCGGTGAAT 2220 

AATGACCATA TGCTTTTTGC CGGTATCAGA ATGCCTAATA AGTACAAGCT CAATCGTGCT 2260 

ATCCGTTATG TAAGAAATCC GGATAACCTT TTCTCCATTA CCGGTAAGAA GATTTCATAT 234 0 

AACAACGGAG TCT CTTTCGA AGGCTACGGA ATACCCTCGC TCTTGGGCTA TATGGCTATC 24 00 

15 AAATATCTGG TGGTAAATAC CGATGCTCCG AAGATCGATA TGTCGCTTGT ACAGGAGCCT 24 60 

TATGCTAAGG GAACGAATGT GGCTCCATTC CCCGAATTGG TCGGCATATA TGTCTATAAG 2520 

AACGGAACAT TT AT CGGCAC ACAGGATCCA TCCGTCACAA CTTATTCGGT TTCAGACGGA 2560 

ACAGAGAGCG ATGAATACGA AATAAAACTG GTATATAAGG GAT CGGGCAT TTCGAATGGC 26-10 

GTTGCTCAGA TTGAGAATAA CAATGCTGTC GTTGCATATC CGTCTGTTGT AACAGATCGT 27 00 

20 TTCAGCATTA AGAACGCTCA TATGGTTCAC GCTGCCGCCC TCTACTCATT GGATGGCAAG 27 60 

CAGGTTCGTT CTTGGAACAA CCTCCGCAAT GGCGTGACAT TCAGT GTTCA AGGACTTACG 2 620 

GCCGGTACTT ATATGCTCGT TATGCAGACG GCAAACGGCC CTGTGAGCCA AAAGATCGTG 2860 

AAGCAG 2886 



25 



35 



{21 INPORIlATIOIf FOR SEQ ID NO: 73 



(i) SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 2106 base pairs 
30 ( B) TYPE: nucleic acid 

{CJ STRANDEDNESS : double 
( 0) TOPOLOGY: circular 



(ii) ffOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(AJ ! IAME/ KEY : wis cofeature 
45 ( B) LOCATION 1...2106 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

TGGAAAAAAA GTAATCCTCA ATACCGGGGC AGGACAGCAA ATTACCGAGA TCATTATAAT 60 

50 ACAGAAAGCT ATTTAAGCCT AAGGAGGAAT CAAACAATGA AATATCTTAT CAGACTCTTC 120 

TTATCATTGA TGTT ACT CTC TCTCTGGACG GGCTGTACAC ACGAGGAGCT CTCTATTTGC 180 

GATGGCGAGA ATACGCTTGT TTTACGCGTA GAGACCGGTA AAGCCCCAAA TGCTCGTGCC 24 0 

ACAGAACCCG GTCAGGGCAT AT ACAAT GAG AATAAAGTAG GCTCCATTTC TGTGCTCTTC 300 

TATTTACAGG GACAACTTCG TTGGCAGGTG AAGTCTACAG ACTATCAAAT CCATGAAGGG 360 

55 GCCTATATCA TTCCGGTCAA AGAGCAAATG CGACCACTAT TCAATGGCAA CAACAACTTC 420 

AGCATCTATG TAGTGGCCAA TCTCGATTTC AATGCTCCGG CCACAGAAGC TGCGCTTTCT 480 

CAATTTGTGG TAGAGAAATC TATTGAAGTC TCTTCTACGA CAGCCCCTGC CGATTTCGTA 540 

ATGCTTGCTC ATGGCAATAA GCAGATCAAT ATGGCTACGA CAGAAGGGAA ACTGTTGGGG 600 

GATTATAAAC TCAAACGAGT GGCAGCAAAG ATTCGCATGA TAAAACCCAC CATCAATGTG 660 

60 CAAGGATATG AAGTGGTCGG AAATATACAG GCAAAGTTTC GCAATTCGGT AACGAAGGGG 720 

TTCCTTACCA CAGAAGCTCA AGAGATCCCA GCTGCTGCAT CCTATAAGAC ATCGGAATAT 780 

CTTGATATTG CAGAGTCGGC ACCTGCCAAT TCTATCCATT TCTATTCTTA CTATAACAAA 84 0 

TGGACACTCT CCACACCGGA GAAGCGACCG GAATTCTTCA TCATGGTCAA ATTCAAAAAG 900 

ACAGGACAGC CGGACAACAC AGCCAAACCG TACTACTACA GAGTGCCCCT CGAATCTCAG 960 

65 GACAATCAGG TCAAGAGCAA TGTCCTCTAT AATCTGAATG TGAAAATCGA AATCTTGGGT 1020 

TCTTTACAAG AGCCGGAAGC TGTTTCTGTA AACGGCACAC TCGCAATAGA AGAATGGATT 1080 

CTCCATCAGG ATGCATTCAA TCTGCCTGCC ACCAATTACT TGATAGTGGA ACAGCACGAA 114 0 

ATCTTCATGA ATAACGTGAA C A CAT ACT CG GTGAAATATC AAACTTCGCA GAAACCAATC 1200 
AGCATTAGCA TACAGTCAGT TACCTTTAGC TACGT CTCTT CTGATGGCAC T CAGC ACAAT , 1260 

70 CATCTTGTAG CAAGTAGTAG CGACCAGTAT CCTACGATTA CAAGCGATAA TACAAGCATC 1320 

ATAATCACTT CCAAGATACC GGTTAATAAC GTACCAAAGA AGATCGTTTT TGAGGTAACT 1300 

AATGGGGTAG CCGGTTTGAA AGAGACTGTC ACAGTACTCC AATATCCTGC ACAATTTATT 1440 

GTCAATACAC TTGGCACAGC ATCGGCATGG AGACCAGACG GATCTTTGGC TCCGGGGCTT 1500 

- AACAATAAAG CGATTTACCA TGTCGTAGTA CTGCTTCCAC CCGAGAATTT ATTTGAAGAT 1560 

75 GGGACACAGA CAATCATCGG TTATCCCCCC ACTGAAACAA TTTCTTTTCA TAAGAAAGAG 1620 
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10 



AACAATACCT 
TCAAGAATGA 
CCCTATCTCG 
AATAAGAGAT 
G AAG AAA TT A 
AAGCTGCAAC 
GATAGTTACT 
TCCAAAGCCT 
GGTAAG 



ATCCGATACT 
TTTCACCTTC 
AGTATTGGCC 
ACGCCTTGTT 
AATTCGATGA 
ATAATGAGCA 
CTGCAAATGG 
ATGTTCGTTG 



ATGGTCTGAC 
CTTTGAGTTA 
AGGGACATCA 
TAATTGCGCT 
CTGGCGTTTG 
GAGTGCTGTC 
GTCTTATAAA 
CGTG CGGGAT 



ACAAATACGA 
GCCTCCCAAC 
TATCTCGTTG 
TTTTACTGGG 
CCGACAGAAG 
CAAGCTATCA 
ATGCAAGGAG 
GTGAAAAAGC 



CAAAACAGGA 
TTGGGGCTAC 
ACTATTCGGG 
AGAAAAGAAA 
CTGAGATCAA 
TGACAGGGAA 
GAGGGGGCCA 
CGATTCGTGA 



CCTTGAGACA 
TCTCCCGATG 
AAACTATAAT 
AGTTAATAAC 
ATTGATAGAT 
TTATTATTGG 
AGGAAATTCC 
CAAGAAGTCA 



1680 
1740 
1800 
1B60 
1920 
1980 
2040 
2100 
2106 



(2) INFORMATION FOR SEQ ID NO: 7 4 

ti> SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH : 3936 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(D) TOPOLOGY; circular 

20 {ill MOLECULE TYPE: PNA (genomic) 

<iii) HYPOTHETICAL: NO 



25 



Ut) AllTI-SEUSE: HO 



ORIGINAL SOURCE: 

(A) ORGANISM; PORYPHYROi IOHAS GIUCIVALIS 



30 



(ix) FEATURE: 

(A) NAME/ KEY : mise_f eature 
( D) LOCATION 1...3936 



(r.i) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AAGAGGGTAG 
CAAAAATTTA 
TGCTCTCTGC 
GCCCTACCTT 
TTCCCCG GAT 
ATGCCGGGTT 
ATGATAGCCG 
ACATTCGACA 
GGGGGGACTT 
AGCCTCCCTT 
ATCGAAGTTA 
AAAAAAATCG 
GGCATATTTA 
GCGATAGAAA 
GCCAGCAACC 
AATCAACAAC 
TTTGATGTAG 
TCATACATCA 
GCGAAGAGGA 
AAATATTT GG 
TTAATTCCAA 
GTTTCGCCCC 
CATGAATTGC 
GCACACAAAA 
TTCTTAAAAG 
TCTGCAATAG 
ACTCATGGAA 
GCCCGATTGA 
GTAGCAGATT 
GGAT T CT CG G 
CCCTGTCCTC 
ACTGTTGTTG 
TCGAAATTCA 
GAOGTTAGTA 
AAAAATGGTG 
TCCATACAAG 
GAGACCGGTG 
CAGGTAGAGA 
AAATTTCATT 
GTGGAGATAA 
CTAAGCGTTG 



AT AGCT AT G A 
CTAATGTAAA 
ATTTAGGTCT 
CTATCGGTAT 
TTTATAGTGT 
GTGGCTCGTT 
TTCCGGAATT 
ATTATAATAT 
ATCTGGTAGA 
CTACTCACTA 
CCCTGTATCC 
AGGTTACAAT 
ACAAAGTAGC 
ATGATATGGT 
TCCCTCACAA 
CACACGACGA 
CTGCTGTAAG 
ACGAAACTAA 
CTTTAGATGG 
CTGACACTGA 
GTCATCCAAC 
TTGATACTGT 
ACAATCTGAT 
ATATTCTTTA 
AAATCGCCTC 
ACTCGATATT 
TGCCGACTGT 
ACAATACATC 
CAACTATTAG 
CTTTCTTAGG 
CGTCAGAATT 
GCGAAATGTT 
ACTTCAATTT 
ATTGTATTAC 
GCTGCCTAAA 
TCATGTCCGG 
CTAACCCCAC 
TAGACAATAT 
TTGACAGTGT 
GCAATTGCGA 
AAAACAGTAT 



ATGCAGACAA 
ACTAAATGAT 
ACAATCT C AG 
TCAAGAGTCA 
GGAAAAACGA 
TGGGAATCTG 
T TCAACAGCT 
CTATCCTAAT 
GGCTTTCGCG 
TGTCTATTCT 
TTTTCGATAC 
AACTTTCGAT 
CTCCTCTCCA 
GTTCAGTCGT 
CTGTGACTAC 
AATCAAACGG 
TATAAAGGAC 
ACTGAAAAAT 
CAAACTGGGA 
TAATACAAAA 
TTTTGGTTCC 
CGGCGATTTG 
TGAAAAGACT 
CGCAGAAGGG 
TGGTTACACA 
TGACTGCTTG 
TTG3GGGATA 
TTCGCAGGGA 
ATCGCTTGGA 
AGGAAGCAGA 
TTATGAATAT 
GCTATCATCC 
CCTTGGCUAC- 
ACTACCAAAC 
AATACCGGAA 
AGGAACTCTG 
CTTTATTACC 
AGACCGAC1T 
GAAATTCAAC 
ATTTACCAAT 
GTTTAGCAGT 



AAAGCT TGCA 
ATGCGAAAAA 
ACTTGGCATG 
AGTTGTACCC 
GAAGGCAACC 
GGCGAAGCTG 
AACGTTGCTG 
CCTACCTATG 
ATAAACAATG 
CAAGACGGGT 
AACCCTGTCC 
AATCCTCAGC 
TTTATTAATT 
GGTACAACAA 
TTGGTTATTT 
CTGTGCGAAC 
GTATTGAATA 
TTCATTCGCT 
TACGTGCTAC 
GTCCCAACCT 
ATATGCGCCT 
TTTATCGGTC 
ATCAACAAAG 
AAAGGCTGCG 
GTCAACTCTA 
AATAATGGTT 
GGGCAGGGAC 
TTATGTACGA 
GAAGT CCTGA 
GCCACCCAAT 

Ttaccttatt 
attatcaata 

CCTGCACTAA 
AACACCATTA 
AAAGGAGTTT 
GAAATAGGCA 
GTTTACGGCG 
AACTTGTTTT 
AGTGCCCCGC 
CGAAGTGACA 
TCGGGGATAA 



AATGTGCAAT 
TTTTGAGCTT 
GAGATCCGGA 
GAATCACGTT 
AAGTCTTTCA 
AATTGCCTGT 
TAAAAATCAA 
TCGTAGAGGA 
ACTATTATAG 
ATTTTCGCTC 
GACAAGAAAT 
CACCTTTACA 
ATUAAGCTGA 
CGTACATAAG 
ACGATGATAT 
ATAGAGCCTT 
GCTTCCCATC 
CAGTTTACAA 
TGATCGGAAA 
CTTTTATTCA 
CCGACTATTT 
GATTTAGCGT 
AAATCTCATA 
ATGCTCCAAT 
TCTTAAAATC 
CCCATCATTT 
TCGACGTCAA 
GTCTATCATG 
CCACATACGC 
ATGCCGTTTA 
CTTTATATCA 
CTAATTCTGT 
ACATTATGGC 
TAAGCAGTCC 
TGCATTTTAC 
ATCAGGCTAA 
ATGGTCTTGC 
CTACGCATTC 
TGTATACAAC 
TTATTTCAAA 
CGGTATTCAA 



TTGTGTGATA 
TTTGATGATG 
CTCAGTGGCA 
CGAGGTTGTT 
GCGCATTTCC 
TTTGAAAAAG 
AGAGACGGAG 
GTTGCCTGAG 
CCAAAATGTA 
ACAAAGATTT 
TCTATTTGCA 
AAAAAACACC 
TGGCAAATCG 
CGGAAATGTT 
GTTCAACGTA 
CTACAACGGC 
AAATGCCACC 
CCAAAGCAAT 
ACCATTGAGC 
TAATGTCTCC 
TTTTAGTTCT 
CACCAATGCT 
TAATCCTATT 
CTTACGTTTA 
TAATCAGGTC 
TTATTTTAAC 
TACTCTAACA 
TAGTTCGGCT 
ACCTAACAAG 
TTTAGAAGGC 
CAATCTCTCG 
TGATACGTAT 
TCATGGCATG 
GATAACAATA 
TAATAATGGC 
AATATCCGGA 
GATTAACAAG 
GGTCATGCCC 
GAACTGTATT 
GAATTGTGAC 
GCCTATGGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1000 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



51 /490 



ACAAGCTCCA TCACCGGATT ATCTACAAAA 
ACAGGAAACT TCGCCTACCA TATCACAAAC 
GCC7\TCAAGT TAGACAATAT TCCTGAGTAT 
GATGAGGCTC TTGTACTAAA TAATAGTGGC 
5 CGGAATGTGA TAAAAAACTG TAGGATTGGG 
AACCGAAATA AGATCAGTAA CAATCATATA 
TATTTCGATA ATGCTCCTGT AATCAATGAA 
ACTTGGCAGC TCTATTCATC AAACGGTACA 
CAGGGGGGAG ATACAGATAC ATGGATTTAC 

10 GTTTCAAATA ATCACTGGGG CAACAATGA7 
CCAGACTTGT TCATTTGGAT ACCTTTTTGG 
AGCTCTGCTG AAG CAGT AG A ATTCCAAACA 
CTTTCGGCAA AAGTGGCTCT C AAG AT GAT G 
ATAGCTGCTT TGAAGGAATT GTTCAGGATA 

15 TTGAAAGATT ATTTCAGATC CAATCCAACC 
GCTGATTTCC TGTCTGCGCG ATGCGATATT 
TGGTACGAAA ATCGCTTGAA TAGTGAAATC 
GACCTTGGTG ACATTTATTG GAATAT GCAG 
TTuAACATAC TTTCCTGTGA ACAAAGGAAA 

20 TATTTGTTGT CAACTCTT CC CGAATCAACA 
AAATGAAGCC TTGATAAATC CAAGATAATC 
GTAACAATAA TCTACTATAC CGATAACCCT 
AATGGACCCT CGGCTGATAT AACCGGGTTG 
ATACAGTTCA ATACATCCAA CTTTGATCCC 

25 CAGAAAATTA TAGATACGGA AAAATTACGA 



GCAAAGATTA CCGACAATAC TTTTTTTGCG 2520 

acgcoaggct TAACAGCAAC CTCCAATGCT 2580 

TACATTTCCG GTAATAAAAT AGTCAATTGC 2640 

AACAGAACGA ACAGACTCCA CAATATCACA 2700 

AGCACGCTTT ATAATTCCTA TGGTATTTAC 27 60 

GGAGTACGTC TCCTCAACAA CAGTTGTTTT 2820 

GAAGATAAGC AGACGTTTAT TTCTAATAGG 2880 

TTCCCTCTCA ACTTCCATTA CAACAGCTTG 2940 

AACGACACGT ATACGAATCG CTATATTGAC 3000 

TTGTTTGATC CGAATCAGGT TTTCAATACG 3060 

GATGGATTGC CAAATGGGAG ATCGGGCAAT 3120 

GCATTGGACT GTATTGGCAA TAGCGATTAT 3180 

GTTGAAACCT ACCCGGAATC CGACTTTGCA 32 4 0 

GAGAAAATGT CAGGCAACGA TTACGAAGGC 3300 

ATCATCTCTT CCCAGAACTT GTTCCCGACA 3360 

GTGTGTGAAA ACT AT CAGT C TGC CATC GAT 34 20 

TCCTATCAGG ACAGTGTTTT TGCAGTCATT 3480 

TTAGACTCAC TCAGAGGGAC TGGTATAGAT 354 0 

TCGCTCGAAA GCCATCAAAA TGTAAAAAAT 3600 

GGT ACT CTCC TGCCTCCATT AGAATGCAAC 3660 

TCTATTTCGC CCAATCCGGC GAAAGCTGTT 3720 

TCCTGTTCTG TAATAAAAAT ATATGGAATA 3780 

CCCAAACATC TATCCGAAGG TTATTACAGC 3840 

GGTTTCTACC TGGTAACGCT AAATGTTGAT 3 900 

ATCAAA 3936 



(2) INFORMATION FOR SEO. ID NO: 7 5 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2814 base pairs 

(B ) TYPE: nucleic acid 
<C) STRAHDEDNESS : double 
(D) TOPOLOGY: circular 

35 

(11 ) MOLECULE TYPE : UNA (genomic) 

tiii) HYPOTHETICAL: HO 

40 (ivj AHTI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

{A| ORGANISM: PORY PHYROMON AS GIMGIVALIS 

45 (ix) FEATURE: 

(A) NAME /KEY: mlsc_feature 
IB) LOCATION 1...2814 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 



TCAGAAAATT ATAGATACGG AAAAATTACG AATCAAATAA TGGCTATCAT GATGAAAAGT 60 

ATTGTTTTTA GAGCATTTCT AACGATTTTG CTCTCGTGGG CAGCSATCAC GAATCCGACT 120 

GCTCAAGAGA TCTCAGGCAT GAATGCATCC TGTCTGGCTG CTCCGGCTCA ACCGGATACT 180 

ATCTTATATG AAAGTTTTGA GAATGGACCT GTTCCCAATG GCTGGCTTGA GATAGATGC1* 240 

55 GATGCTGATG GTGCCACTTG GGGAAGCCCA TCAGGCTCTT TCTCTGTACC TTACGGACAC 300 

AATGGCCTTT GCACCTACTC CCATATACGT TCCGGTATCT CAACAGCGGG CAACTATCTG 360 

ATTACACCCA ATATAGAAGG AGCCAAACGG GTCAAGTACT GGGTATGCAA TCAGTATAGT 420 

ACCAATCCGG AACATTACGC AGTAATGGTA TCGACAACGG GGACTGCCAT T GAAG ACT'TT 480 

GTTTTGTTGT TTGATGATTC CATAACAGGG AAACCGACTC CTCTTGTATG GCGTAGACGA 540 

60 ATCGTGGACT TACCGGAAGG GACCAAATAT ATTGCATGGC G ACATT ACAA AGTCACCGAC 600 

TCACACACAG AATTCTTGAA ATTGGATGAT GTCACTGTGT ATAGGTCGAT CGAAGGGCCC 660 

GAACCTGCT A CCGACTTCAC AGTAATCAAT ATT GGTCAGA ATGTGGGACG ATTGACTTGG 720 

AACTATCCGG AGGATTATCA ACCGGAAGGA AAGGGGAATG AAGAGTTGCA CCTTAGCGGC 780 

TACAACATCT ATGCGAACGG T ACACT ACT G GCACAAATAA AAGATGTCTC CATACTGGAG 84 0 

65 TATGTGGACA GCACTTACTC IT TGC GAG AC AATCCCTTGC AAG TGG AGT A CTGCGTTACA 900 

GCCGTTTACG ATGAAAGCAT AGAATCTTCG ACCGTATGTG CCACGCTGCA TTACGCCACG 960 

GATGCCATCC TTTATGAAAA TTTTGAGAAT GGACCTGTTC CCAATGGTTG GCTTGTGATA 1020 

GACGCTGATG GAGATGGATT TAGCTGGGGA CACTATTTGA ATGCATACGA CGCTTTTCCC 1080 

GGCCATAATG GAGGCCATTG CTCCTTGTCG GCTTCTTATG TT.CCGGGTAT AGGCCCGGTG 1140 

70 ACTCCCGACA ACTATCfGAT TACCCCCAAG GTTGAAGGAG CCAAACGTGT CAAGTACTGG 1200 

GTAAGCACGC AGGATGCCAA TTGGGCAGCG GAACATTACG CGGTGATGGC TTCGACAACG 1260 

GGGACTGCTG TCGGAGATTT CGTCATATTG TTCGAAGAAA CCATGACAGC GAAGCCGACC 1320 

GGCGCATGGT ATGAAAGAAC CATCAACTTA CCTGAAGGGA CTAAATACAT CGCATGGCGG 1380 

CATTACAACT GTACCGATAT ATATTTCTTG AAGTTGGACG ATATCACTGT ATTCGGGACT 14 4 0 

75 CCTGCATCAG AGCCCGAACC TGTTACCGAT TTCGTTGTCT CGCTTATTGA AAACAACAAG 1S0C* 
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10 



15 



20 



GGACCATTAA 
CCATTGCAGC 
GACCCGACTG 
GTGGAATATT 
AACCTGATTT 
ATTCCTGAAG 
CCTTGGACTA 
ATGATTGGCG 
CTTCTCAAGT 
ATGGTTTCTA 
ACCGCTAAGG 
TATATTGCCT 
ACGGTTTATC 
ATTGAGAATA 
ACTGATGATA 
CTTGTTCACA 
GACCGTCAGG 
CAATCGGTTT 
ACAAGCTTGA 
CGGAGCAAGT 
ACTCATTCAG 
AAAGTAGTCG 



AGTGGAATTA 
TTGCCGGCTA 
TTTTGGAGTA 
GTGTCACTGC 
ATGATTCTCA 
GCTGGTTGTT 
TGTATGGACA 
TTTTAACTCC 
ATTGGGTAAG 
CTACGGGAAC 
CTAACGGTGC 
G GCGGCATT A 
GTTCTACTGA 
ACAAGGGTCG 
AAAAACCATT 
TACAAGACCC 
TGGAAATGGA 
GCGATAAGCT 
AAATATATCC 
CGACAATCGA 
AGAAAACGGA 
GTGGAAATAA 



T CCTAACGG C 
CAATATCTAT 
TATCGATGAG 
CGTTTATAAC 
ATCGGACATT 
GATTGATGCT 
TGACAGTGAG 
GGATAACTAT 
TGCGCAAGAT 
TGCTGTTGAA 
ATGGTATGAG 
TGATTGCACC 
GACTGTTCCC 
CCTGAAATGG 
GCAGCTTACC 
GACTGTTTTG 
ATATTGTGTC 
GAACTATACT 
TAATCCGGCA 
GTTGTATAAT 
AATCGATGTT 
AACAACAACC 



TACGAACCCG 
GCAAACGGCT 
ACTTATTCTT 
GACAATATCG 
ATCTTATATG 
GATGGCGACA 
AAGTGTATTG 
TTGGTTACAC 
GCTGTTTATT 
GATTTTGTCC 
CGAACTATTA 
GATATGTTTT 
GAGCCTGTTA 
AATTATCCTA 
GGCTACAACA 
GAGTATATCG 
ACTGCCGTTT 
ATCACATCCT 
TCGTATGTGG 
GCGCTGGGAA 
TCACGTCTCA 
GAAAAGGTAG 



ATAAGACTGA 


TGATAAAGAC 


1560 


CGCTCCTTGT 


TCACATACAA 


1620 


CACGAGACGA 


TCAGGTGGAA 


1680 


AGTCCCAATC 


GGTTTGCGAT 


1740 


AAGGCTTTGA 


GGCCGGAAGT 


1800 


ATGTTAATTG 


GGACTATTAT 


I860 


CATCCCCTTC 


GTACTTACCG 


1920 


CCAGACTCGA 


AGGAGCCAAG 


1980 


CGCCTGAGCA 


TTATGCTGTG 


2010 


TCTTGTTCGA 


AGAGACAATG 


2100 


CATTGCCTGC 


AGGAACAAAA 


2160 


TCTTGCTCTT 


GGATGACATT 


2220 


CTGATTTCGT 


TGTCTCGCTT 


2280 


ACGGCTACGA 


ACCCGATAAG 


2340 


TCTATGCAAA 


TGGCTCGCTC 


2400 


ATGAGACTTA 


TTCTTCACGA 


2460 


ATAACGACAA 


TATCGAGTCC 


2520 


TGGATAATAT 


TCAATCTGAT 


2580 


TAAGGATAGA 


GGGATTGAGT 


2640 


TTTGCATATT 


AAGGGAAGAG 


2700 


ATGACGGAGT 


CTACTTGATT 


2760 


AGATAAAGAG 


GCCG 


2814 



25 



30 



45 



50 



55 



60 



65 



70 



75 



(21 INFORMATION FOR SEQ ID HOt7 6 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 Jaase pairs 
f B ) TYPE: nucleic acid 
<C» STKAMOEDNESS: double 
tD) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

35 (iii) HYPOTHETICAL: NO 

|iv) AHTI-SENSE: HO 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM : PORYPHYROMQNAS GINGIVALIS 



(lx> FEATURE: 

(A) NAME/ KEY : mifc_feature 

(B) LOCATION 1...1818 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 



ATAATCTTCT 
CAATTATATA 
CGAAATACAG 
CAGACTGTGG 
CGATTAAAAC 
TCCTCTTCAG 
CGGGGGTACC 
CGTCTGATAG 
TCGGCTTTCA 
GTGGACTACG 
CATTATTTCA 
TCGACACCTG 
GCAAAAAATG 
CTGGGTACCG 
AGTAATGAGT 
TTTGCCAAAA 
TATTATGTGG 
AACATACAGG 
TGGCCTAAAC 
TTCGGCGGTG 
ATGCCCAATA 
GGGAATATAG 
GTGCCTTTCT 
GT GAGXTl' CT 
TACTCTTATC 
GATGGAGGAC 
CATCCCATTG 
TATTCTTTCG 



GTACGATTCA 
CGGCCGTCAT 
AAACCAAACG 
AGATGGAGCA 
CTTTCCGTCC 
GCAGGAACAA 
TGAATATCGG 
AT GC AG AG CA 
ATACCGGTGA 
AGCAGCGCAG 
ATAACTACGG 
TTACTCCTCA 
ATGTGATCGA 
ATCCGATGAA 
TGTCCGATGA 
ACAGCGAAAT 
AGGGCGCGCC 
CCGGAGTAGG 
TGGATGCTTC 
TGATTCGAAA 
CGATTGTACT 
CCGATGTGGT 
AT ACT CCGAC 
TGCCGATATA 
GCGATATGCT 
TTGTCGCCTC 
CCCCATTGGA 
GCTCGGCTGG 



TCACTCTGAG 
AATGGCTCTC 
CCCCGACACG 
TGCGGATCCG 
GGAATATAAC 
TCTTCCGAAT 
TATCGGCCAT 
GGAGAGACTG 
CTTCGACGGC 
GCCTTCCTTT 
ACGGGGAGCT 
GATGGACAAC 
TGCCAGGATC 
GGCTCTCACA 
TATTAAGCTC 
GATTCAAACG 
CACAATCGGA 
GATTTCTTCC 
GCTTAGTATC 
TGGTCTCGCC 
CCCTTCGCGC 
ACGTATGGAG 
TCTACCCTTA 
TGCCGACGGC 
CCGCTTTCTG 
CATGCAGCCC 
TGTCAGACTC 
CTCGGAAGCC 



TTGGAAATAA 
TCTGTATCAT 
CTGCGCAGGG 
CTTCCGGCTG 
AAGCGTACAT 
ATCCTGCCGA 
ACGCTAAACC 
AATCTTTTCC 
GACAGAAAGG 
GTGCTTGCTA 
ACCACCAATG 
GGGACCCACA 
GACTATCGTT 
GAACATACGC 
GGTGTCGAAG 
GGCGTTCTGT 
TTTGTCGGAG 
CATTTCGGAG 
TTCCCTTCAT 
GATGTTATGC 
AATGCTTTGA 
GTTTATGGCG 
TATAATCCAT 
ACCCGCTGGC 
GTAGACGCAT 
GATCTTATAT 
CGGTATACAC 
TTGGGTATCG 



TGAACAGCAT 
CCGTTTGCGG 
AGCTTACTAT 
CATACAAGGC 
TCGGATTTGT 
CGGAAGGTCA 
AGCGAATGGA 
TCTCCTATCG 
ATAGACGAAT 
CCGGCTTGTA 
TGGGCAGCAT 
ACGTCCGTGT 
TCTTCCGTTC 
CTGAACTGAA 
TTCGTACGGG 
CCGAAACCGA 
ACTCGGACAA 
CCAAAGGGAG 
GGCGTGTGTA 
AAGAGGAGAT 
OCGCCCAATT 
ACTTCTCCAA 
CCGACTTGTA 
GCGCAGGTGG 
CCTATGGCAA 
TGAAGGCAGA 
AGCTGAACGG 
GTAATGTACA 



CATGAAATAT 
TCAAACCCCA 
CGTTAATGAC 
CATCGAACCT 
CCCTGAAGTT 
TATGAAGCAC 
TGCCGGCTAT 
TGGGATGAAA 
GATGGCAGGA 
TTATTCGAAC 
CCCTCAGCTA 
ATACTTGGGT 
TATTCCCTAT 
CGTGACGATG 
AGGATTGTTT 
CCGCAACCTG 
TATGCAATGG 
GTTGTTTTTC 
TGCGAAAGCC 
GCCCTACCTG 
AGGGGTGAAG 
GCTGACAGGT 
TCAGTATAAT 
TAAGCTGGAA 
GTGGAATTTG 
AGTAGGTGTT 
ACGGTATCGG 
TCTTCTTAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
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GCGGATGTTT CATACAAGCT GAAAAAGAAC TTGAGCCTTT ATCTCAAAAT CGATAATATG 17 40 

CTGGCGGAAA CGACAGAGCT TATCGGTTAT TATCCTATGC AGCCGTTCCA TTGTTTCGCC lftOO 
GGTTTTAGCT GGACTTTC ^ iH °° 



15 



50 



70 



1819 



(2) INFORMATION FOR SEQ ID NO: 77 

(1) SEQUENCE CHARACTERISTICS: 
Afx {A) LENGTH: 1071 base pairs 

1° tB) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 



(li) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 
liv) ANTI -SENSE: HO 



20 (vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONA5 GINGIVAI.IS 

(ix) rEATURE: 
0 (A) NAME/ KEY : miscjeature 

^5 (B> LOCATION 1...1071 

(;ti) SEQUENCE DESCRIPTION: SEQ ID NO:77 



n n AACTCCCATG CAACAGTCTC CGTTATATGT TCAATGATGG AAAAATGTAT CTTTGCTCAC 6C 

30 TATCCACATA ACCTTGTGTT CATGATTCGC AAGCATTTCG GTATCATTTT GGGATTTCTT 12C 

TCTCTTGTGT TTTCGGCAGG TGCTCAACAA GAGAAGCAGG TGTTTCATTT TCTGAACCTT IOC 

CCGGCTACTG CACAGGCTTT GGCTGCCGGA GGCAAAGCTA TCACCATCGT AGACGACAAT 24C 

CCCGGACTGG CTTTTGAGAA T CCGGCTCTG CTCGGATATG AATCCGGTGG CCGCGC^ 30C 

CTTTCCTATT TATATTATAT GAGTGGTTCG CATATGGGCA ATGCCTGTTA TGCCTCGTCC 360 

35 GTCGGAGAGC GTGGCATGTG GGGTGTTGGC ATGCGTTTCC TGAACTACGG GTCTATGCAA 420 

GGATACGAT C AGAATGCGAT TGCCACCGGC TCTTTTAGTG CTTCGGATAT AG CT GTAC AA 480 

2™^I ACA GCCATGAACT ^CAACCAC TTCCGCGGTG GAGTCAGCCT AA^G^TT^ sTo 

Znll^lfl* TCGAGACGTA TAGTTCCTTT GGCCTTGGT G TGGATGTCGG TATCAGTTAT 600 

40 In^™™ ACAAAGGATA TTCCGCTTCC GCTCTGTTCA AGAACGTAGG GGCGCAACTG 660 

40 AAAGGCTATA ATGAAGAACG GGAACCGCTC GATTGGGATT TCCAGCTCGG CTTTTCCCGC 720 

AGTTTTATGA ATGCT CCGTT TCGCT T GCAC ATCACGTTGT TCAATCTGAA TCCGCACTAT 7 80 

IT^^T C TTGTACCACG CGATCTGTCC AAGATGCAAA AGTTCCTCCG ACACTTCTCG 840 

*™ GG ^?* G AAaTTACTCC TTCCGAGAGG TTTTGGGTCG GGCTGGGATA TACGCCACAG 900 

ATTGCACAGG ATTTCGAGGT GGAAGGCGGC AACAAATGGG GAGGTCTTTC GGCCGGCGTC 960 

45 2™=" CAGGTGTAGT ACGTGTAGGC GT ATCTGCT G CCACCTATcI TCCTGCAGCT llzO 

CTTTCGTTCA TGTGTTCGGT AGGTATCCGT TTGGACGATA AGAGCATCTT C 1071 



<2) I H FORMAT I ON FOR SEQ ID HO: 7 8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 
<C) STRAMDEDNESS : double 

5b <D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

oU 

[lv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
g 5 < A » ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATIOH 1. . .1011 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78 

CCTCAGCCCG TCGGCCTTAA AGAAATAACC ATTAAACCCA TGTGCCTCG* A-CCATAAT" 
GCTCCGATTT CATCCGAGTT GCTCGAGCAG GAGCTGACTG CCGATCCTTT T^TGCGGATG 
ACAAACAAAG CCGGCAATGA GATCTATGTT TTTACGGCCG AAGAAGCTCC C^GC^rG 
AAAGAAGTAG CCCGACTGCG AGAAGAAGCC TTTCGGCATT ATGCCgUS 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



O : y S H «I. £ ci k.* i» Onb ff£ o «^ 



WO 99/29870 



PCT/AU98/01023 



54 / 490 



10 



15 



GCGATCGATA 
GATCCGCAAA 
TTCGATACCG 
TTTTTGCACG 
TACCAATCGA 
ATCGGACCAC 
TACAAAGACT 
TCCGATCCGG 
GACGAAGCCT 
GTGCGCAAGC 
GAGATGCGTC 
ATATrCATTG 
ATCCTCAGCC 



TAGACGAGTT 
ACAAGGCTAT 
ATGGCAAGCC 
ATTATCTCCC 
CACGGATGGG 
TCAGTGTAGT 
ATGATCGGCG 
AAGGCTTGGT 
TGTTCTCCTC 
TGUGTATCAA 
TTTTCGGCAC 
CTGTn GGTAA 
GGAACGAAAA 



CGACACCATG 
ACTCGGAGGC 
TTTGCTGGCA 
CTACACAGTC 
CACAAAGGCC 
CAATCCAGAG 
AGCTCGCAAT 
CAAGCCTATT 
ATCCGACTTT 
TATCCCTCCT 
TGCAGTGAAT 
GAT CCTGGAA 
AAAAGGTCTC 



CCCGGGAGCT 
TACCGCTTTA 
ACGGCAGAGA 
GAATTGGGAC 
ATTT TTGTGC 
GCACTCTATT 
CTGATCCTGT 
CATCCCCTAC 
GACACCAATT 
CTC6TGAGTG 
GAGTCTTTCG 
GAGAAAAAAC 
GACAGTAGCA 



ACAAACAGCT 
TCTATGGGCG 
TGT TTCGCTT 
GTTCGTTCGT 
TGGACAATCT 
TCTATGGCAA 
ATTTTCTTCG 
CGATAGAGAT 
ACAAGACTCT 
CATATATAGC 
GAGAGGTGGA 
AACGGCACAT 
ATGGCCGATC 



GATCGTATGG 
GGACGTTGCT 
CAGTGATGCT 
GTCGCTCCAG 
TTGGGACGGT 
GGTGACCATG 
CAAG CACTTC 
CAGTGCGGAG 
CAATATAGAA 
TTTGTCTCCG 
GGAAACCGGC 
AGAGAGCTTC 
A 



300 
360 
420 
480 
540 
600 
660 
720 
-7B0 
840 
900 
960 
1011 



(2 J INFORMATION ITOR SEQ ID HO: "7 9 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1698 base pairs 

(B) TYPE: nuclexc acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DMA | genomic) 
Uii) HYPOTHETICAL: NO 



30 



(ivl A1IT±— SENSE : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 



35 



lu) FEATURE: 

(A) NAME/ KEY: mi 3c_f eature 

(B) LOCATION 1...I6 98 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 79 



40 



45 



50 



55 



60 



65 



TACGACGGAG 
AAAACCATTG 
" GTGCACTTGT 
GTCCTGTCTC 
GGGGCCGATC 
TCGGCCTATG 
CTCTCGCTCG 
GCTTCCTACT 
GCCTACTATC 
OGGCTTGCCG 
TACAGGGGCG 
TTGGAGCTTT 
GCTGCGTATC 
GACAAATTCT 
TTCGGTATCT 
CGTAGGGGCG 
TCGTCCATCA 
CTGTCGGACT 
ATATACGAAG 
ATTCGCCGCT 
OGCACGGATA 
GAGACGTATC 
ATAGCCTATG 
CGAATGGTGC 
CAGCTGGCCT 
CTGTACGTCT 
GGCGACCTGA 
CATATCCTGT 
AATATCTCCT 



CGAGACTGGT 
TAAGATACAG 
CTGTCGAGGC 
CCACCGAAAG 
TTTTCCCGTT 
GCGATATTGC 
AATCGGGTGG 
CACGTGGCAT 
CCTATTTGGT 
GCTACTATTC 
AAGTTGCTTA 
CTTGTGCTAC 
TCTATCATAG 
TCGTCAGCTA 
CCAGAATGAA 
ATGCCATCGG 
ATCTCTTTGC 
TCGATTTTTC 
ACTACAAGCC 
GGTATATGCT 
GAGGTTGTGC 
GCAAGCATGG 
ACCATGCCGG 
TGACGCATTC 
ATTTGOCCTA 
CGATTCCGAT 
TGAAAAGAAA 
CCGATCCGCA 
ACCTCTTC 



CTATACCCTA 
CCGCCTTCCG 
GCAGATGCTC 
GCAATACAGG 
GAGCGATAAG 
GGGCGACTAT 
TCGCATCAGT 
GCACAAACGC 
GTCCGATTCG 
TTTTCGCGCC 
TCGGCTGACC 
CTCTTTGACG 
ACAACACCTC 
CGGTTTCGGT 
CTACGTCAAC 
TCTCGACTAC 
TTTGCTTTAC 
ATTTTCAGCC 
GGATGATAAT 
CAATGAGTTT 
CCTGAGAGTG 
ACATCATACC 
ATCCAAATTG 
GTATAAGATT 
TGCCTATCGT 
GCAGAATACC 
GGACGGTATA 
AGCCGAACGA 



TTTCGGAATA 
GTCGCTCTCT 
AATACTCCTT 
GAGATTTGTG 
CTGCGCGATT 
CTTCCGTACA 
GTCCGTAACT 
ATCGGCTGGA 
ACCGGCGGAG 
GGCCGCTTGC 
GATCCGCGl'A 
CTGCCTCGAG 
ACACAGTACA 
CAGGTGGATG 
GGATGGAAGC 
AGCGGCTACT 
AATCGCCTGC 
GACTATGCTC 
TATCATATCT 
TCTGCCCAAG 
AGTCCCGGTA 
ATGAGCGGAA 
G ATTT TGG AC 
CGTACCATCC 
AATAGAGAAG 
CACCGCCTGA 
GCCTATGGCA 
ACGTCCGGCC 



GAAACGACAT 
TCTTTTGCCT 
TCGAGCTGTC 
TGCAAACGAA 
CGGCCTATGT 
ACGGCAATAA 
ATGGCACATT 
ATGCTCrGCG 
ACTATCATTT 
CCCTCGGTAT 
CGACCAA7AC 
AGAACAGGCT 
ACTGGCGTCC 
TCAGCAACAG 
TTAGCTCCCG 
TCCTCGATAC 
GACTCTATGG 
TGCGCCAAGG 
ACGACCTCCG 
CCCAAGCCTC 
GTGATTTCTA 
TGCTACGTCC 
TTTCGCTTTC 
AGAAAGAGCA 
GCGTGGAGGT 
TGACAGAGCT 
AAACGCCCGG 
ATACCATCGG 



TCACCCTATG 
TTTGGGAGCT 
GGATCAGATC 
AGAAAAAAGG 
TCGTTTCGGC 
CTACTCCTCG 
GCAGGGCAGT 
CAACGCCGAA 
CGAAGACTAT 
AGGCTTCTCA 
GACCGGTGCA 
ATCGCTTTCG 
CGGGCAGCAG 
CCCTATCTGG 
TCTGGATACC 
CGAaGAGAGG 
TAGCTGGCAT 
GATAGAGCGG 
TATCTTGGCC 
CTACCGTATT 
CGGCTATGAT 
TTTTGCCGGT 
CGCTGCTTAT 
GCTCGACTAT 
GCGTTCCTCT 
GCGGTTGTAT 
TGTCATCrCA 
GGCTATCTGC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13SO 
1440 
1500 
1560 
1620 
1680 
1698 



70 



<2) INFORMATION FOR SEQ ID NO: 80 



75 



(I) SEQUENCE CHARACTERISTICS: 

[A> LENGTH: 24S7 base pairs 
{B) TYPE: nucleic acid 
(C) STKANDEDNESS: double 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



^."js^Cis ...k. iC',\ jE; f 3' lilwii -m ilJi ii"*i3"inr Clii'U 



WO 99/29870 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(D) TOPOLOGY: circular 

|ia) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI-SEHSE: IIO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHOHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY; mxsc_f eature 
{ B) LOCATION 1...2 4S7 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: SO 



ACGACGAACG 
AATCGGAATG 
GTAGGAAACT 
TTGTCCAACT 
GTCCTGCCTT 
AGCAATGTGC 
GTCGCTTACA 
CCCTATCTGC 
GGATATTCCG 
GGAGTACCCC 
GCCTCTTCCA 
GGAGCTTTTG 
CGTGTCGATT 
GGACGCATCG 
TACGTGGATA 
AGCAACACGG 
AACGGTCTTT 
ATGTACGTGG 
CGTCACTACC 
GCACACTACA 
GAATATGCAC 
CGTCAGAAGT 
GGTGCATGGG 
CGTATCACTT 
AACAGGGCAG 
GAACTGAACA 
ACGGACGAAT 
CGC1TCCTCA 
GCTTCTGTTG 
GGACAGTATC 
CCCCTCTTGT 
GATGGCCGTT 
ATGGGACTGG 
TCTTTCACTA 
GCGGACTACA 
TCGCCCAATG 
TGGACGAGCC 
CTTTCCTCCT 
AAGAGAGTGG 
GCCTACATCT 
CTGCGTTATT 



TTTTCTTATC 
CGTGGATGGC 
CTCCGTC7GA 
TCAATGATAA 
TCTGCCTGAC 
AGCTCCAGAC 
CCAACGTTCG 
TGATGCTGAC 
GCTTTCGCGT 
TCAACGACTC 
T CG AAGACCT 
GGGCAAGTGT 
TGAGCGGAGG 
GTCGCCATTG 
GAGGAAGCGT 
CTCTCAGGTT 
CCAAGGAGGA 
ACGCGCAAGG 
ATGCCATCAT 
CGGCCGGATA 
TGCAGCCCTA 
ATCTGGACAA 
ATTTGCAGTT 
ACATCAAAAA 
ACAAAAGAGA 
TCTATGCCGA 
ATGATGAGGT 
ATCCGAAGGC 
CGGTAGCACA 
CTACGCCTGA 
CGGCCGGAGT 
TGAGCGATGT 
AGCTGACTCT 
TGAGTCGCAA 
ACTGGCTCGA 
TCATTGCCGG 
GCTTCGTCAG 
ATTGGGTGAA 
CACTGGGCGT 
ACGATGCCGG 
ATCCTCAGGC 



TTTGTCCCAT 
TGAGAACAAA 
CTATCTTCAC 
GAGATTTATG 
GGCACAAGCA 
CGTACAGGTC 
CAAGGCCGAA 
TCCCTCCGTG 
GCGTGGCACC 
CGAATCTCAG 
TCAGGTGCAG 
CAATATGCGT 
TTCGTTCGGC 
GGCAGTGGAT 
GGATCTGAAA 
CATCACTTTC 
TGAAGCCAAA 
AGTACCGCAC 
GACGCACAGC 
TGGCTATACG 
TGTGGAAAAC 
TGACTTCGGA 
CGGGGCCTCG 
GTACAATCAG 
AGGTGCAGCC 
CCTCCAGTAT 
ACAGGGAAGT 
CGGTCTTACC 
CCGCGAGCCT 
GCGACTGATC 
AGGT CTCTAT 
GGGACAGATG 
CGGTTGGCAG 
CAAAATCGAC 
ACTCAAGGAG 
CAGCATGCTT 
CAAGCAATAT 
CGACCTCCGC 
ACAGCTCAAT 
TTACGTACAG 
CGGATTTAAT 



AAAATTGGGA 
CCCTCATCAC 
GGCGAAGCAA 
AAAAAACTTO 
CCCGTCTCCA 
GTAGCTACTC 
CTTTCCAAGT 
GTAGCCACCA 
GATGCCAATC 
TCCGTCTTTT 
CGAGGTGTGG 
ACGGATAATT 
ACATTCCGCC 
GCCGGCCTGT 
TCCTATTTCG 
GGAGGAAAAG 
TATGGCCGCC 
TACTACCACA 
TTCTCTCCTT 
GACGAATATC 
AGTGTGACCG 
GGACTCATCG 
GGCAATATCT 
CCCTTAGCTC 
TTTGCCAAAG 
CGTACCATCG 
ATGCAGCACA 
TATAGTTTCG 
AACAGAACCA 
GACTATGAGC 
TATATGCAAT 
CTCACAAGCA 
ATCCTTCCTC 
CG CTACGTAC 
GAGACCCTCG 
ACCCTCTCTC 
CTGGACAATA 
CTCGGCTATG 
AATCTCTTCA 
GCATCCGGAG 
GCACTGGGTA 



GAAGGGGTGC 
CTGAACCGGA 
TCATTCCCCC 
ACATGATTGC 
ACAGCGAGAT 
GCGCCACGGC 
CCAATTATGG 
GCGATGCCGG 
GCATCAACAT 
GGGTGAATAT 
GTACTTCCAC 
TGGGACTGGC 
GATCGGTCAA 
CCAAAATCGG 
CACAGGTGGG 
AAGTTACGGG 
GATACAACAG 
ATACCGACAA 
CCGTTATCCT 
GTACCGGACG 
TGAAGAAAAC 
GTTCGCTTAA 
ATAAAGGAGA 
CCGACTTCGA 
CCAACTCHCA 
GCTACACGAT 
TCGATTTGGA 
ACGATGCTCA 
ATTACACCGA 
TGGGCTACCG 
ACAAGGACCA 
ACGTCCCCGA 
GTTTGCTGCG 
AATATACATC 
AAAGCACGGA 
ATGCCGGTTT 
CACAGCGCAG 
TGCTGCCGGT 
ACCTCATGTA 
AACTAAGTGC 
GTCTGACAAT 



TTCCTGTAGC 
TAATACCGGC 
TCTCTCTTCT 
CGCCTTAGCC 
AGATAGTCTT 
GAAAACCCCT 
TCGTGACATC 
TACGGGTATC 
AACTACCAAT 
GCCCCACTTC 
CAATGGTGCC 
TCCTTATGGC 
ACTCGGTAGC 
TTCGGACGGC 
CTATTTCGGT 
TATCGCATGG 
TGCCGGTCTT 
TTACGAGCAG 
CAACCTCACG 
TAAACTAAAG 
GGATCTCATC 
CTGGCACACC 
CCACTTCGGC 
ATATTATCGG 
GATCACTCCG 
AAACGGCATC 
CAAGACCTTC 
TACTGCCTAT 
AGCCGGAATA 
CTATGCTTCG 
ACTCGTGCTG 
CAGCTACCGT 
TTGGGATGCT 
CGTATATGAT 
TATAGCCTAC 
CGAAATGGCT 
CGATCGCATG 
TCACTTCGTT 
TGCCTCCAAT 
ATATGCCGAT 
CGATTTC 



<21 IMFQRIIATIQN FOR SEQ ID HO' 81 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 
<B> TYPE: nucleic acid 
(C) 3TRANDEDNESS : double 
(DJ TOPOLOGY: circular 

<ii) MOLECULE TYPE: DMA (genomic) 

(11 J) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

tvi) ORIGINAL SOURCE: 



60 
120 
IflO 
240 
30O 
360 
4 20 
490 
54 0 
600 
660 
720 
790 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
13?0 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2457 



SUBSTITUTE SHEET (Rale 26) (RO/AU) 
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(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

Ux) FEATURE: 

(A) MAME/KEY: misc_feature 
(B> LOCATION 1...1596 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 61 



10 



15 



20 



25 



30 



35 



TACAGGCGAT 
ACTCGGTTAC 
AGAAGGTTTC 
CGGTTTCCGA 
CGTCCTTGGC 
CATTTCATCA 
CAAACAGGCT 
CACGGATCGC 
CCGTTTGCCT 
ATCAACGACC 
CTGTCGGACC 
GCTATCGCTC 
TCTGTCGGGA 
GCCGGCTTTC 
CTGAATCTGA 
TTCTTCCAAT 
AATCTGATCG 
GGAGGTCTCT 
GAGGAGGTAC 
ATCTTCCAGC 
CTGAATGTCG 
TATAACCT CG 
TGGAGCTGGC 
CCGCACCAGA 
CGCCTACTGG 
GTCGCTCGCC 
ACCGGCGACA 



CGGGAAGGGT 
AGCCGAGAAG 
TATCGCTGTT 
TGGTGCAGGG 
GCGCCATCGG 
TGAACGAGGA 
TTGGCTGGGA 
TCTATTTCAA 
T CTT T GGCAG 
TCTGTGCCAC 
TGCTCATCGA 
TGATCAATCC 
GTCGCAGCGG 
GCTTTTTGGC 
GATTCGACTA 
TCAAAGCCGG 
GAATCCTAAG 
TTCAGCACTT 
TCGTCACCCC 
ACCACGGAAA 
TCCCGATGGG 
GCAGCGGATT 
TCTTGGGAGT 
AAAATACCGA 
TGACGAGTTC 
GTTTCATCCG 
TACAGCTGCG 



TTGTCCTCGT 
AGCGGTTCGA 
ACTGCT GTAC 
AATCGAGTTG 
TAAAACGATA 
CTTTGCAGAC 
CAATGACAAG 
TGCAGCGAGG 
TCTCATGTGG 
CACCATAGGC 
CAATCGTACC 
GATGCGCTTT 
ACAGATATTT 
AGACAAGCGG 
CGGCGATCCA 
ATTGAGTTTC 
CGGATGCCAA 
CGACTACTAC 
ATACCGTATC 
ATTTCGACGA 
AGCCAGTCTG 
GAGCGGCAAG 
CGAAAGCTAT 
TGTCAGCTCT 
CGAGTTCGCA 
CAAAACAGCC 
TGTCGGATTT 



ATCCTCCGGA 
CTACGAGCCG 
ATACTCTCTT 
GATACCGATT 
GGCGTCAATC 
ATCAGTTGGC 
TTTGTCACCA 
TCGAACGGTT 
GAGCTGCTTA 
GGTATAGCTT 
ACAGGGTGGG 
CTCAACCGTC 
CAGT CTGTCC 
CATGCCCGAA 
TTCCGAAGCG 
TCCGAATCGC 
CTGCTCGCAC 
AATTCGGAAA 
TCGCAAGTGG 
CGTCCTCTGG 
TCGGATCACT 
CTATACCTTG 
CGGCTCTACA 
TTTATGGTGC 
TTTCATCCTG 
TATCAATTCT 
CACTTC 



ACAAACGTTC 
GCACCAGAAC 
CCATCAGCCT 
CGCTTTTCTC 
TGGCCGTATG 
AGACTATCAA 
ACCTCTTCGC 
TGAGCTTCAG 
TGGAAAACGA 
TGGGGGAGAT 
AACGTATGGG 
TGACAGCAGG 
CCATAAACAT 
CCGGTGCCAC 
AGACTTTCTC 
AACCTCTGCT 
ACGAACGAAC 
AACGAATAAG 
CAGCTCTGGG 
AGCTATATGC 
ACAACGTGGA 
GTGCTACGTA 
CATGGATCGG 
AGGGGGACGA 
GCCCCTGGCA 
ACCCTAACGT 



CTACGCAGAT 
AAAGATGAAA 
TTCTGCTCAG 
TCTGCCCAAG 
GGGCTTCGAT 
GAGCAATTTC 
ACATCCTTAT 
GCACTCTGCT 
GCCACCGAGT 
GGGGCACAGG 
GCGCGAGGTG 
AGAGGTGACT 
AGTCGTCGAT 
GGCTCTGACC 
TCCATACGAT 
GAGCCAGATC 
GGTTTTGGTG 
CAAAAATTCG 
AGGCGGTCTT 
CGAGACCTAC 
CAATCGGGAC 
CAATGATCTG 
GTATGAAGAG 
AAGCAAGGCG 
TGTAGCCATC 
ATCATT CGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1-14 0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1S9G 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 82 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9O0 base pairs 
(R) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

<iil> HYPOTHETICAL: NO 

<iV) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMON AS G1NGIVALIS 

fix) FEATURE : 

(A) NAME/KEY: mi a cofeature 
(8) LOCATION 1...900 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 82 



AAAAGAAGAA 
TTCACCCTTG 
ACTATAAAAG 
GGTGGAGAAA 
GATAAATGGA 
AACAAGTGGA 
GGCAAACATT 
GAGAAGCCCT 
TATCTCACTC 
TCTCCTACAA 
TTCGGGGTGA 
GCCAATATAA 
TATACGCACG 
AACAAGTTCC 
AACGATGGCC 



AAAAACAGAT 
CGAACGCACA 
GTGTGACCGG 
ACACGGTGGC 
GTTGGGACAA 
ACAAAAGTGT 
GGTACGGAAG 
CGGATCACTT 
TCGGTATTGG 
CGGGCAAGCT 
AAGTTGGT G A 
ATCTGATGGA 
ACTTTGGCAA 
TCACGGCTAC 
CGAAAATCCA 



GAAACG ACT G 
AGAAGCAAAC 
ACT AAAT GCC 
AGGTAACCTC 
CGGTTTGCGT 
AGACAAG AT C 
TGCGCTTTTC 
GACAGGAGTC 
TGCGGACTAT 
GACTGTAGTA 
AAAGACAATG 
GAATGTCAAT 
CATTGACATC 
GATAGCCACC 
GTTCAAAGAA 



ATTGTTTTTC 
ACTGCATCTG 
TCTCAGACTT 
TATTTGAACA 
ACAGACTTCG 
GAACTCTTCA 
ACT TTCCTCT 
AAGCATATCT 
AAGCCCAATG 
GCAGACGACT 
TTCGAACTTG 
TTGATAACCA 
AATTGGGAGG 
AATCTTATCT 
GTTGTGGGCG 



TGGCAATGGG 


TGGCTTGCTG 


60 


ACACTCCCAA 


AAAGGACT GG 


120 


CTCTGACCAA 


CTGGGCTGCC 


180 


TAGATGCCAA 


CTACCTGAAA 


240 


GTCTGACCTA 


CACAACAGCC 


300 


CGAAGGCCGG 


CTATGAGATC 


360 


CACACTATGC 


CAAAGGATAT 


420 


CTAATTTCTT 


CGCTCCTGCA 


480 


AGAAGTTCTC 


TCTCTACCTC 


540 


ACCTCTCAAG 


TTTGGGAGCC 


600 


GTGCTTTGGT 


AGTGGGTTCG 


660 


AGGCTTCATT 


CTTCTCGGCT 


720 


CT AT GCTGGC 


CATGAAGATC 


780 


ACGACGATGA 


TGTGAAGATC 


B4 0 


TGGGTGTTGC 


GTACACTTTC 


900 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 PCT/AU98/01023 

57 /490 

(2) INFORMATION FOR SEQ ID NO: 83 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

10 

(ii) MOLECULE TYPE: DMA (genomic) 

(ill) HY POTHET I CAL : NO 

15 (iv> AIITI -SENSE: NO 

[vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

20 { ix ) fEATURE : 

(A) NAME/ KEY: miscjeature 

( B ) LOCATION 1...663 



25 



45 



55 



60 



65 



(xij SEQUENCE DESCRIPTION: SEQ ID NO:83 



40 (2) INFORMATION FOR SEQ ID NO: 84 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 4 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 



tii) MOLECULE TYPE: DNA (genomic) 
50 Uii) HYPOTHETICAL: NO 



(iv) AMTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME /KEY : ml sc_f eature 
(E) LOCATION 1...744 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 94 



ACGAGAGAGA GTGTGTTACA TTGTAGAACA AAACTCAAAA AAGAACGAAA AATGAAGAAA 60 

ATGATTTTGG CAGCTACTAT GCTGCTCGCA ACAATCGGTT TTGCAAATGC TCAGAGTCGT 120 

CCTGCTCTTA GACTGGATGC TAACTTTGTC GGTAGTAACT TAATGCAAAA AGTCGCAAAC 180 

ACGAGCGTGA ACAATAAGAT GATCGTAGGC TTACGTGTTG GTGCT GCTGC TGAGTTCGCT 24 0 

30 CTTAGCAATG ATGGATTCTA TCTCGCCCCC GGATTGGCCT ATACGATGAG AGGTGCTAAG 300 

ATGGAATCAC TAAGTGAAAC GACAACTCGC TTGCATTATC TGCAAATACC GGTGAATGCC 360 

GGTATGAGAT TTAGCTTTGC TGACAACATG G CTATTT CAT TGGAAGCAGG TCCCTATTTC 4 20 

GCATATGGTG TCGCCGGAAC GATTAAGACT AAAGTTGCAG GCGTTACGGC TTCTGTAGAT 4 80 

GCCTTTGGTG ATAACGGATA TAACCGTTTC GACTTGGGCT TGGGCTTGTC TGCTCCCTTG 54 0 

35 AGCTACGACC GTTATTACGT ACAAATTGGA TATGAGCATG GATTGCTTAA TATGTTGAAG 600 

GATGCTCCGG ATAAGACTTC TTTGCGTAAT CATGACT TCT TTGTGGGTCT CGGTGTTCGC 660 
TTC 



663 



ATCAAACGAA TAGAAATGAA AAGGATTTTT ACTGTAGCCC TT GTGCT ACT TGCTTCGGTC 60 

ACTATGGCCA TCGGACAAAG CCGCCCGGCA CTTCGCGTAG ATGCCAACTT CGTAGGCAGC 120 

AAT CAGAGC A TGAAAAGAGA CGGATATGTG TGGGACACCA AAATGAATGT CGGCCTGCGG 180 

GTCGGTGCCG CTGCCGAATT CATGATCGGA TCAAGAGGAT TCTACTTGGC TCCGGGTCTG 240 

AACTATACGA TGAAGGGCTC CAAAACCGAA TGGGATATAC CCGAAATGGT TCCTGGTACC 300 

TATATTACGA TGGTTTCCAC TCGCTTGCAC TATCTGCAAC TGCCGATCAA TGCCGGCATG 360 

CGGTTCGACC TGATGAATGA CATGGCGGTT TCGATCGAAG CGGGTCCTTT CCTTGCATAC 4 20 

7U GGTATATATG GTACATATCG GCAGAAGTTG GAAGGATGGA AGCCGAACAA CTACAGCACA 4 80 

GAG Tl'TTT T G GCCCAACGCT TGGTGGCCCA ACAAATATCC GCTGGGACAT CGGGGCAAAC 54 0 

ATAATAGCCG CATTCCACTA TAAGCGTTAT TATATACAGA TAGGCTATGA ACATGGATTT 600 

GTGGATATTG TGTCAGGTGG AGGTTCTGAT ATTCCCCGAC TGAACGACAA TAGGCAATCC 660 

TCTTCGACGA CCGCTCTAAG AGAAAAGGGA AAT AAC G A AT ACGCTTATAA TCGTGACTTC 720 

75 TTCGTGGGCA TAGGTTACCG CT'l'T 744 
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(2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 
13} TYPE: nucleic acid 
(CJ ST RAN D E DM ESS: double 
(D) TOPOLOGY: circular 



(ii) MOLECULE TVPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (iv» ANTI-SENSE: NO 

<vi) ORIGINAL. SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

20 (ix) FEATURE: 

(A) NAME/ KEY: mice feature 
(B> LOCATION 1 . . . 633 



(Xi) SEQUENCE DESCRIPTION: SEQ 113 HO; Bb 



AAAAGAAAGA GTATGAAAAG AATliCTGCTG CTTCTCGTTG TATTATTATA TGGAATTGCA 60 

GGCCGAT'l'GG CTGCACAAGA CGTTATCAGA CCATGGTCAT TGCAGGTCGG AGCGGGATAC 120 

TCCGATACGG AGAACATCCC GGG AG GATTC ACCTATGGTT TCTATTTGGG AAAGCGTATG ISO 

GGGAGCTTTC TGGAAGTGGG GCTGTCCATG TACAACTCCA CACGTCAAAC AGCCAACAAT 24 0 

30 GCAGACTCCT TTGCATCGAA CGAAGGAGAC GGATCTTTTC AGGTAAATAT GTCTTCTCCG 300 

AATGAGAAGT GGTCATTCTT CGATGCAGGC AGTGCCAACT GCTATATGAT CGTCGTCGGA 360 

GTCAATCCTC TCCATCTGTT TTGGCAGAAT AGCCGGCACA ATTTGTTTCT GGCAGTACAA 420 

GCCGGCCTGT CCAATAAGCA CAATATTCAT TTCATCTATG GAGACAAGGG AGCCAAAGTC 4 80 

AGTATCTACA CCAATTCGAA TACCTACATC GGTTACGGAG CACGTGTAGC CTACGAATAT 540 

35 CAAATTCATA AAAACGTGGG GGCGGGTGCC GCTGTAATGT ACGACCACGG CAATAAGATG 600 

CTTACGGCCA TGGCCACGCT CTCCACTCAT TTT 633 



40 



(2) INFORMATION FOR SEQ ID NO: 86 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2859 base pairs 
(B> TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
45 (D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DMA (genomic) 

(ill J HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



50 



55 



(Vi) ORIGINAL SOURCE: 

(A) ORG AH ISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION 1...2859 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID HO: 86 



65 



ATCCGAATGA GAGTATCCGA TCTCTGTTCC AGACTTTCAT GGTTATTACC CGTAATCCTT 60 

GTCGGATTGC TCTGT GCT AC TTTGGTCGCT GCGGAACGTC CTATGGCCGG AGCAGTCGGA 120 

TTGCACCACC GTCGGCATGC TGCGCTGTCT GATTCTACAG CCAAAGACAC GGTGCCTCTC 180 

GCAAAACCTA TTCCTGACAG TGCTTTTCGA GATTCCCTTC CTGCCGATTC CACCGGATCG 240 

ATGCGGCAAG AT AGCGT GT A TGACGATGAA TTCGAATTGG AAGATATAGT GGAGTACGAA 30O 

GCTGCCGATT CCATCGTTTT GCT CGGAC AG AATCGTGCCT ATCTTTTCGG CAAGAGCTAT 360 

GTGAGCTATC AAAAGAGTCG CTTGGAGGCA AACTTCATGT ATCTCAATAC CGACAGCAGT 420 

ACGGTTTATA CTCGCTATGT CCTCGATACG GCCGGTTATC CGATGGCC1T TCCTGTTTTC 4 80 

70 AAGGATGGAG AGCAGTCGTT CGAAGCCAAG AACTTTACCT ACAACTTCCG CACGGAGAAG 54 O 

GGGATTATCA GCGGAGTGAT CACGCAGCAG GGCGAAGGCT ATCTGACTGC CGGTAAGACC 600 

AAGAAGATGC CCGACAATAT CATGTTTATG CAAGGAGGGC GTTATACGAC CTGCGACAAT 660 

CACGATCATC CTCACTTCTA TATCAATCTT TCCAAGGCAA AGGTGCATCC GGAGAAAGAC 720 

ATCGTCACAG GTCCGGTCAA TCTGGTTATC GCCGATATGC CGCTGCCGAT AGGTCTTCCT 100 

75 TTCGGCTATT TTCCCTTTTC CAACAAATAC TCTT CCGGT A TATTGATGCC CACGTACGGA 84 0 
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GAGGACAATC 

TATATCGATT 

CAATCGAAAT 

TCGAAGTCCG 

CGATGGACAC 

AATTTTGCCA 

CGTACTGCTA 

TTTTCGATTA 

ACCTTGCCGA 

GTAGGACCGG 

AGTATCTTGA 

GGTATGCGTC 

ACl'ATGGGGG 

GAGGATAAGA 

TACAGTCTGT 

TTTTCCTTCG 

TCCTATATGC 

GATCAGAA03 

GCTCCCTCCA 

AAGATCAAAT 

ACATGGTCTA 

TCGCTGGCAC 

TATTTGACGA 

CTGCGCATTT 

TATACGCTCA 

CGGAGAGATG 

AT ACT T GAG G 

CAGGGCGGAG 

TTCGACTATA 

CACTACTACC 

TGGAGCTTCG 

TGCAACGTCA 

GCATACAAGT 

TATCAGCAGA 



GCTATGGATT 

TGGCATTGCG 

ATAAGAAGAG 

GCGACAAATA 

ACAGTCAGGA 

CCGGGAGCTA 

CGACACGAAG 

CGGGTAGCAT 

ATCTTTCGAT 

AGCGATGGTA 

CAAAAGAGAA 

ATTCCGTACC 

TTAACTACAA 

AAACATTCCT 

CGG CAGGCTT 

GAGGCAATCT 

CGGACTTCAC 

GCAAGCTGCA 

TGGGCAATGC 

CCAAATCCGA 

CATCCTATAA 

TTCGCCTCTC 

AGTATTATGA 

TTAACGGCAA 

ACAAAGAGTC 

AAAAGAAAAA 

GAGGAAGACC 

CAGTGGATCA 

GTTGGAATAT 

GGGTAACGCA 

GATTCAATGC 

CTCGCGACAT 

CCTATAATTT 

GCAATCGTCC 



TTATTTGAGG 

TGGGGAGATC 

GTATAAGTAC 

CGTGCCCGGA 

TCCGAAGGCC 

TTTCCAGAAT 

TTCGGCCGTG 

GGATATCAGC 

TAATATGTCC 

CGAGAAGTTG 

AGATTTGCTC 

GATCAGTTTG 

TGAGTGGTGG 

GCCTTCGGAC 

ATCTACCACA 

CATTATGATC 

GAAACQCCGA 

TACGCTGCTC 

AGGATCTGTC 

TTCGACAGGG 

TATGTTTGCC 

CAAGAGCTTT 

GGGAGAAGAT 

GGGATTGGCA 

GCTCAGCGGA 

CACAGGGGCT 

GCAAAATGAA 

G GAT GG TT AC 

TGCTACCGAC 

GAATCTGAGC 

GAACTACAAT 

GCACTGCTGG 

CGTCATATCG 

CATCACGAAT 



AATGGTGGAT 

TTTTCCAAAG 

AACGGCTCGT 

GACTACAGCA 

AATCCTTTGC 

TCGCTGAATA 

AGCTATTCGC 

CAGAACATGC 

ACGCGTTATC 

AGTGTGGGCT 

CAGAGCAATC 

ACTGTCCCTT 

TACACGAAAG 

ACGACCTATA 

TTCTACGGTA 

CGTCATCGCT 

TATGGCTTTT 

TACTCTCCTT 

AATTTCTCTT 

ATCAAGAAGA 

GATTCGATCC 

ACCTTGCGCT 

GGGAAGATCA 

CGCCTGATCA 

TTGATAGCTC 

ACTCCTCATG 

AGTCGGGGGT 

TTCGCAT AT T 

TACAATAGGT 

TTTCGCGGCA 

TTCGAC1TGA 

GCTATCTCGG 

GTGAAGAGTT 

ACTTGGTAT 



ATTATTTTGC 

GGTCATGGGG 

TCGAAGCCAA 

AGACCACCAG 

AAACGTTGTC 

CCACCTATGA 

GCAAGTTTCC 

GCGATACGAC 

CTTTCAAGCG 

ATTCCGGTCA 

TCGTGCGCGA 

TGTTGGATTA 

GCATACGGAA 

AAT T CCGCAG 

TGTTCAAGCC 

TCACGCCCAC 

GGGAGCTTCT 

ATTTCGAGCA 

TTGACAACAA 

TCAGCCTGAT 

GATGGAGCAA 

TGTCCGGTCT 

TTCCCTATAA 

GTACGGGTAC 

TTTTCAGTGG 

AAGGAGACGA 

CGCTCCTCGA 

CGATCCCATG 

ACAATGTCAA 

ATATCCAGCC 

AGAAAATAAC 

CCAGTTTCAT 

C ACT CTTGCA 



CTTCAGCGAC 

CATTTCAGCC 

TTATCTGGTA 

TCTGAATAT C 

GGCCAATGTC 

TGTCAATGCC 

GGGTACTCCT 

GGTGAGCCTT 

GAAGACCCGT 

GCTTCGCAAT 

TTGGAAGAAT 

TATCAATCTG 

GTCGTGGAAT 

ACT GT AC GAT 

TTGGAAACCT 

TGTCAGTTTC 

TGAGCATACG 

GATATTCGGT 

CTTAGAGGCC 

AGATCAGTTC 

TATCTCGGCT 

GTTCGATCCC 

GAGCAACGAC 

TTCTTTCAGC 

CAAAAAGGAG 

TGCTGCCCAT 

GCGCAACCGT 

GAGCCTGTCC 

TAAGATGGAG 

TACACCGAAC 

ATCGCTTACC 

CCCTATAGGA 

GGATCTGAAG 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1600 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
234 0 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2659 



(2) INFORMATION FOR SEQ ID NO; 87 

A _ (1) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 37 53 base pairs 

(B) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 
<D) TOPOLOGY: circular 

45 (ii) MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: HO 



50 



55 



60 



65 



70 



75 



tiv] ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: miec_f eature 

(B) LOCATION 1 3753 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 



AAATGGAAAT 
AACGAGGTAA 
TGCACGGTAT 
AAGCCTTGGA 
TTAATTCAAG 
CCTGTGAACG 
GAAAATCTAC 
GTTGCCAATC 
ACTAAAGCGG 
GGAGGGCAAA 
AAAATGTTGC 
CATGAGCCTC 
CGGGCTTTGA 
ATTATGGCCA 
ACGGAGCAGT 
GCGAATGATC 



TGGCATGCGC 
TAATAATGAT 
TTACCTTTCA 
TTCAGAAACA 
CTGAAATTGT 
AAATAAGGCC 
GGGCATCTCC 
CAATGGATCC 
TAACAGTACC 
ACATTGCCAG 
CTAAGGGGAC 
ATCGCTTATC 
CTCCTATTTG 
C1TCTACTGC 
ATCCTGTTAA 
CCATTGATTT 



TTTTGATTGC 
GAAACGATAT 
AATAAAAGCT 
TTCAATGGAT 
ATACCAATCT 
TGCAAATCGT 
CCCCGTAGTT 
TGAAAATCCC 
TGTCGATGTG 
ATTAAAGTCT 
GGCTACAGAA 
TGATTTTACC 
GGGAACACAT 
TGTGGATAAG 
AAATGTAACT 
GGTTATACAA 



GCGTGTTGTT 
ACAATAATTC 
CGCCCTTATG 
TCTAAATTGG 
GTTTCT GAAC 
TTCCCTTCGC 
CCCGTTGCCG 
AATGCCTGGG 
GTGATGGTTA 
GCCATTGCAT 
GGGGTGCGTA 
AAAGACACTG 
ACCCAGGGGG 
CATATCATAT 
ACTGCAGACT 
GGAGCAATTA 



TCGATCCATT 
TTGCAGTTTT 
AAAGATTTGC 
TGCCTGCAAA 
ATAGTGACTT 
ATAGGAAGTC 
TCGACAAGTA 
ATGTGACGCT 
TCGACCAGTC 
CGGGACAGCG 
TCGCTCTTGT 
CTTTTCTCTG 
GGCTTAAAAT 
TGATGTCTGA 
-TCATTGGCAA 
ATTTCCCTAC 



T GT AG T AACT 
TCTTTTATTC 
AGATGTAGAG 
TAAGGGTAAC 
AGTTATTTCA 
TTTTTTTGCA 
TGCGGTACCG 
AAAAATCACT 
TTCGTCAATG 
TTTTGTGAAA 
GAGTTATGAC 
TCAAAAAATC 
GGCGAGAAAC 
CGGGTTAGCG 
AACTGGAAAT 
AAATTATGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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40 



45 



TCCAACAATC 
CGGAGAAATC 
GATGGTGTTG 
TTCCCTTGTA 
CATACTATTG 
GCTACAGACG 
AATATTGCCC 
CCTGGTTTCA 
TCAAATGGAA 
ATCCTGAGCT 
CAGAACAATG 
GATACCAATA 
TTAATTTTCC 
TTGGTAAATA 
GAGGCTCATG 
CCCAAATGGA 
ACTAACACGG 
TCCACGCCGA 
TTCGCTTACA 
GATGTGACGT 
GCAACGACAG 
CCCAAAATTA 
AGTCAATTGA 
GTGAAGTCGT 
CAAAATGTAG 
ATGTATCGCA 
AATGATGTGG 
TGGCGGCCAG 
ACGAATGACG 
TGCGATGCCT 
ATCGGCAACT 
GTCGAGCAGA 
GGAAGCACGG 
GGTCAGGACA 
AACGGAGCGT 
AACAACGGTA 
CCGTCACTTG 
GATGGGGGTC 
GGTTTGTCAC 
GTTCCGGAGT 
TTGGAGTTTG 
TCAGGAGCCA 
GATTCCGGGG 
GTCGTGAGTA 
GTAACCTTAT 
AGAGARGTCA 
ACTAATGATG 



CATCTACACC 
TGCCGGAATC 
CTGGCGCATT 
ACG CTGCT AT 
GCTATGACCT 
AGAATCACTT 
AAACTATTAA 
TCGTTAAAAA 
CGGTGCACTA 
CATCAGAAGC 
ATATTCCGGT 
CCGAGGCAAA 
CACGTCCGAC 
AAGACGGTCA 
TTCTACAGTC 
TAAAGTTGGA 
TCATCACTAC 
ATCCGGGCCA 
AGCTCCTCAA 
CCAATTGGAC 
AAAAT TTCGG 
TCGGTAACCT 
CGATCAACGG 
CGAAAGACAA 
GG GGGACCGT 
GGAGCTGGCA 
GCGGAAACGA 
CACCTTATGC 
TGCAGGCACA 
TCCTGAATCT 
CATACACTGG 
CGGTGTATCT 
TTTCAGGCTA 
ATCTTCCGGA 
CTTGTACGTT 
ATGGTACGCA 
TGATGGATGT 
TTTCTTTCGG 
AACTTTATGC 
TGAATAACCT 
CTCTTTCGGA 
AACACCGTAT 
CTCGTTTCCG 
CAAATGGCCG 
TCACAATAGA 
TGAAAGTGCA 
TGCATAAGGT 



TCTTACCCCA 
CAAATTCGAT 
GGTCTATGAA 
CAATGAGGCT 
GGGAGATTTT 
CTTTACGGCG 
TATAGGTATA 
TCTGACGCAA 
TGATGTCTCT 
TACCATAACT 
AAATACTACT 
ATTGACCTAT 
GGTTAAGTTA 
ACCCATACAG 
ACAAGATTTC 
CAAAACGACC 
TGCCGATGGT 
AATCGGTATC 
TTATTGGATG 
AGGAGCCCAA 
TTCTCCGGCG 
TATCAATAAT 
CGTGGTTGAG 
TCCTACGGGG 
CGAGTTTTAG 
GTATTTCGGT 
GACCGTCAAC 
ACCTGATACA 
GCCTACGGCA 
GACACGCACG 
AGCCATCGAC 
GTTCAACACG 
TCGAGCCGGT 
TCGTATTCCA 
GCANATCTTG 
GATCACATGG 
rCTTGGTAAC 
ATTCGACAAC 
GATGTCTGAC 
GCTGATCGGC 
TCATTTTGCG 
TACGAATTCT 
CTTGTCATAT 
TGAAATTATA 
AGGTAAGCTT 
GACCGGAGGG 
GCTTGTT GAG 



AATTATCCAA 
TATAGTAATC 
CCGAGGTTTC 
CAGTTTGCGA 
GCCTTGGCCA 
ACACCGGCCA 
CAGAGGGGGG 
TCGGGAGATG 
ACTAAAAAAC 
TATCGTATTT 
TCTGCTATCG 
ACCAATTCCA 
GGTTATGGTG 
GCAAATGGAA 
TTTTTGCCCT 
GAAGCATTAC 
AAACGTTATC 
AGTTGGAAAA 
GGAGGAACAA 
GTACCGCTCA 
GTAGCCGATT 
TCCGACAAGG 
GATAACAATC 
ACATTGCTTT 
AATCAGGCAT 
ATCCCTGTCA 
CAATGGGTTG 
GAGCTTCAAA 
GTTTACAGCT 
TCCGGTGTCA 
ATCAAGCAGG 
GGAACACGCG 
CAGTACCTCT 
TCGATGCATT 
TACGATAAGC 
CGATCCGGCA 
GAGTCGGCCG 
GGCTGGGATG 
ATCGGTAATG 
TTCGATGCGG 
AAAGGGGCTG 
ACGTCGTATT 
GGATGTGATC 
ATTCTGAATC 
ei'TCGCCGCT 
GCCTATATTG 
TAT 



CTCATTCTTC 
TGAGTGCAAG 
CTCATCCCTA 
AAAACTCTGG 
ACAATTCGTT 
ATTTAGCTGC 
AGGTGACGGA 
TTACTCATTT 
TGACATGGAC 
ATGCCGATTT 
GCCCGGATCT 
ATGGCGAACC 
TTATTAAGCG 
CAGTTGTCAG 
CAGGTGGAGG 
AG TACT ATT C 
GTTTTGTCGA 
AACCGGCAGG 
CAGACCAACA 
CAGGAGAAGA 
T GCATGTCCC 
ATTTAGTTGT 
CGAATGTCGG 
TTGCCAATCC 
ATGATTGTGC 
ATGAATCAGG 
AGCCTTTCAA 
AATTCAAGGG 
TCAAGGGTAT 
ACTACTCGGG 
GTATTGTCTT 
ACCAGTGGCG 
CTGTACCTAA 
CCTTCTTGGT 
TGCTCAAGAA 
ACTCCGGATC 
ACCGTTTGTG 
GTCGCAAGCT 
ATAAATTCCA 
ATAAGGATGG 
TTTACCTGCA 
CATTCGATGC 
AGAACGTAGA 
AAGATGCTCT 
TGAAAGTATT 
TGCATCTTCA 



TAAAGTTGGA 
GATTACTTTT 
TTATTATTAT 
TTATACAATC 
GAAACTAACC 
AGCGTTTGAT 
CTTTGTAGCT 
GCTAAATGTT 
TACTGGTACT 
GGATTATATA 
TGGTGGATTC 
GAATCAGCAG 
GCACTATGTA 
TTCCCTAAGC 
TCATATTGTT 
CGTACCGCCG 
AGTCCCAGGC 
AAACGCTTAC 
GAGTGAATGG 
TGTAGAGTTT 
GACAACCAAC 
TACCACAAGC 
TACGATCGTC 
GGGCTATAAT 
CGATTGTGGT 
TTTTCCAATT 
TGGCGATAAG 
CTACCAGATC 
GATTTGTGTG 
CGCCAACTTG 
CCCGCCGGAA 
TAAGCTTAAT 
GAATACAGCG 
GAAGATGCAG 
CACGACTGTA 
GGCGAATATG 
GATCTTTACC 
G ACT G AAAAA 
GGTTGCAGGG 
TCAATACACG 
CGATCT TCAG 
CAAGCGGGGA 
TGATTCGCAT 
TGACTGCACT 
AGCTGGTCAT 
AAATGCTTTC 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
Z400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
354 0 
3600 
3660 
3720 
3753 



50 £21 IMFOWIATIOH FOR, SEQ ID HO: 88 

(1) SEQUENCE CHARACTERISTICS: 

<AJ LENGTH: 1278 base pairs 
(B) TYPE: nucleic acid 
55 (C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic J 

60 (ilij HYPOTHETICAL: NO 

(iv) ANTI-SEUSE; NO 



65 



70 



75 



(Vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGJVALIS 

tlx) VEATURE: 

(A] NAME /KEY: raise feature 
(BJ LOCATION 1...1278 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 88 

ACTATGGAAG TCAAGAAAAA CACAGTGGTG CTACGCOTTC TGATTTGGTT CGTGGCCATT 60 
CTTCTCTTCC ACTCCTCACG GCTGTGGGGA CAGGAAGGGG AGGGGAGTGC CCGATACAGA 120 
TTCAAAGGAT TCGTGGATAC CTACCATGCC GTACGCAGCT CTTCTCCTTT TCATTTCATG 180 
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AGCTCGCGTA CGAGAGTGAG AGGTGAGCTG GAGAGGTCGT TCGGTAATTC GAAAGTAGCC 240 

GTATCGGTCA ATGCCACCTA CAATGCTCTA CTGAAAGACG AGACCGGCTT ACGTTTACGT 300 

GAAGCCTTCT TCGAGCATCA GGAAGAGCAT TGGGGCTTGC GCCTCGGACG ACAGATTGTC 360 

ATTTGGGGGG CTGCCGACGG TGTGCGCATC ACGGATCTGA TCTCCCCGAT GGATATGACC 420 

5 GAGTTTCTGG CACAGGATTA CGATGATATT CGTATGCCGG TCAATGCATT GCGTTTCTCT 4 80 

GTCTTCAACG AATCGATGAA AGTGGAAGTC GTGGTACTGC CT GT ATTCGA GGGGTACCGT 54 0 

CTGCCTGTGG ATCCTCGCAA TCCTTGGAAT ATCTTCTCCC TTTCGCCCAT TGCTCAGGGG 600 

ATGAATATCG TCTGGAAAGA AGAAGCCGGC AAACCGGCCT TCAAGGTTGC CAATATCGAG 660 

TACGGTGCGC GATGGAGCAC TACGCTCTCC GGTATCGACT TCGCTTTCGC TGCATTGCAT 720 

10 ACATGGAACA AGATGCCCGT CATCGAAGTA CAGGGCATTG TGCCGACGGA AATCATCGTT 7 SO 

AGCCCTCGCT ATTATCGTAT GGGATTTGTC GGCGGCGACC TCTCCGTACC CGTCGGACAG 84 0 

TTTGTTTTCA GGGGAGAGGC TGCGTTCAAT ATCGACAAAC ACTTCACCTA TAAGAGTCAT 900 

GCCGAGCAAG AGGGTTTCCA AACAATCAAT TGGTTGGCCG GAGCCGATTG GTATGCTCCC 960 

GGTGAATGGA TGATCTCAGG ACAATTCTCA ATGGAAAGCA TATTCAGGTA TAGGGATTTC 1020 

15 ATCTCCCAAA GACAACATTC TACCCTGATT ACTCTCAATG TTTCCAAGAA ATTCTTCGGC 1080 

. AGTACACTCC AACTTTCGGA CTTCACCTAC TACGACCTTA CGGGCAAAGG ATGGTTCAGT 1140 

CGCTTTGCAG CTGACTATGC CTTGAACGAT CAGATACATC TGATGGCCGG ATATGACTGG 1200 

TTCAGTAGTA AGGGCAGCGG TATATTCGAT CGCTACAAAG ACAATTCCGA ACTCTGGTTC 1260 

AAAGCCCGCT ACAGCTTC 127 8 



20 



35 



70 



(2i IN FORMAT TOM FOR SEO. ID HO: 89 



li) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

30 (ii> MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(IvJ ANTI-SENSE: NO 



Ivi) ORIGINAL SOURCE: 

(Al ORGANISM: PORYPHYROMONAS GINGIVALIS 



<ix) FEATURE: 
40 (A) NAHE/KEY: miscf eature 

(B) LOCATION 1...1392 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8S 

45 TACGGCAAAA GAAGAAAGCT CGGAACCTCC GTCCGACCTT CCGTTCTAAC CCAAATCAGA 60 

TTTATTTTGG ACTTACACCT TATTACTGAT TTTTTTGAAG GGCTTCGGGT CAATCCT AT C 120 

GGTGCAGCAG CCATAGTGGC TTTCATTATC GACCTGCTTC TTCTTTGCTG TTCGGCTTTT 180 

ATGTCCTCCT GTGAGGTGGC TTATTTTTCA CTAAAGCCGA TCGATCTGCA GAACATCCGC 24 0 

GAACGGAATC ACTCTTCCGA CATCGCGCTT TCCAATTTAT TAGACAATTC GAATCAGCTA 30 O 

50 TTAGCTACTA TTCTGATCGG GAATAATGTG ATTAATGTAG CCATCGTTAT CCTTTCCAAT 360 

TATGCCATCG AGCAGACATT CGTTTTCTCT TCTCCGATCA TTGGATTTCT GAT CCAGACG 420 

ATACTCCTGA CCACTGTTCT TTTGCTGTTC GGAGAGATTC TGCCGAAAGT GTATGCGCGG 480 

AAGAATCCGC TGCAATACTC GCGCTTTTCT GCTGCAGCTA TGTCCGTTAT CTATAAGATA 540 

TTGTCACCGT TTTCAAAATT GCTGGTCAAA AGTACCGGCA TCGTTACCAG AGGTATCAGC 600 

55 AAGAAGAAAT ACGATATGTC CGTGGATGAG CTCTCGAAAG CGGTAGCCCT CACCACTACG 660 

GAGGGAGAGC CGGAGGAGAA AGAAATGATT AACGAAATCA TCAAATTCTA TAATAAGACA 720 

GCCTGCG AAA TCATGGTTCC GCGTATCGAT ATTGTGGATG TGGATCTGAG CTGGCCATTT 780 

CGTAAGATGC TTGACTTCGT TGTTTCGTCG GGTTATTCCA GACTTCCCGT TTCAGAGGGG 040 

TCAGAAGACA ATATCAAAGG GGTGATTTAC ATCAAAGATC TAATCCCACA CATGGATAAA 900 

60 GGCGATGAAT TCGACTGGCA TCCTCTGATT CGTAAAGCAT ATTTTGTCCC CGAAAACAAG 960 

CGCATAGATG ATTTGCTCGA GGAGTTCAGA GCCAATAAGG TGCATGTCTC CATCG TTGTG 1020 

GATGAGTTCG GTGGCACTTG CGGACTGATC ACAATGGAGG ACATATTGGA AGAGATCGTC 1080 

GGCGAGATTA CGGACGAGTA CGATGAGGAA GAACTCCCCT TTAAGGTTTT GGGGGATGGC 1140 

AGTTATCTTT TCGAAGGAAA AACGTCTCTC TCCGATGTTC GACACTATCT TGACCTTCCG 1200 

65 GAAAATGCTT TCGGTGAATT GGGGGACGAG G TAG AT AC*Kl TAAGTGGGCT CTTCTTGGAA 1260 

ATCAAGCAGG AACTCCCCCA TGTGGGCGAT ACAGCAGTGT ACGAGCCATT CCGCTTTCAA 1320 

GTGACCCAAA TGGACAAGCG CCGAATCATC GAAATCAAGA TTTTCCCTTT CGAGCGCACT 1380 

TGGGAGGTCG AA 1392 



(2) INFORMATION FOR SEQ ID HO: 90 



(I] SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 798 base pairs 
75 (B) TYPE: nucleic acid 
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(C) STRANDED! 1 ESS: double 
{ D ) TOPOLOGY: circular 

(111 MOLECULE TYPE: 1>NA (genomic) 

I ill] HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

I A) ORGANISM: PORYPHYROMOUAS GIHGIVALIS 



15 



(xx] FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .798 



Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 90 



20 



25 



30 



ATAATT TATA 
TCCCCGATGT 
AAAACCATAC 
TTCACCTTCA 
TTAATTAATA 
GTACACACCG 
TCGCAATGGG 
TATCGGCTGG 
TTCGAAAAGT 
AGTATCAAAA 
ACAACTACGG 
GGAGCTATCG 
ATCATCTACG 
GGTATCGATA 



GATCAACAAT 
ATTCGCAAAT 
AAGGAACGAT 
AAAATACTGC 
AGCTTGAGTT 
AATACCGCTA 
CAGAAAGTAG 
TAAATAGTGA 
GGGAACAGCC 
GCCACCTGTC 
CTATTCACCA 
ACCTCAA AT A 
ATACTGCCCC 
TTTCGTTT 



GAAACTATTA 
GCTCTTCTCA 
ATTGCCCGTA 
CAATCTCAAT 
TTCTACCTAT 
TTTGTTGCAT 
AGGAATGAAA 
TAACTGTCTC 
AGCCACTAGC 
TATCAGTTTC 
GGGAAAGCCT 
CCATATCACA 
TATTCTACCT 



CTTTATCTCC 
GAGAATCTCA 
CTUGATTrCA 
CTGCTGATAA 
GGCAATAAAG 
CATGTTTTTG 
TATAAGGTTT 
ATGTTTGCAA 
CTCTTTGCAG 
AGACATCGGT 
GACAGTTATT 
CCTACGATAG 
GTGCGGAAAG 



TATTGGTCTT 
CAATGAATAT 
AAACCGAAAA 
AGCACGGTCA 
TAACCGTAAG 
AGGTTTATCC 
CTACGGGATT 
CATTGGGGGT 
GAACGTATGC 
TGGGTGAACA 
TTAAGAAGGC 
GAATACGCGG 
ATTACAACAC 



GTCGACTCTA 
AGACAGCACC 
GGAAAATGTG 
AGTAATCAAC 
TGGAGGATAT 
TTATGTCGAG 
ACAGTCGCGT 
ATTTTTCGAA 
ATACAGCCGA 
TTGGGAATTT 
ACGTTTTGGA 
GGCCTATCGG 
CGTTGATGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
79B 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(21 INFORMATION FOR SEQ in NO: 91 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2721 base pairs 
(Bl TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic} 

(ill} HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: miscjeature 
<B] LOCATION 1...2721 



(xlj SEQUENCE DESCRIPTION: SEQ ID MO: 91 



ACTATATGCG 
GGGGCGGAGC 
AAGGGGTTCG 
AGCCGATACA 
TCTTTCCTGT 
CACATATCCG 
AGCATCCGGC 
GGCTACTTCG 
GGTATGAAAA 
ATCGACATTT 
CGACCACTGG 
GCCAAGACGA 
CAGGGCAATA 
TCGACCATGG 
AAACGGGTGG 
ATCCTGGACA 
GCCAGTGTCA 
AAAGTCGGGC 



TTGCCTTCTT 
GGAAGACTCC 
ACTACAGACT 
TACACAATAA 
TCGTTCTGTC 
GTACGATCAA 
TGACAGGAGC 
TCATAGGCCT 
CCCATACCAT 
CTCTCGAATC 
TGAAGATGGA 
ACAACCTGCT 
TCCAGGTGAA 
TGAGCAGCAA 
AGGTCATCAC 
TCGTCACGGA 
GCAACAATCC 
TGACTACCAA 



ATCCGCTCCC 
CTCTCTCCTA 
ATTCACAAGC 
ACCAACAATC 
CATACTTTCT 
GGATGCCTCC 
AGATACCACA 
GCCGGCAGCT 
GCAGATTAGT 
CGAGGACAAA 
GATAGACCGC 
CGAAATGCTG 
AGGATCTTCC 
CCCGAAGGAG 
CGATCCGGGT 
AGAAGGTAAG 
CACAGCCAAC 
CTATAACTAC 



GTTGCGGCAT 
CCCCTTCTTT 
AATAAACTTA 
AT CCAAGCAA 
TTGATGGCAC 
TCCGGCGAAC 
CAGGTGTTCC 
CCCTCCTATC 
CGGGGAAATG 
CAACTCTCCA 
CTGTCCTATA 
CGCAACGTTC 
AACTTCAAAA 
GTCTTTCGCT 
GTAAAGTACG 
AAGCTGGAAG 
GGTAGTATCT 
TACGGTGGCA 



TGCGCGAGTC 
TCGAGCATTC 
AAGTGTTTGC 
TGAAACGAAT 
AGAACAATAC 
CAGTGCCCTA 
GACAAGT GAC 
ACCTGACAGC 
GACAGCACGA 
CCGTCACCGT 
ATATGAAAGA 
CTTTGGTAAC 
TCCACCTCAA 
CCATTCCTGC 
ATGCGGAAGG 
GATATTCAGG 
TTCTGACGGC 
AAAACAAGGG 



GCCGCCAATG 
TGATAAAGGA 
CACAGGTAAC 
CG1TTTATCA 
CCTCGATGTA 
TGCCACTGTA 
TGACGGCAAC 
TTCGTTCGTA 
CATCAAATCC 
ATCGGCAGCA 
TGACCCCGCA 
GGTGGATGGT 
TGGCAGGCCC 
CCATACGATC 
CACAAGTGCC 
TTCCATCACG 
AAAGTCGGGC 
CTCTCGCTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
790 
640 
900 
960 
1020 
10B0 
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TTTACCGAAC 
TTTGCCCGAC 
ACGGTGGGCG 
AGCTTTGCCG 
GCCGGATCAT 
CTCACCGTTT 
TGGAAGCGCG 
TCCGATGCGG 
GCACAT TCTT 
TATGAGATAC 
CCGTCGAACG 
TATCGTAAGG 
GCACTCTTTC 
CAGCTCACGC 
CGAATCCAAC 
CAAGTACAGT 
TATAATCAAT 
AACGCCATCC 
TATGGCAATA 
GCCGTATGGG 
GCACTCGGCA 
CTGCCGAAGG 
CAGACGAAGT 
AAAAAATTGA 
AGCCGGACCA 
GTTTCCCTCA 
CGTACGATCG 
C AAGGAAAT C 



GTACTACATC 
ACTTCGGCAA 
GCAATGTACG 
GCAGCAACCT 
ACGAGCTCAA 
CCTACCGCTT 
ATCCGCTCAA 
G CAT GG ACG A 
TGGAAGCAGG 
GACCATCCGA 
GAAAGTTCCG 
ATCAGTATTC 
CCGAAAACGC 
TCGGCTATAC 
GTCCTGCAAT 
ATGGTAATCC 
ACGGAGCCAA 
AGAATTACAC 
TCGGACGAGA 
TCAGGATTAT 
TTGATGTCAA 
ATTGGACTGT 
ATGATGGCAA 
GAGTCTCGCT 
TCGGCAATGG 
GCCTCACCTA 
TCAACGACGA 
CTACCGGCAA 



CATGCTCCAA 
TGCCCTCCTC 
CCTTT GGGAG 
CATGT CCTAC 
TGCCGACTAT 
CACTCACAAT 
CACAGCTAAT 
ACATACGGCA 
GCTGAAGTAC 
AGATGCTCCG 
GCACGATCAA 
TTTGCAAACC 
AGCAGCAGAT 
CCCCTCGCCC 
CGGCCAACTG 
CGACCTAAAG 
GGTCATGCTT 
CTTCTCCGAC 
GCATTCTTTC 
GCTCAACGGA 
TTCATGGTCC 
GAATCTCTTC 
TGTATTCAAC 
GAGCGCAAAC 
ATTTACTATT 
CAGCTTCCGT 
CCTCAAGCAA 
T 



ACGATAGAAG 
TCATTCGAGA 
ATGACCACCG 
ATAGACAGAA 
CACCACAGCA 
CCTAATAATA 
ACGATCCAGT 
CAAGTGGACT 
ATCTATCGTC 
TGGCAGCCCG 
TACATCGGAG 
GGCCTCCGAG 
TTCTCCCACA 
ATGAAGCAGC 
AATCCCTACC 
TCGGAGAAGC 
ACAGCAT CGC 
CCGGCCAATC 
AGCTTGAATA 
AATA'i'CGATC 
GGCATGGTAT 
GGAGGTTATT 
AATATCGGTA 
AACATTCATG 
TATTCGGAAA 
AAGATGAATA 
ACCTCATCCC 



AAGGCAAAGG 
TAGATTCGCT 
ACCGGAACAG 
AACTCAAAAC 
CTCGCCTGCC 
GCGAGACCTT 
ACGCCGGCCA 
ATACACGTCC 
ATGCCACGAG 
GCTCTCTATA 
CAGCCTATGC 
TGGAAAGCAG 
ACTCGTTCGA 
TTAAGCTGGC 
GGCTACAGAC 
GTCACCACGT 
TCGACTACGA 
CCAATCTGTT 
CCTATGCCAT 
GCACATTCCA 
AerCAGGCCT 
AT CAT GGG GG 
TAGCCAAACA 
CGAAGTATTC 
ATGCCGGTAT 
CACAAGTGCG 
AAGGACAGCA 



GCAAGAAACC 
CAATCTCTTT 
CGTAGAAAAA 
ACAGATGGAT 
GGGCGAATTG 
CATTGACCAA 
GCACTCCAAA 
CTTAGGACAA 
CGATCCTCTC 
TGCACAGAAT 
CGGCTACAAC 
CAGGCTGAAA 
CTGGGTGCCA 
CTATAACTTC 
CAACGATTAT 
CGGTCTCTCC 
CTTCTGCAAC 
CCACCAGACC 
GTACACGCCG 
AAAGAGCGAA 
GATGTTCACC 
AAGAAGCTAC 
GCTTTTCGAC 
GACATGGAAG 
ACAACGGAGT 
CAAGGTAGAG 
GGGTGGCGGA 



(2) INFORMATION FOR SEQ ID NO: 92 

ID SEQUENCE CHARACTERISTICS: 

(A) LENGTH ; 1350 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
fD) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

|lv) AHTI -SENSE: NO 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME 'KEY: miscf eaturs 

(B) LOCATION 1. . .1350 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 92 



TGGTGCCAAT 
CAACAAGGGC 
TGCCTTGGGC 
TTGATTGTGA 
CG ACT GGAGA 
TTCTTTCG GA 
CGGGTTTCCG 
GGAGTATCTA 
GATTTACCTT 
CAGGCCATGC 
ACGGCTCACG 
GAAGCCGGTG 
TGGATGAGCA 
ACTACTTCGA 
ATTGCCACGC 
GTCAATGGAG 
ATTTGGGGAT 
AGAGGGTGGG 
GACTATTGGT 
ACGTATGACA 
TATTCGTGGC 
CCTTCGGATC 



CCGACCCAAT 
GATCAGACCA 
GGTATGAAGG 
TGCTGCCTGC 
CGGACAGCAT 
ACAATGAATA 
CTTTTGCCTC 
CCCTGAACTA 
ATTGGACGGA 
TGAAGCCGAC 
GACTGATCGA 
TGCAATTTCG 
TGATTTTCAA 
AATT GTTATC 
ATCGCGGCGG 
CTGTCGGACT 
CTGCTTATGG 
GCGGTTATCT 
ACGGCAGGCA 
AACAGCCTCT 
GGATGGCACG 
GTTT TGCG AT 



ACCT CAT TGC 
AGACCGTTTA 
ATTTGGTGAA 
TGTGCTTAGC 
TCGTCCGAAC 
CAATGCACGT 
TTACTCGCTG 
CTGGGGGGCA 
CTATAACGAC 
GGCCACGACT 
ACCGATCTAC 
GGGTGATTGG 
AAATGACAAT 
GGGTGAAGGC 
GG AATACAAC 
TAAGCTTTCG 
TGTGGCAGCC 
TT CTCTCGG A 
TTACGTTTCT 
TACGAACGGT 
AAGTGTTTCG 
GAGCCACGCC 



CATGCAGTAT 
CCTCCCGTTC 
AGGATGAGAC 
GGGCAGCATT 
GAACTCTCGG 
TCGGTCAAAG 
CCGGCAGCAC 
AGTCGCTATC 
TATGTACGCT 
GCTCTCATGC 
AATCCTGAGT 
ACACGTTTCC 
CATCAGGAGT 
AAATGGCGAC 
TGGGCGCAGC 
TATCGCCCTC 
TTGTCAAG CG 
ATGGACTTGG 
CCCTTTGCTG 
TGGGGCGATT 
TTGGCGGCTG 
TTGGAACTGA 



ATCGAAACGC 
GAGGCTACCG 
TCTCTGCCAT 
ATTATTCCAT 
CATCGATCCG 
GTTATACGTT 
ATGGTGTGAA 
CGGCCGGTAT 
TGCGTATCCT 
TGGGCAATAT 
TGGATTTGAC 
GAATGGATGT 
CGTTTGTCTT 
TCGAACTGCC 
AGGATACCGT 
GTACCGACAA 
GAGGATACTT 
AGCACTTCGC 
CACCTTTCGC 
ATATTCGTCT 
TTGCTCGGGT 
CGATGCGTAT 



TCAAGGACAT 
GTATGCTCGG 
TCTTATCGCT 
GGCGGGAGAG 
AAGTGCGCTT 
GCCGGGTGCA 
GCTTTCGCTC 
CGCTTATTCC 
GCCTTATGTA 
AGCCGGTGCT 
GGCTGATCCT 
TTGGGT CAAT 
TGGCTTGTCC 
CTTGCAGGCT 
GCATACATGG 
ACCCATGCAG 
CCCTTACGAA 
TTTTCGTACC 
CAATTCCCTG 
CTATGCCGAC 
ATGGTTCCAG 
CGATCCCAAA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2721 



60 
120 
180 
240 
300 
360 
420 
460 
540 
60O 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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(2 1 I N FORMAT I ON FOR SEQ ID WO: 93 

UJ SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 1341 base pairs 
(B) TYPE: nucleic acid 
<C) STRAMDF.DNESS : double 
(D) TOPOLOGY: circular 

111) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE : NO 

(Vi» ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 
fix) FEATURE: 

(A) NAME/ KEY; misc feature 

IB J LOCATION 1. . .1341 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 



CCGTTTTCAC 
TCGACAGTCT 
GACTCGTCGC 
ATACCCGACC 
ATGCCGGCCA 
GAACGTGTAT 
TCGTTCTACG 
GAGCTGCGGA 
CGTCTCAACC 
ATCGCAGGTG 
GCGCGTTTCG 
CTTGTGGATT 
GAACAGCAGC 
CACGTGACAC 
AACGGTACCC 
GCCCAAGAGC 
ATCAACGGAT 
ACTGCTCCCT 
AAGTGGAACT 
GGCTATGCGC 
GGAGGGGTGG 
CGGCATTACC 
ATCTATCAGA 



CGCCATGGGT 
CAAGTACCGC 
CGGCATATTC 
AACGTATGAA 
TCGCCCAACA 
TCGGTCTGGA 
ACCAGCCGCT 
TGTCTGCTCA 
GTGCCGCCGA 
TGGACrGGCA 
GAGGGATAGA 
ACATGACCTG 
TGCAGCTACA 
CCGATGTCGC 
TCCTCGGAGG 
GTAATATGTG 
ACCTCGACCT 
ACGGCAAGGG 
TCCGCATTTT 
CTGCCCAATA 
AATATTACCC 
GGTACAGTGC 
TACCTTTCTT 



ATGTGCTGGA 
CGCCTCATCA 
GCCCATTGGC 
CAAATCGCl'A 
GACAGGACCG 
GCAGAAGCAG 
TGTCGATGAC 
TGGCGAACTG 
CGGCACTTCG 
CCCGAACGAC 
ATACGACATG 
CTATACTTCG 
GGTACTCAAT 
TACCGCCACG 
AGCACTGCAT 
GTACT TCACT 
CACCTACTCG 
CAAGACCCTC 
CGATCAGGTC 
CGGCGAGAGC 
TACGGAGACC 
GACCGAGACG 
A 



GCGGCTTCCT 
CACAGGCGAT 
TCTTTGTCCT 
TTATCATTGG 
GCCGAACGCA 
AAAAAGCTGA 
GAATCCCATA 
AACCGCCACC 
TTTGCCGACA 
AAGGTGTCTT 
AACCCCGTAG 
GGCGTGAACT 
GCTTACAACA 
AGCTACCCGC 
ATGCGTTACG 
GCGGGCAACC 
ATCGAGGGAT 
ACGGACGTCA 
AATCT CTTCC 
AGCCACACGC 
AACTTCCGTC 
GAAAGCACCA 



CAGTACGCAC 
CGGAGCACAT 
CATCTCACTC 
CATCCCTCA'i' 
GCGGCGAGCC 
AG GT GTACT T 
TCGGACACTT 
TCAGCTT CGA 
ATCTCTCCAA 
TCTTCTTCGG 
AGATCTACCA 
TCGCATGGAA 
ACCGCTTCGC 
TCCTCTACTC 
CCGTGTCGAT 
TGTTCAAT CC 
TGGACGACAA 
AGTACTATGC 
TCAAAGGCAT 
GCCACTCCTA 
TGTTCGTCAC 
ATGCTCTTCG 



ACGGCCATGC 

GCGATCGGCG 

AT TTGATTCG 

CCTGTGCGGT 

TTCTCTGGCC 

AGGCATACAG 

CAAGGTACAG 

CTGGCGACAA 

TGCCATCGAC 

ACGT CAGTAC 

GTACAGCGAC 

CTTCCACCCC 

CGACCGCTAC 

GGCACAGTGG 

GGCTCATCAG 

GGGCAAACGG 

AGGCATTATG 

TCTGGTATCG 

GTACGAGAAC 

CGGCTATATG 

CTACATAGGA 

CCCCGGTCTG 



(2) INFORMATION FOR SEQ ID NO: 94 

tl} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 
<B) TYPE: nucleic acid 
(C) STRAt IDEDNESS : double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv> ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PHY ROMONAS GINGIVALIS 
(ix) FEATURE : 

(A) NAME/ KEY : misc feature 

(R> LOCATION 1. . .681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
TATAAGCAAG CTATAATGAA ACATTTGTTT AAGTCGACAT TAGTACTTCT TTGTGCTCTT 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
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720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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1320 
1341 
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TCTTTTTCCG 
TTTTCT GTT A 
TATACATCTG 
AAGAAAGAAA 
GTCGATATAT 
AATAGAATAG 
GTCGGAATGG 
GATAAGCAGC 
CACATTAATA 
AAAAGGATCA 
ATTGGCATAG 



GTACCTATAC 
GGCTGGGACA 
CTAACGATAG 
ATGCAACCGG 
CCGGCAAAGA 
CCATTCCTAT 
CTTGGATGAG 
GOAAGTCGAG 
AGACCGTTTA 
ATGACTACCA 
GATTAAACCT 



CTTTGCACAA 
GGGATATATT 
ACTTTTGTCT 
CGTAGCATTT 
AAATACCTTG 
ACGCCCTATC 
TCGTCATGAG 
GTCGGGACTG 
CTTTATGGCA 
GCAAAAGGAT 
C 



GAAAATAATA 
GCAGGTTCAA 
GGTGCAATTT 
CGTTTCTTAT 
AATTATGCGT 
AAAAATTTTA 
CAAATTTACA 
GATTTTGGTC 
GGAACCGATC 
CGAACCTTCA 



CAGAAAAGTC 
CTACCAACCT 
AT CTGGGCTT 
CNCCCTCTCC 
•rTTACGTTGT 
ATTTCATCTT 
ATTCTACTTC 
TCGGGATGCA 
TTACGTCTTG 
TTGCACTTAT 



ACGATTTGAT 
GATGTATGGG 
GACACCAAG't* 
GGGTTATTAT 
CGGAGCATAT 
CTCTACAGAA 
GCAGACTTGG 
TCTGCAATHC 
CATGTTCGGA 
CGACAACAGT 



120 
180 
240 
300 
360 
420 
480 
540 
60C 
660 
681 
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(2) III FORMAT I OH FOR SEQ ID HO: 95 

(i) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 1218 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

Ui> MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAN I SH: PORY PHY ROMOt J AS GINGTVALIS 

t ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 . . .1218 

(xl) SEQUENCE DESCRI PTIQH ; SF.Q 10 NO: 95 



ATATTCATAG 
ATGCTGAAAA 
AAGGCTCAGG 
GATGCACGTG 
CAGTATTGGA 
TATACACCCT 
TACAAAT TGG 
GGAAAGTTGG 
TTTGCTGTCG 
CTGCGTTACA 
GCCTTTGCGG 
AGCTTGTGGT 
GGTGTCACGA 
ATCGATGACT 
CCTCCTATCA 
GAAACTTCTT 
GAAGAATTCC 
TTTGTTCGTG 
GCCGGTGCCG 
ATCCAGAGTA 
TTGCGCAATT 



ACCCCGATAA 
ATAAATTGGC 
AGCAACTGAA 
CGGCCGGTAT 
ATCCGAGTAA 
GGCTGTCCAA 
GAACAGACGA 
AGACTTTCGA 
ATTTGGGCTA 
TCCGCTCAGA 
CGGATATAGC 
CGTTGGGTTT 
GTTTTTTCAT 
ATAACAGCAT 
TGGATCAAAA 
CGATCAGCGG 
GTGAGATTAC 
C OS GAT ATT C 
GCTTCAAAAT 
ATCCGTTGGA 
TGTTCCAC 



GAATACAAAA 
CCCCTTGCCC 
TGTGGTACAC 
GGGGGATATA 
ATATGCTTTC 
GCTGGTCAAT 
GAATCAGGCT 
CGAATTGGGC 
TAGCCGCCAG 
CCAAAGCACT 
CGGTTATTTG 
CAACGTAAAG 
CCCTACTTCG 
CAATTT CAAC 
CGATCAGGCC 
TATATTCTCT 
ATGGGGACTT 
ATACCTGCAC 
GAACATATTC 
TCAGACTCTG 



CAAAACGAAC 
ATACTGTTCC 
ACCTCTGTGC 
GGTGTGGCAA 
ATGGATACGA 
GATATTGCCC 
ATTAGTGCTT 
GAATCCATGG 
TTGTCGGAGA 
CACAACACCG 
CAGAAGTATG 
AATATCGGAA 
TTGAATCTCG 
CTTGAACTTA 
GGGTATGAGG 
TCTTTCGGTG 
GGGGCTGAAT 
CCCACCAAAG 
CGTATCGATG 
CGGTTTACGC 



GAAATATGAT 
TTTTTGCTCC 
CAT CGCTGAA 
CGACGCCGGA 
AAGCCGGTAT 
TGATGCAGAT 
CTCTGCGTTA 
GAGAGGCCCA 
ACTTCTCCAT 
GAGAGAATCA 
TGCTACTGGG 
CGAAGATCTC 
GGACGGGGCT 
GCAAGCTGCT 
CTGCACTCAA 
ATGCGCCGGG 
ATAGCTATGA 
GCAATTTGCA 
CTTCCTACCT 
TTGCTTTCGA 



TATCAAGAAA 
AAAGGCTATG 
TATCAGTCCG 
TGCGT ATTCA 
TAGCTT CTCA 
GACCGGTTTC 
TTTCACATTA 
TCCCAATGAA 
GGCTGTTGCA 
GGCCGGAAAT 
TAATGCGGAG 
CTATGACGGA 
GTTGTATCCG 
TGTACCCACT 
GAAATATCAG 
AGGACTCAAG 
CGATAAATTT 
GTACTTCACG 
GTTGTCTACG 
TATGGATGGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1218 



60 {2 J INFORMATION FOR SEQ ID NO: 96 

<i) SFQUCNCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs 

(B) TYPE: nucleic acid 
65 (C) STRANDEDNESS; double 

(DJ TOPOLOGY: circular 

lii] MOLECULE TYPE: DNA (genomic) 
70 (lii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 



75 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROM ON AS GINGIVALIS 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



Hi S J n;;;: 35 ft- *■ O o rd! B U 
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(ix> FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION 1...1356 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 96 

CTAATAATCG AAAAGGAAAT GAAAACAACA GTTCAACAAA TTATTCTGTG CCTGGCTTTA 60 

ATGATGTCAG GTCTATTGGG CGGAAACGCA CAGACCTTTT GGGAAGAAAT AGCTCCTCCT 120 

10 TTTATCAGTA ATGAGCCTAA CGTCAAGTAT ATAATTCCCA ATATGGGGAT TGATTCAAAG 180 

GGAACAATCT ATGTAACCGT GACAAAAAGG ATTCAGCAGG GASCAAATTA TACTTCT G AG 24 0 

CAATTGGCTA TGTACTATCG ACCATTAGGT GATAATGAAC AGTGGTGGAA ACATGATCCG 3O0 

TATTTTGATG ACAAGATAGT TGCGGATATT CAGACAGATG CATATGGCAG AGTTTATGTA 360 

TGTACGACTT CTTCTCGAGA TCAAGAGTAT CAACTTTATA TAAACGAGCA GAACGAATGG 4 20 

15 AGGTGTATAT TCAAAACTTC TGTGTCTACA TATGAGCATG GTATGGCTGT TTTTCGCTCT 4 80 

TCGACAGGGG TGACTTATAT AGGTACCAGG CATCACATCT TCGCATCAGG TGTAAATGAT 54 0 

TTCGAGTTCA ACACTATCTA TGAAGACTCT ACACCTATGA GCTGTCGCTT TGCAGAGGCT 600 

ACGAATAGTG GCACCATOTA TCTGGCATTG ATGCATGAAA CCACAATGTC TACGACTATC 660 

CTTACTTATC AAAACGGTGA CTTCGTCGAT ATCTCGGAAA GTGAATTGAG TAACTCGATT 720 

20 ATTGCATCCA TGTGCTCTAA TAAAGAAGGT GATATAATAG CTCTTGTTAC TTCATATACA 7Q0 

GGATTTATGA GTGGAACCCT TGCGATCAGA AAAGCAGATG AAGGCAAATG GCAACTTGTT 84 0 

GGCGGAGATA TACAGAATGC GATCGTTCAA AATATATGCA TGATGGACGA CAACAAGATT 900 

GCTTGTGAAG TCTTCCGGAC TCCTAACGGA GTAGATGGTC GGACAAGGGT TTGTGTTTCT 960 

GACGCATCTG TCTTTGATTT TGAGTGGTAT GAAGATGAAA TATACGGAGG CCTGATATTT 1020 

25 GACACTTTCT TCTATAGCCC TTGGGACAAA CTTCT'lTATG CGAAATTTGG TGGGATTATG 1080 

CTCAGGAGTA AAGAGTCTTV TATAACCTCT TTCATTTCTC CGACAGTTGT ACAAGGAGTG 114 0 

GATGTCTATA CTTTGCCCGG GAAGATAAGG ATCGAAAGTG AAACTCCGGT GTCTGAGGTG 1200 

TTGCTTTTCG ACCTGGCTGG CAGGATGGTA CTTCGGCAAA CCATTGATAA TAAAATCTAT 1260 

TCGGACATAG ATACTAACGG ACTAAAGCGA AGCGGTATTT ACGTAGTCTC GGTGCGGCTC 1320 

30 TCTTCCGGAC AGGTATTCAG TCATAAGGTG CAGGTA 1356 

(2» INFORMATION FOR SEQ ID NO: 97 

35 (i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 993 base pairs 
(B) TYPE; nucleic acid 
tC) STRAW DEDMESS: double 
{ 0) TOPOLOGY: circular 

40 

(It] MOLECULE TYPE: DNA (genomic) 

(ili] HYPOTHF.TICAL : NO 

45 (lv) ANTI-SEHSE: NO 

(vi) ORIGINAL SOURCC: 

<A) ORGAWTSM: PORY P H Y RCMOl IAS GIWGIVALIS 

50 (lx) FEATURE: 

(A) NAME/ KEY*, mi sc_ feature 
(B( LOCATION I. . .993 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 



GGCCTGTACG GTATGTCTGT TGTACCTATA ATAATTTATC TTTGCGG TAT ATCAAATTAT 60 

GCAAGACTCA TGATAATCCG GTGTCTTATC CGTCGTCCGA GAACCGTCCT GTTCGGGTTG 120 

ATATTCGTGG TAGGTCTTTT CTCTGCGATG GCGCAAGAGA AAAAGGATAG TCTCTCTACG 180 

GTTCAGCCAG TGCCGAATAG CAGCATGGTG GAGCAGACCC CTCTTCTCTC CATTGATCAC 24 0 

60 CCCGTCCTGC CCGCTTCTTT TCAGAATACC CC.TACACTGA AAAGGTTTAG AGACAAACAT 300 

CTTTCCGATG CTTTGCTCAA TGGATTGAAG CCTCATCGCT CAT CTTTGCA ATTGAATGAG 360 

GAACTCAACT TCGCGGCAGA GCGTCGGGAT TTCGTTTCTC CCCTCTTGCA AACTCGCCAC 4 20 

GCTGCCGGTG TCCTTTCATG GCGACCGACC GATAGGATGC ATTTTTATAC ATCGGGCAAT 4 80 

ATCGGTCTTG GCCATGATTT ATTGACCGGT GTGCGCAAGG ACTTCGGATti GAATGCTGGT 54 0 

65 CCCGACTTCT TGCTGAGTCA AAATCTTACG GCACATGTCC AAGGCGGTTG GCAGCAGAAT 600 

TTCGGCTTTA TACCTATGAC GGCTGTCAAT GGCCAACTGC GTTGGCAAGC CACCGAGAGA 660 

TTGAGTTTTA CCACCGGTAT CGATTATCGA CAGGTACAGT GG AAT GCTTT CGATAATAGA 720 

ACGTTCTCGC TTAAAGGAAG TGCTCGATAC GAAGTGATGG ACAATGTCTT TGTCAATGGA 780 

TTTGGCAGCT ATCCTCTCTA CAGCAG TACG CGCTCAGGAC TCAATATGGC TCSTTCCGATG 84 0 

70 CATGGATTCG GCCCTCAGTA CGGTGGATCG CTTGAGCTGA AAGTCTCCGA GCGATTCGGC 900 

TTTGCCGTCG GTATGGAGCG CGAATACAAT ATCTGGACTC GTCGGTGGGA AACGCATTAC 950 

TTTGCTTATC CTGTAT TCTA TGGCGATAAG AAG 993 

75 12) INFORMATION FOR SEQ ID NO: 98 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 
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U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 987 base pairs 

( B) TYPE : nucleic acid 

5 <C> STRANDEDNF.SS: double 

(D) TOPOLOGY: circular 

{ii} MOLECULE TYPE: DNA (qenomic) 

10 (ill) HYPOTHETICAL: WO 

(iv) ANTI-SENSE: NO 

(vl) ORIGINAL SOURCE: 
15 (A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

Ux) FEATURE: 

(A) NAME/ KEY : mtscf eature 
4B> LOCATION 1 . - . 987 



20 



55 



70 



(x1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 98 



GAGACGAACT CTTGGGTATC CAGCGATTGC AATTCGACGA CGATGAAAAC GAATAGACGA 60 

TACCCATTTG TTTTGCCGCT TCTGCTACTC ACCGGATTGT TGGCATGGGG GCAGGATTCT 120 

25 TCCCACGGTA GCAATACAGC CTTTGCAACT GATTCTTCGA GTAGAGAGTT GCCCACGGAG 180 

CAGTCCGCCT ACCGCATTCA TTCTGCCTAT ATGGTCGGTG GTGGCGGAAG CATAACGCGC 2-5 0 

UACACCTATT TGTCACCCCT TCGTTATGGA GGATGGACAC TGAATTTGTT GGGAGAGAAG 300 

ACGTTCCCTC TCAAAGCCTC CUATTCCCGT TGGATGATCC GTACCGGGCA TGAGCTGGAT 360 

TTTGCCCTGA TGGACAATCC GGCCAATAAT GCTCATTTCT ATTCCCTGCT GTATAACGGT 420 

30 TCCGCTGCGG CTCTTTACCG CCTTGGCGCT AAGCATCTGC GAGCCGCGTG GAT GGACAAT 4 80 

CTGCGCTTGG CATTCGGCCC GGGCTTGGAA ATCGGGCTTG GAGGAATTTA TAGTACACGC 54 0 

AACGGCAATA ATCCTGCGAC ATTGAAGCTC TACACCAATG CCATCGCCCA AGCCTCGATA 600 

GGATACTACG TCCCCTCCGA AACTTTTCCC CTGTATTTTC GGTTGCTCTC CCAGATCAAT 660 

CTCTTCGGTA TAGCCTATGG AAATGGTTTT GGTGAGAGCT ATTACGAGAA TTTTTTGCTC 720 

35 AATAACGGCA TTGCAGGCTC CCTGCATTTC ACTTATCCGG GCAAGTTTAC TCGGTTCACG 780 

ACACT CAT AA CGGCGGATAT TCCCATTCGG AACTTCTGTA CGCTTCGTGT CGGTTATCGC 84 0 

TATTCCCATT TGGGCTCTTC GCTTAACGCA TTGGATACTC GAATCCACAG TCATACGGCT 900 

TTTAT CGGTT TCGTCACGGA GTTTTACCGA TTCCGTGGGC GCAAAGCCAT GAATACCGGT 960 

^ CGGAGAACCA GTCTTTACTA TCATGAT 987 

(2) INFORMATION FOR SEQ ID NO: 9 9 

(1) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 957 base pairs 

(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

50 (il) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A> ORGANISM: PORY PHYROMONAS GINGIVALIS 



( Lx ) FEATURE : 
60 (A) NAME/ KEY : miscteature 

( B) LOCATION 1 . . . 957 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

65 TTTACGTCCG GTACGATATT CGTCCGTATA GCCATATCCG GCCGTGTAGT GTGCCGTGAG 60 

GTTCAGGATA ACGGAAGGAG AGAAGCTGTG CGTCATGATG GCATGGTAGT GACGCTGCTC 120 

GTAATTGTCG CTATTGTGGT AGTAGTGCGG TACTCCTTGC GCGTCCACGT ACATAAGACC 180 

GGCACTGTTG TATCGGCGGC CATATTTGGC TTCATCCTCC TTGGAAAGAC CGTTCCATGC 24 0 

GATACCCGTA ACTTCTTTTC CTCCGAAAGT GATGAACCTG AGAGCCGTGT TGCTACCGAA 300 

ATAGCCCACC TGTGCGAAAT AGGATTTCAG ATCCACGCTT CCTCTATCCA CGTAGCCGTC 360 

CGAACCGATT TTGGACAGGC GGGCATCCAC TGCCCAATGG CGACCGATGC GTCCGCTACC 4 20 

GAGTTTGACC GATCGGCGGA ATGTGCCGAA CGAACCTCCG CTCAAATCGA CACGGCCATA 4 90 

AGGAGCCAGT CCCAAATTAT CCGTACGCAT ATTGACACTT GCCCCAAAAG CTCCGGCACC 540 

ATTGGTGGAA GTACCCACAC CTCGCTGCAC CTGAAGGTCT TCGATGGAAG AGGCGAACTC 600 

/t> G6GCATATTC ACCCAAAAGA CGGACTGAGA TTCGGAGTCG TTGAGGGGTA CTCCATTGGT 660 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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AGTTATGTTG ATGCGATTGG CATCGGTGCC ACGCACGCGA AAGCCGGAAT ATCCGATACC 7?0 

CGTACCGGCA TCGCTGGTGG CTACCACGGA GGGAGTCAGC ATCAGCAGAT AGGGGATGTC 780 

ACGACCATAA TTGGACTTGG AAAGTTCGGC CTTGCGAACG TTGGTGTAAG CGACAGGGGT 84 0 

TTTCGCCGTG GCGCGAGTAG CTACGACCTG TACG&T CI' G G AGCTGCACAT TGCTAAGACT 90 O 

ATCTATCTCG CTGTTGGAGA CGGGTGCTTG TGCCGTCAGG CAGAAAGGCA GGACGGC 957 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1842 base pans 
(E) TYPE: nucleic acid 
(C) STRAMDEDNESS: double 
(D> TOPOLOGY: circular 

(ii) MOLECULE TYPE : DfJA (genomic] 

(ill) HYPOTHETICAL; HO 

tiv) A1ITI -SENSE: NO 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVAL IS 

llx) ir*EATURE: 

(AJ NAME/ KEY: misc feature 

( B ] LOCATION 1...184 2 



30 



35 



40 



45 



50 



55 



60 



(Xl) SEQUENCE DESCRIPTION: SEQ ID HO: 100 



CCATATAATG 
AACATTCAGC 
GCCGTGAGTC 
AAGGAAGTGG 
GGGTACAAAT 
GTGGGAGACG 
TTGTCTCACC 
AAAGACGAAA 
GTATTCGGTA 
TACGATGCAT 
AAGAAT G TAG 
GAAATCATCG 
TCTTACGCAG 
TCATGGGGTC 
GTTATCCTCG 
CCTCATCCTT 
GTTGTGGTGA 
ATCCACGTAA 
GTAGGCGACG 
CTCGGTCTGA 
GGCTCTCGTC 
GAAGAGGGCG 
CACCCCAGCG 
AACGAAAACC 
TTCGAGACGG 
AAGGGTGCTG 
GTGAAGGAAG 
TTCAATAAGG 
AAAATGGCTC 
GAAGCTGCTG 
CTCGGCGAGC 



TCCAATCTAT 
CCAGAGAGGA 
GTCAGGAGCA 
TAATGGGTAG 
CGGAAGGTGT 
AAGTGGAAGT 
GCAAGGGTCG 
TCGTAAAGGG 
TCGAGGCTTT 
TCGTTGAGAA 
TTGTTTCCCA 
GCAAGCTCGA 
TATTTATCGA 
GTGTGGCTCA 
ACTTTGATGA 
GGGATGCTCT 
TGGCAGATTA 
GCGAAATGTC 
AAGTGGAAGC 
AGCAACTCAA 
ACCATGCTCG 
TAGATGGCCT 
AGTTTACGGA 
GTCGTCTCAG 
TATTCACTGT 
TCGTTTCTCT 
ATGGCTCACA 
ATGCCAAG CCi 
AGCGTGAAGC 
CCGAAGCTGC 
TGGCCGCTTT 



TAGCAACAAG 
TTTCAACTGG 
GGAAGCTGCT 
GGTAACTGCT 
GGTACCTGCA 
TTATATCGAG 
TGCCGCTCGC 
CTATGTGAAG 
CCTCCCGGGA 
G AC GAT GG AG 
CAAGGTGCTC 
AAAAGGGCAG 
CCTCGGTGGA 
TCCGGAAGAA 
AGATCGCAAG 
CGACAGCGAG 
CGGTGCTTTC 
ATGGACACAG 
CGTGATCCTG 
CCCGGATCCT 
TGTTCGCAAC 
TATCCATATT 
AGTAGGTGCT 
CTTGGGTCAC 
AGGATCTATC 
GCCTTACGGT 
GGCT GTACTC 
AATCATTGTA 
CAATGCAGAG 
CAATCCTGCA 
GAAAGAAAAG 



ACGATTAAAA 
GAAGAGTTTG 
TATGACAAAA 
ATCAACAAGC 
ACAGAATTCC 
AATCAGGAAG 
TCTTGGGAGC 
TCTCGTACCA 
TCACAGATCG 
TTCAAGATTG 
ATCGAAGCAG 
GTACTCGAAG 
GTGGATGGTC 
ATCGTACAGC 
CGTATCGCTC 
CTTAAGGTAG 
GTTGAGATTG 
CACTTGCGTT 
ACGCTCGACC 
TGGGCTGATA 
TTCACCAATT 
TCCGACCTTT 
GATATCGAAG 
AAACAGTTGG 
CACGAAGGAA 
GTGGAAGGTT 
GAAGAGAAGT 
TCTCATAGCC 
CGTAAGGCTG 
CAGGCTGTAG 
CTTTCAGAAA 



AACAAATGGA 
AGGCCGGTGG 
CGCTCAATAC 
GTGAAGTGGT 
GCTACAATCC 
ATAAGAAGGG 
GCGTGAACGA 
AGGGTGGTAT 
ACGTGCGCCC 
TGAAAATCAA 
AGCTCGAACA 
GTATCGTCAA 
TTATCCATAT 
TGGATCAGAA 
TCGGACTCAA 
GCGATAAGGT 
CACAGGGCGT 
CTGCTCAGGA 
GCGAAGAACG 
TCGAAACTCG 
TCGGTGTATT 
CTTGGACGAA 
TTCAGGTAAT 
AAGAGAATCC 
CGGTAATCGA 
TTGCCACTCC 
TACCTTTCAA 
GTGTATTCGA 
AAGCCAAAGC 
AGAAAGCCAC 
AC 



AAACTTAAAG 
CGTCCATGCT 
CATCAAGGAA 
TATCAATGTA 
CGAACTCAAA 
CCAGCTCGTC 
GGCTCTCGAA 
GATCGTCGAT 
CATTCGCGAC 
TCAAGAATAT 
ACAGAAGAAA 
GAATATTACT 
CACTGACCTT 
GATCAATGTC 
ACAGCT r. AT G 
GAAGGGTAAA 
TGAGGGTCTT 
CTTCCTGCAT 
CAAAATGTCG 
TTTCCCTGTA 
CGTTGAGATC 
GAAGATCAAA 
CGAGATCGAC 
TTGGGATGTA 
AGTGATGGAC 
GAAGCACATG 
GGTTATTGAG 
AGATGAGCAG 
GGCTCAGAAA 
TCTCGGAGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
7 20 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1600 
1740 
1800 
184 2 



65 



70 



<2) IHFORJIATION TOR SEQ ID NO: 101 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 baae pales 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



75 



(ii) 1-tOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AV) 
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liv) Aim -sense: no 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPKYROMOHAS GINGIVAJ.IS 

5 

l ix } FEATURE : 

(A) NAME/ KEY : misc_featuce 

(B) LOCATION 1 . . .72 9 

10 (xi> SEQUENCE DESCRIPTION : SEQ ID HO: 101 

TCGACAATAA TGAAAAAAGC TATTCTTTCC GGACCGGCCT TGCTCCTCGG CCTATGTGCC 60 

AACGCACAAA ACGTGGAGTT GCACTACGAT TTCGGTCATT CCATCTACGA CGAACTAGAT 120 

GGACGTCCCA AACTGACTAC CACAGTGGAA AACTTCACAC CCGACAAATG GGGAAGCACC 1B0 

15 TTCTTCTTCA TCGACATGGA TTACACGGGC AAGGGTATCC AGTCGGCCTA TTGGGAGATT 24 0 

TCGCGCGAAC TGAAGTTTTG GCAAGCTCCC GTTTCCATTC ATTTGGAGTA CAACGGAGGC 300 

CTCTCCACAA GOTTTACTTT CGGACACGAT GCTCTAATCG GTGCCACCTA CACGTACAAC 360 

AACCCCTCCT TTACACGTGG ATTTACGATC ACGCCCATGT ACAAGCATCT GCGTGCGCAC 420 

GACTTCCACA CCTATCAGAT CACCGGCACT TGGTACATGC ACTTTCTGGA CGGTCTGCTT 4 80 

20 ACCTTCAACG GCTTCCTCGA TCTTTGGGGT TTCCCCCAAG AGAACCCAAT CGGGGGCCCT 5 -JO 

GTGCTCAAAG AAGGGGATAA GTTCGTATTC CTGTCCGAAC CGCAGTTCTG GATCAACCTC 600 

AATCGCATCA AAGGCATCGA CAAGGATTTC AATCTCAGCA TAGGGACAGA GATGGAAATC 660 

AGCAGGAACT TCGCTCGCAT GGACAAATTC TCCTGCATCC CTACTCTTGC GGTCAAATGG 720 

ACTTTCAAC 729 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 102 



(i> SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 705 base pairs 

(Bl TYPE: nucleic acid 

(C) STRAW DEDNESS: double 

(D) TOPOLOGY: circular 

35 {ii> MOLECULE TYPE: DMA (genomic} 

(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



{Vi} ORIGINAL SOURCE: 

(A) ORGANISM: FORYPHYROMONAS GINGTVALIS 



(ix) FEATURE: 
45 (A) NAME/ KEY: miscf eature 

(B) LOCATION 1. . .705 

Ixil SEQUENCE DESCRIPTION: SEQ ID NO: 102 

50 ATGAAAACAA TTAGTAAGAA CCATGCGGCA CGGATCTGTG CCGCCATTGC TTTGTTTGCA 60 

GTGTGTAACG GCCGGATAGC TGCTCAGGAT TTTCTCTATG AAATAGGAGG AGGTTTTGGT 120 

GCTGCTCAGT ATTTTGGCGA TGCAAACAGA GGCTTGTTCG GTTCATCCGG AGTAGGTTTG 180 

GAGTTGGTCG GACGTTACAA TTATAATTTT CGCTGGGCTT TCAGTACCAT GTTGGATTGG 24 0 

CGTACATTCA GAGGCGATAC CGATAAGTCC GGGAATGTCT TCCCCGATTT TGCTCAAGCG 300 

55 GATTTTAAGG TCGGCTTGAC TCAGCTCCAC GTTAGAAGCG AATTTAACTT TCTCCCTTAT 360 

AGCGATGGCT ATAAGTATCT TGGTACAGCT CGGCTGTCTC CTTATGTAGC GGCCGGGTTG 420 

TCTTTGGGTT TTGCTTCGGG TGCTAAAGCT TCGGCTTTTG CTCCCGGGAT TACTGCGGGA 480 

ATGGGAGTGA AGTATAAGCT TAAACCGCGG ATCAATGTCG GTATCGAGTA TTCTTTCACG 54 0 

GGGTTACTTA CCGATGCTTT GGATGCGCTG ACGGATAAAA GTGTTTGGCT CGAGGATCCA 600 

60 TATAAGATCA ATGACTCCTG GGTCAAAAAC AAGGATGCTA CAGGGGCCTT AGTGCTTAGG 660 

ATTACGTATG ACTTCGGCCT GCGTAAGACT TTTTGTAATA AACAA 70S 



65 



75 



(2] INFORMATION FOR SEQ ID NO: 103 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B ) TYPE: nucleic acid 

(C) STRAN DEDNESS : double 
70 ID) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
Uii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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5 
10 
15 
20 
25 
30 
35 

* 40 
45 
50 
55 
60 
65 
70 
75 



(iv) ANTI-SENSE: NO 

(vi } ORIGINAL SOURCE: 

(A) ORGANISM: P ORY PHY ROMON AS GINGIVALIS 

UxJ FEATURE : 

(A) NAME /KEY ; tniscj ea turc 

(B) LOCATION 1. . .1309 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

AATATAATGT ATAAAGACTA TAAGGGTTTG TATGCGTCGC TTCGGTGGTA TGCCCTGATC 60 

ATTGGGTTGC TATTTGCAGC AGACGGTATA CAGGCTCAGA ACAACAACTT TACCGAGTCG 120 

CCTTACACIC GCTTCGGCCT TGGCCGTCTC GGAGAACGGA CGACTAT TAG TGGGCATTCC 180 

ATGGGAGGAC TCGGCGTCGG TCTGCGTCAG GC.CACATACG TCAATGCCGT CAATCCTGCT 24 0 

TCATACTCGG CTGTGGATTC GATGACGTTT ATCTTCGATT TCGGTGCATC TACCGGAATT 300 

ACGTGGTATG CCGAGAACGG GAAAAAGGAC AATAGGAAAA TGGGAAACAT TGAGTATTTC 360 

nCCATGCTTT TTCCTATTTC CAAATCCATT CCTATGAGTG CGUGAGTGCT TCCTTACTCC 4 20 

GCATCCGGGT ACCAGTTCGG ATCCGTTGAT CAAGTGGAAG GAGGCAGCGT CCAGTACACC 4 80 

CGTAAATACT TGGGGACAGG CAATCTGAAC CiATCTCTATG TCGGTATAGG TGCAACCCCG 54 0 

TTCAAAAACT TCTCAATAGG AGCCAATGCT TCATCCCTTT TTGGGCGATT CACACACAGC 600 

AG GCAGGTAA TCTTCTCCAC GGAGGCTCC7T TACAATCCCG TACATCTCTC GACGCTGTAC 660 

TTGAAGGCTG CCAAGTTCGA CTTCGGTATG CAGTATCACC TTCTTCTCAA ATCAGATCGT 720 

TCGCTCGTTA TCGGTGCCGT CTATTCTCCG CGGGTGAAGA TGCATAGCGA GCTGACTCAG 7 80 

ATAAAGAATC AGGTTCAGAA CGGTGTAGTA GTGGAGAGCG AAACCCAAGA ATATATCAAG 040 

GGAATGGACT ATTATACCCT GCCTCATACA TTGGGGATAG GTTTTTCTTA TGAAAAGAAA 900 

GATAAACTTC TCTTAGGAGC AGACGTCCAA TATAGTAAAT GGAAAGGCGA GAAATTTTAT 960 

AAATCCGATT GCAAATTCCA GGACAGAATA CGGGTATCTC TCGGCGGAGA GAT CAT ACCG 1020 

GATATAAATG CCGTTGGGAT GTCGCCTAAA GTTCGCTATC GCTTCGGTTT ACATGGTGAA 1080 

AATTCTTACC TGAAAGTGCC GACTAAAGGC GGTGTATATC AAGGATACCA TATCGTAGGT 114 0 

GCTGTATTCG GTATAGGAAT CCCGCTCAAT GACAGACGTT CGTTCGTAAA TGTCTCTCTT 1200 

GAATATGACC GATTGATCCC GAAGGAGGGT AT GATCAAAG AAAATGCTCT GAAATTGACC 12 60 

TTCGGCCTCA CGTTCAACGA GTCATGGTTT AAAAAGCTGA AACTGAAC 1308 



(2) INFORMATION FOR SEQ ID NO: 104 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE; DMA (genomic) 

(iiil HYPOTHETICAL : NO 

(lvj AHT1 -SENSE: NO 

(vi] ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

Ux) FEATURE: 

(A) NAME /KEY : misc_f eatur e 
(Bi LOCATION 1...2835 

txl) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

AATACCATTT CGGAGAATGG GAACGATAGC ACCTATTTTT CTTTTCAAAC TTTTCTTATG 60 

CGATCGATTT ATCAATTACT GTTGTCAATA CTCCTTGCTT CTCTTGGTTT CGTCGGGCTG 120 

GAAGCCCAAC AAGCCGGAGT AGCAGGTAGA GTATTGGACG AAGAAGGCAA CCCCATGATT 180 

CAAGCCAACG TACAGCTTGT ACAGAGTACC GGCCAAGTAG CCGTTGCCGC AGGTGCCACT 240 

AATGAAAAAG GGTTGTTCAG CCTGAAAACG TCACAGGAGG GTGACTACAT TCTGCGCGTT 300 

TCATATGTAG GTTACACTAC CCACGACGAA AAAATATCTC TTAGAAACGG GCAAACCATT 360 

ACGCTCAAAG ATATATCCAT GAACGAAGAT GCCCGTCTTC TACAGAGTGT GACGGTGCAG 420 

GCTAAAGCGG CAGAGGTCGT GGTACGCAAC GATACGCTCG AATTCAATGC CGGATCCTAT 4 80 

ACCGTAGCAC AGGGAGCTTC TATCGAGGAA CTGATCAAGA AGCTACCCGG AGCAGAGATC 54 0 

GGATCCGATG GGAAGATCAC CATCAACGGC AAG GACATT A GCAAGATCCT TGTCGATGGC 600 

AAAGAGTTTT TCTCCAAAGA TCCACAGGTG GCAATAAAGA ATCTTCCGGC CGATATGGTC 660 

AATAAAGTAC AG GTACTG AA CAAACTGAGC GAGCTCTCGC GGATGAGCGG TTTCGATGAT 720 

GGAGAAGAGG AGACCGTAAT CAACCTGACG GTGAAGCCCG AAAAAAAGAA AGGCCTCTTC 7 80 

GGAACGCTTC AGGCCGGCTA CGGTACCGAC CAACGCTATA TGGCCGGAGG GAACGTCAAT 840 

CGGTTCGATG GAAATAAGCA ATGGACATTG ATCGGTAGTG CGAACAATAC GAACAATATG 900 

GGCTTTAGCG AGATGGACAG CGAGATGGGA TCCATGACCT TCTTCTCTCC CCAAGGCGGT 960 

GGTCGACGCG GCTTCGGCAA TAGTGGAGCT GTTACGTCTT CGTCGATGCT GGGCGGCAAC 1020 
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TTCAGTGTCG AATT CTCCTC TGCCCTTAAT ACAGGACGCG ATGCACGCTA CGGATACAAC 1060 

GACAAGGCCA TAGAGACGAC CAAACGCGTG GAAAATATCC TCGCCGAAGG GAATACTTAT 114 0 

ATGGACGAAA ATATATTGGA ACGCTCTTTC TCTCACAATG GTCAGGCGCG ATTTACGATG 1200 

CAATGGAAAC CGTCCGAACG TACCGAAGTG GTATTCGAGC CGGATCTTTC GATATCCAAG 12 60 

5 ATCGATGOGT TCTTTAACGA CACATACGAG ACGAAAGATG CCACCGGAAT CTCTATCAAC 1320 

AAAGGTTCTA TCCACCAAAC TACACAAGGA AACAACTTCA GACTGAACGG AGAATTGGAT 1380 

ATCAGTCACA AGCTCAACGA CGAAGGCCGT ACGATCAGTG CCTCCGTCAG TGGCGGTCTG 14 4 0 

ACCGACGAAG ACGGAGATGG CATATATCAG GCTGTGCTCC AAAGCGTGGA GACGAATCAA 1500 

AAGCAATTCA ACGACAACTC CAACCTGCAA TAT CGGCTTC GCCTCTCGTA TGTGGAACCG 1560 

10 TTGGGTAAAA ACTACTTCGC ACAAGCGATT CTGAACAGAC GTTTCTCCCG TCGCAATTCG 1620 

GATCGTGAGG TGTACCGACT GGGCGATGAC GGGCAATACT CCATATTAGA CAGTCAGTAC 1680 

GG ACT CTCCT ACAGTAACGA GTTCACCCAG TAT CGCATCG GACTCAACCT CAAGAAGATT 17 4 0 

GCCAAAACGT GGGACTACAC CGTAGGATTC AATGTGGATC CCAACAGAAC TCTCAGCTAT IB 00 

CGGAGCG TAG CCGGAGTAGA GCAGGACAAA CTGGCTTTCA ATCGTGTCAA TCTCTCCCCG 1860 

15 ATGCTCCGAA TCAACTACAA ACCGAGCAGG ACTACCAACC TCCGAGTGGA CTACCGAGGA 1920 

CGCACGACAC AACCATCCAT CAATCAGATC GCTCCCGTTC AGGACATCAC GAATCCGCTA 1980 

TTCGTGACGG AAGGCAATCC CGGTCTGAAG CCGAGCTATT CC AACAATGT GATGGCCATG 2040 

TTCTCGGACT TCGATGCCAA AAGTCAGCGA GCTTTCAACA TTGTTTTCTT CGGCAACTAT 2100 

ACATTCGACG ACATCGTCCC CAATACGCAC TACGATCCGT CTACAGGGAT CCGTACCACT 2160 

20 CGTTACGAAA ACGCCTCCGG TACGTGGCAA GCGAATCTTC ATGGGACACT ATCGCTTCCA 2220 

CTCAAGAACA GGGCATTTTC TTTCAGGATG TCCTTGTTCA ACAGGTTGGC CGAAGGACAA 2280 

AGCTTCATCA AT G AC GAT AA GAACAAAGCT CTCTCTTTCC GAACGAGGGA ACGCCTGACG 234 0 

CTGACCTATC GCAACAATTG GATCGATACG AGTATCGGTG GCAATATCGG ATTCTATATG 2400 

GCGAATAATA GTCTGAGCGG ACAGAAAGAT TCTCGCACAT ACGATTTTGG CGGCAATTAT 24 60 

25 CAAGTTGCCC TAACCCTTCC CTATGGATTC CGTATCGACA GCGATGTTGA ATACAATACG 2 520 

AACTCCGGTT ACAGCGGAGG ATTCAGTCTG GACGAATGGC TTTGGAATGC TTCGCTTTCA 2580 

TACAGCTTCC TCCGTGACAA GGCCGGTACA CTGCGTGTCA ATGGCTATGA CAT CCTCGGT 2 6-10 

CAGCGGTCAA GTATCAGCCG TTCTGCTTCG GCCATCAATA TAGAAGAGAG CATGTCCAAT 2700 

ACGATCGGAC GCTACGTGAT GGTGGACTTT ATCTACCGAT TCAACGCCTT CAGTGGTGGT 27 60 

30 GGATCTCGCA GCGATCATCA GCGTGGCAAT ATGAATCGTC CGGGCCCACC TTTCGGCGGT 2820 

GGCAGACGAC CGTCC 2835 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 105 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 
(B| TYPE: nucleic acid 
(C) STRANDEDNESS: double 
40 <D> TOPOLOGY: circular 



(ii) MOLECULE TYPE: DHA (genomic) 

(ill) HYPOTHETICAL: HO 

(iv) AHTJ-SENSE: NO 

(vl) ORIGINAL SOURCE: 

( A ) ORGAN I St I : PORY PH YROI ION AS GI HGI VALI S 



(ix) FEATURE : 

{AJ NAME/ KEY : misc_feature 
(B) LOCATION 1...1236 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 

GGAGAGT AT C CTGCAAACAG TAACGACAAA AAAGAGATGG TTATGAAGCT GATTAAAAGA 60 

AGTTTGCTCC TGCTTGGAGC GGTACTGCTG ATTACGCTTC CTGCGTACTC GCAGAATGAT 120 

6ACATCTTCG AAGATGACAT CTATACATCG CGAAAAGAAA TACGTAAACA AAACCAAGTT 100 

60 AAAGACTGGC AAAACCAAGA GGACGGATAC GGCGACGATA CGGAATATAC AGTGGCTTCC 240 

GATCGCGACA TTGACGCCTA CAATCGTAGA GATGGCCAGT CCTACCATGG GAAAAAGTTG 300 

TCCAAAGACA AGAAAAGAGA CTCCACTCGT TCTTCTGTTC CCGGTCGCTA TAGTCGCCGC 360 

TTGGCTCGAT TCTATAAGCC GAATACGATC GTCATTTCAG GTGCCGACAA TGTATATGTA 420 

ACTGATGATG GTGAGTATTT CGTCTATGGA GACGAATACT ATGATGACGC GTCGTCTGTA 4 80 

65 AACATTTACA TCAACAGTCC TTGGTGCGAT CCGTTCCCTT ATACGTCATG GTATCCATCT 540 

TTCTCCGGCT GGTACAACTA TACGTGGAAC TATCCATGGT TCTACTAOGG TAGCCATATC 600 

GGATGGGGCG GTTATTACCC CGGATATAAT TGGTATTGGA GCTACTACTA TGATCCTTTC 660 

TACAATCCCT ATGGAATCGG TATGGGTTGG GGATAT CCTT ATGGCTGGGG CAGCTATTAC 720 

GCTTGGGGTG GCTATCCGGG AGTGATACAT CACTACCACC ACTACCCCAA GAAGACCTAT 7 80 

70 TCCAATGGTC AGCATTCCGG AG CTT ACT AT TCTTATGGCC GACCGAATCG TAT CAAAGGT S40 

GGAACGTCCG GTGCCAAACT TGGGACAGGA CGCTACGATA GAATTCAAAA TTCGTCTTCG 900 

CAAAAAAATA AGTTCGGATT GCAGTCGAAC AAACCCAATA ATAATCTGCA AAATGTCAAG 960 

TCGGGACGTA CCGGCCGAGC CAATAGAGAC CGAAATATAG AAACGGTAAC TCCAAACAAC 1020 

GGGCAAAAGC AGAATCGTCC CGTATTCCAG CAGAATCAGT CCGGCAATGA CCGACCGACC 1000 

75 GGACGGAATA TCCGCAGCGA GAGACAGGGG GAAAATAACG ATAGGACATT TTCGACTCCT 1140 
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TS^T^ ATAGTAACGG TGGCTTCTCC ACGCCTTCTC GCTCTTCTTC CGG^TCT AT G 
AGCGGAGGTG GCGGACGTAG TGGCCGGGGA CGCAAT - i d AT G 

(2) INFORMATION FOR SEQ ID NO: 106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1803 base pair^ 

(B) TYPE: nucleic acid 
tCJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic ) 

(ill) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO v 

(vij ORIGINAL SOURCE; 

(A) ORG AH ISM: PORYPIIYROMONAS GINGIVALIS 
(lx) FEATURE: 

(A) NAME/ KEY : mifc feature 

(B) LOCATION 1...18 03 

[Ki) SEQUENCE DESCRIPTION: SEQ TD NO: 106 



1200 

1236 



AGTAATAGCA 
ATGATCCGCA 
GTACCAACGG 
TCTTTGAATA 
CAGAGCGATA 
TATATCATGG 
TATAAGCCCC 
AAGGGAAGCT 
GGTATGGTAT 
GGACGTATAG 
CGTAACTACT 
TCTGATCTGG 
TTGGCTATCG 
GTAGGCGAAG 
ACCAACTTTC 
CCCTCGGCAG 
TACT CCAAAA 
CGCAGCAAGC 
CAAGCTCACA 
TGGGCTTTCC 
TACGGTACCG 
AAAGAGAATC 
GGCGACCTCT 
AACTTTACGC 
GGAGAGAAGC 
AGTAATAAGG 
GACTGGATCT 
GAGCAGTATA 
CACGGAGCAC 
GGAGGTGTAT 
CTG 



GCTCCCACAA 
AGTTGATCTT 
ACAGCACGGA 
GGGATGATGC 
TGCTGCTTTC 
GTATAATCAA 
TGCCCGGACA 
ATCATTGGGC 
TCCGCACCTA 
TACTCACTCC 
TCGACCCCAC 
AGCTGAATGT 
GGGGATCGTT 
ATCGCAAGCG 
AAAAAGGAGG 
ACAATGACTA 
AAGGGAT GAG 
GAAGTGCCCA 
CTTATACGCT 
AAGGTGAACT 
GCTTGCGTAT 
CCGACGAACT 
ATTATTCGGA 
TCACCTACTT 
CTGAGAAGAT 
TAGCCCTCCG 
ACGGCATGGC 
ATATCGGAGA 
ATCGAGTAGC 
GTCGTGTGGT 



GTGGTTAATT 
ACTGCTTGCT 
ATCGAAAGAC 
TCCGGATAAA 
TACTGCCCAA 
GAACAATTAT 
TGAACCCGAG 
GGAGCTGACT 
TGAAGAGCGC 
TTTTGATGGA 
GGGCAAGGTA 
AGAGCGTTGG 
CGTTTCCAAA 
ACTCAACCTG 
TCTCGCCCTC 
TATCTACCAC 
TATCCTGTTG 
GCTCACACCG 
GGCGGCCATC 
GCGTTACAAC 
CAACGTTTCG 
GATCGGAACG 
TATAGATGTG 
GAATCAGATC 
CTATGCCAAT 
TACCGAACTG 
CGAGCTCTCT 
GACCAAGAAA 
TTTCTCTGCA 
CCCTGAGACT 



TAT TAT CAT A 
CTGATGCCGG 
AATCGTATCC 
TGGCAACCTA 
AAGTCCAA3A 
CTGGAGTTTG 
ATGGGGCGTG 
ATGGGAGACT 
AACCTCGGTA 
GTGCGTGTCA 
TTCAATTCCG 
AGCAGTGCCA 
CACGAAGCAG 
CCGCTCAATG 
TACGCAGAGT 
GACGGACAGG 
CAGGCCAAAC 
CTTATGATCA 
TACCCCTATG 
TTTGCTCGCC 
CATGTGCGTG 
GATG GCTACA 
CAGATTACTA 
TACAATAACA 
ATCTTCGTCT 
CAATATTTGC 
ATCCTGCCTT 
CATTATGTCA 
GGCAAAACCC 
CAGGGATTCT 



TAGAAAAGAC 
TAGCCTCTGT 
TTACAAGCAT 
TGCATGCCAA 
ACACCTGGTT 
GTGCCCGTTT 
GCGTTCCTCA 
TCTACGATCA 
TAGACAACGC 
AGGGTATTGC 
GCCGAGGCTA 
TGCGCGACAA 
ACGAAGATAT 
TCCCGATTAT 
ATGGATACAA 
CTGCACTCCT 
GTTGTGAGAA 
ACTATATGCC 
CTACTCAGCC 
GGACAGCTCT 
GTCTGGACAA 
CCGTTTCTTT 
AAAAGGTAAG 
AG GT ACT GCA 
ATGATGGTAA 
ACACGAAGCA 
CTCTGATGCT 
TGGGGTCTGT 
GTGCAGGGAT 
ACCTTTCTTA 



TAAAAGTATT 
GGCTTTTGCG 
GCAATCCTCC 
TTTCAGTATT 
CGGCAACTCC 
CGAGGATCTC 
CATGTATGTG 
GTTCGGTAGC 
GGTTCGCGGC 
AGGACAGCAG 
CCTACTGGGT 
TGACTATCAT 
ATTTGTGGGT 
GGGCCTGCGC 
ATACAACGAT 
CTCTGCCTCA 
CTTTGCTTTC 
GGCTTTTACC 
TC AGG GAG AA 
CGGTGGACGC 
AAAGATGCTC 
CTTCGGCATG 
CCCAGGATTC 
CGGTGCAGCC 
GTATAAGCTG 
GGATCAGGGT 
TTCCCTCTCG 
CACCTATACT 
GAACTGCTCG 
TAGCACCAAT 



(2) INFORMATION FOR SEQ ID NO: 107 

U) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 756 base pairs 
<B> TYPE; nucleic acid 
(C) STRANDEDNESS: double 
(D> TOPOLOGY: circular 

i^3.) MOLECULE TYPE: DNA (genomic:) 

Uil) HYPOTHETICAL: HO 

liv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1803 
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(A) ORGA1IIS1): PORYPHYROMOWAS GIIJGIVALIS 

<ix> FEATURE: 

(A) NAME/ KEY ; misc featurc 
5 ( B ) LOCATION 1...7 56 

(Kl) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

AGGGGGTCGA GTTCGG GCAT TTCAGCCCGT GGACGCGACA TGCGTTCTTT ATTTTTGAGC 60 

10 GCGTTGCGCA GCTCCTCTCT CCATGGTTCA GAGCGACGCA GTCGGATAAG TTCTTCTGTA 120 

GTCATGTCAA TAAGGCAGAA AATAAGGTTA TTCCATCTCT CGGTATGCGC CCAAACGCAT 180 

GATCATCTCA TCGAAATCCA CTTGGTGTGC AT CGAATTCG GGGCCATCGA CACAGACGAA 240 

1TTCGTCTGT CCTCCCACGC TTATACGACA AGCCCCACAC ATACCGGTGC CATCCACCAT 300 

AATTGTATTG AGAGAAGCTA TGGTCGGTAT CTCGT AACGT TTGGTCAGGA GAGAAACGAA 360 

15 CTTCATCATC ACAGCCGGCC CGATCGTAAC GCAGAGGTCT ACCGTTTCCC GTTTGATAAC 420 

GCTTTCCACT CCATCCGTTA CGAGGCCTTT CGTCCCATAA GACCCATCGT CTGTCATGAT 4 80 

GATCACTTCA TCGCTATTGG CTCGCATTTG TTCTTCAAGG ATAACCAGAT CTTTAGTTCT 540 

GGCAGCCAAT ACGACAATTA CACCGTTGCC TGCTTTGTGG AAAGCCTCCA CGATCGGGAG 600 

CAAAGGAGCC ACACCCACAC CGCCTCCGGC ACAAACCACT GTGCCGACCT TTTCGATATG 660 

20 CGTACTCTGT CCCAGCGGAC CTACCACATC CGTGATATAG TCGCCGACTT CGAGTTCGGC 720 

CAATTTCTTG GAAGATTTGC CCACGGCCTG AACCAC 756 



25 



(2) IUFORIIATION FOR SEQ ID WO: 108 



(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2370 base pairs 
IR) TYPE: nucleic acid 
tC) STRAW DEDIIESS ; double 
30 ID) TOPOLOGY: circular 



35 



40 



(li) MOLECULE TYPE: DMA (genomic) 

(iix) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIYALIS 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 
<B> LOCATION 1..-2370 

45 txii SEQUENCE DESCRIPTION: SEQ ID NO: 3 08 

CTCTTGTTTT CTTCTCCTTT ACCCCGAATG GATCGTCCTA AGCCTTCATA TAT TGTTCGA 60 

ATAGCAGCCA TTCTCTGCTT GTTTGTCGGC AGGCCTTTGT TTGCGCAGAG CTATGTGGAC 120 

TACGTCGATC CGCTGATCGG GACGCTAAGT TCTTTTGAGC TGAGTGCGGG CAATACCTAT 180 

50 CCGGTGATCG GTTTACCGTG GGGAATGAAT AGCTGGACAC CGATGACCGG TGTACCCGGT 240 

GACGGCTGGC AATATACCTA CTCGGCACAC AAGATTCGCG GATTCAAACA GACCCACCAA 300 

CCCAGTCCTT GGATCAACGA CTACGGCCAA TTCTCCCTTC TTCCCCTTAC GGCACCGCAG 360 

AAGCCATCAT CGAACGACTC CATAGCTCTG ACTAAATGGT GCAAGCAACT CTTTTCGGAC 420 

GAACAGACCT CGTGGTTCTC GCACAAAGCG GAGACGGCGA CGCCATACTA TTATAGTGTC 480 

55 TAT'VTGGCCG ATTACGACAC ACGCGTGGAG ATGGCTCCGA CCGAGCGTGC AGCTATCTTT 54 0 

CGCATACGTT ATTCCGGCAA TACCGAAAGT GGCTCCGGTC GATGGCTTCG TCTTGATGCC 600 

TTTACCGGTG GTTCGGAGAT TAGCATCGTG GATCCTCACA CCGTAGTGGG CATATCTCGC 660 

AAGAATAGCG GAGGTGTGCC GGCTAACTTC GCCTGTTATT TCATCCTGCA GTCCGATACT 720 

CCTATGGCCG ATGTCCTGCT T GAG ACA GAT ACCGGCAAGT CAGACGAAGG CACAAGGGCA 780 

60 TGGGCAGCCT GTCGCTTCGA TTCGCAAGAA G'JTACCGTCC GGGTGGCATC TTCTTTTATC 84 0 

AGTGTCGAGC AGGCCGAAAG AAATCTTGCG GAAGTCAAAG GGCAGAGTTT CGACCGGATC 900 

AGACTTGCCG GTCGCGAAGC TTGGAATAAG GTGCTCGGAC GCATACATGT GGAAGGAGGA 960 

ACGAAGGATG AGCGCACTAC ATTCTATTCC GCACTCTATC GCTGTCTGCT TTTTCOCCGT 1020 

CGCTTCTATG AGGAGGATGC TTCCGGCAAT TTTGTGCATT ACAGCCCCTA CAATGGAGAG 1080 

65 GTACTTCCCG GTTATCTCTA TACCGATACC GGATTTTGGG ACACTTTTCG AGCCCTTTTC 114 0 

CCCCTGCTCA ATCTGCTGTA TCCCGATGAA AACATTAAAA TTCAGGAAGG TCTGCTGAAT 1200 

GTATATCGCG AGAGTGGCTT TTTCCCCGAA TGGGCCACTC CGGGCCATCG GGATTGTATG 1260 

ATAGGCAACA ACTCTGCTTC TGTTCTGGCG GATGCCTACC TCAAGGGTGT TCGGGTAGAA 1320 

CATACCCGTA CACTGATGAA CGGACTCTTG CATGCTACGA AAGCCGTCCA TCCGAAAATC 1380 

70 TCCTCCACGG GTCGCAAAGG TTGGGAGTGG TACAACTCCT TAGGTTATGT TCCGGCTGAT 1440 

GCAGGCATCG ACGAAAGTGC TGCCCGTACG CTCGAATATG CTTATAACGA TTGGTGCATC 1500 

CTCCGACTGG GGCGCACATT GGGTTGGGAT AGAGCTGCAT TGGACACGTT GGCTCATCGT 1560 

TCGATGAACT AT CGTCATCT GTTCGATCCG GAAACCAAAC TCATGCGCGG TAGAAATCAG 1620 

GATGGTAGTT TCCGGACACC TTTTTCCCCT TTCAAATGGG GAGATGTATT CACGGAGGGC 1680 

75 AATGCCTGGC ACTACACTTG GTCGGTCTTT CATGATGTGC AGGGGCTTAT CGACCTGATU 174 0 
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GGAGGAGATC GCCCGTTCGT GTCTATGCTC GATTCGGTAT TCAATACTCC TCCTATGTTC 1800 

GATGAGAGCT ATTACGGATT TGTCATCCAC GAAATCAGAG AGATGCAAAT AG CGG ATATG I860 

GGCAATTATG CTCATGGCAA TCAACCCATA CAGCATATGA TAT AT CT GT A TAATCATGCC 1920 

GGTCATCCAT GGAAAGCTCA GGAGAGACTA CGCGAAG'J'GA TGGGGCGGCT CTATCGTCCT 1980 

5 ACTCCGGATG GGTATTGCGG CGATGAAGAC AACGGACAGA CTTCGGCTTG GTACGTTTTC 204 0 

TCTGCTTTAG GCTTCTATCC TGTTACACCC GCTACGGATC AGTATGTGCT CGGTTCGCCG 2100 

ATTTTTTCCA AGGTAATACT CTCTTTTCCC GACGGACACA AAACGGTGTT GCATGCTCCG 2160 

GCCAACAGTG CCGATACGCC TTACATCCGC TCGATCAGCG TAGAAGGAAA AGAATGGAGC Z220 

TGCAATTACC TGACTCACGA ACAGCTTCGC TCTTCTGCAT CCATTCAATG GAT G AT GG AC 2280 

10 ACGAAACCCA AT TATAATCG TGGTATGAAG GAAAGTGACA GACCTTATTC CTTCTCCACG 234 0 

GAGCAACAGC GTCGCGCTAA TCACAGTAAT 2370 



15 



25 



30 



55 



65 



70 



12) 1 1 1 FORMAT I OH FOR SF.Q ID NO: 109 



(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 856 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDMESS: double 
20 { D) TOPOLOGY; circular 



( i i 1 MOLECULE TY PE : PHA t genomi c ) 

(ill J HYPOTHETICAL: NO 

liv) ANTI-SENSE : HO 

(vli ORIGINAL SOURCE: 

tA) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAME/KEY: mi sc_f eature 

(B) LOCATION 1. . .858 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

ATTTGTGGCA GTAAAATGAA TCTATCGGGA TTACAATCTT TCACTATGAT GAAATCCATG 60 

CGCAGCGTGC TGTTGCTACT CTTTCCATTG TCTTTGATCA CTGCTTTGGG CTGTAGCAAT 120 

AACAAAGCTG CCGAATCGAA GTCTGTCTCT TTCGATTCGG CCTATCTCGA ACGCTACATC 180. 

40 CCTCTGCGGG CAGACATAGA TACGCCATCG CTGCATGTGA TGATCACCTA CGT CT ATCCT 240 

TCGGGAGATG ATATGCTCAC AGAGATTTTC AACGGTTTGC TCTTCGGCGA CAGCCTGATG 300 

GATTCCTCTT CCCCGGAGAA TGCCATGGAA GGCTATGCAC AGATGCTGGG AGAAGACTAT 360 

CGCTCTAACA ATGCCGAAGC CAATCTGCAA GGGCTTCCTT CTGACCTTTT GGACTATATC 4 20 

TACAAGCAGG AAAATACCAT CGCTTATTGC GATACGGGAT TGATCTCCAC GCGCATCAAT 4 80 

45 ACATATACTT ACGAAGGCGG TGCACATACG GAGAATAGAG TCCGGTTTGC CAACATCCTT 54 0 

CGCACCACCG GCAAGGTGCT CGAAGAGCGA GATATATTCA AGATCGACTA TGCGGAAAGG 600 

CTGTCCGCAC TCATCATAGG ACAATTGGTG CACGATTTCG GCAAGACCAC ACCTGCCGAA 660 

TTGGATGCAA TAGGTTTCTT CAACGCAGAA GAAATACAGC CCAATGGCAA TTTTATGATC 720 

GAT G ACAAAG _GTCTCACATA CTGTTTCAAT GAGTATCAGA TAGCTGCTTA TGCCAGAGGT 780 

50 GCTGTCTATG TCCGTCTCGG ATATGACGTA TTG'JCTCCTT TGCTAAGGGA TGATTCCCCA 84 0 

CTAAAGCGTT ACTTGCCG 858 



(2) INFORMATION FOR SEQ ID HO: 110 



UJ SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS; double 
60 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE; DNA (qenomic) 

(iii) HYPOTHETICAL: NO 

(iv) AIITI-SEHSE: HO 

tvl) ORIGINAL SOURCE; 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



tlx) FEATURE: 

(A) NAME/KEY: mi scf eature 
IB) LOCATION 1. . . 1134 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 
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GGGATAATAT CTGTTCTTTC ACATGTGGTT 
GCTGCTGGCA CGCATAGTCT GTATTCGATT 
AGGTCTATGG GTATTATCTC AGCCTGCCGT 
5 AGCGGTCTGT CGAGGGCAAA TGTAGGTATT 
AAAAAGAGAA AACTGCAAGA AAAAAATGTT 
AATAATTTGA TACCTTCGCT CCCATTTAGA 
AAAAAAACTA CTTTGACAGG ATCGATATGT 
AATCCCCAAT CGAAGTTAAA GATCAAGAGC 

10 ACGGCCGGAA ATGGTTTTGG TGGCAATATC 
GTACACCACA GCATTCTGCC CGAAGGGTTG 
GCACACTGGG AAGAGGCCGG AAGTCCGAAG 
GGTATTGTTG AAAAGGTAAT AGTCTTCGAA 
CTCGGCCGCT TCCTCATCAG TTTGGGGATA 

15 GGATGGACCA ATGACGCCCA TATTCTTTTC 
GACACCTACA CAAGAGCTGG CAGTGACCTT 
AGCAATGGCT TTTCACCGGG G AT CGGACTC 
AAGAGCAAAT ATGATTTTCG CCTCGGTTTC 
TATCCGTATC GTAATTTCAA GCTGGATGGA 

20 ATGAACCACA TCGGCCATGT GGGCTTCAAC 



GACCGTCCAC AATGGGGTGC TTCTCCCGAA 60 

CTACATCCCT CCGCCGGTAT TATCCGGATC 120 

ATCGCAATTC TTGCCGGCAA GCACCCCCGT 180 

CTTTCGTACA ATCCAGAAAA CACGCCCGAG 240 

TTCCTCCAAA TCCGGCTCCG TCAATCATTT 300 

ATCGATAACA CAAAAAAAAT CACTGAAATG 360 

GCTTTACTCC TGTTTTTGGG TCTCTCGGCC 420 

ATTGAGGCAG CTACCACTTT CAGTTCGGCC 480 

TTCGGCATGG ACATGAGCAT ACGGATGAGG 54 0 

GATTTTTCGG TAGGAATACA TGAAAGAAGA 600 

CTCATGTATA CGAATGTCCC AAGTATCATT 660 

GACGCAGAAG ACTTTTTTGA CAAAAAAGCT 720 

TCCTATACCA AGCATCTGGG AGCGTATTGG 780 

TCACCGATAC CCAAGAGCAA GGTCCACTAT 84 Q 

GTACTTCAGT CCGAAGATGT TCCCACAGTG 900 

AAAAGTTCTA TTTGGTGGAA AATGCCCATC 960 

AGCCTGGGCT ATGAGTATCT GAACCTGCTA 1020 

AATAAGCCGC TTTCAGCACT ATCTCCTCGC 1080 

TTTACCGTGG GTCTTTGGAC TAAT 1134 



(2) 1MFORI1ATIOK FOR SEQ ID MO; 111 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3807 base pairs 

(B) TYPE: nucleic acid 
(C> 5TRAHDEDNESS : double 
(U> TOPOLOGY: circular 

30 

(ii) t lOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAL: NO 
35 (Iv) ANTI-SENSE: NO 



(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 

40 tlx) FEATURE: 

(A) NAME /KEY : miscf eatura 
(D) LOCATION 1 . . . 380"? 



(Xi) SEQUENCE DESCRIPTION: SI 

45 

GTAAAATGTA TGGGCAAGTA TAAAAGAGCT 
TCGGATTATT ATACCTTTGA GGGAGTTACT 
AAGAAACCCC AAGCCATTTT AGACTTAGAG 
TCCTCACAAG AAGGGATAAG CTGCTCGGTA 

50 GCAGTCGTTG AGCTTTGCTT GCGAGAATGT 
TTTCCTGCTC TAAAAAAGCT TGATCTATCG 
GAACGTCTTA CTTCGTTAAC AAAACTTCGT 
GGCCTGGATA GTCTCACCTC GCTAACAAAA 
CTAGAGGGTC TGGAACGTCT CACCTCGTTA 

55 AGTAAACTAG AGGGTCTGGA ACGTCTCACG 
CAAATCCGTA AGCTGGAGGG TCTGGAACGT 
GGTAACCAAA TCCGTAAGCT AGAGGGTCTG 
CTAAGAAGTA ACCAAATCAG TAAGCTAGAG 
CTTGAACTAT CGGGTAACCA AATCCGTAAG 

60 GCAACGCTTG AACTGTCGGG TAACCAAATC 
TCGTTAACAA AGCTTCGTCT AAGAAGTAAC 
CTCACCTCGC TAACAAAACT TTCTCTCTCC 
GAACGTCTCA CCTCGTTAGC GGAGCTTTAT 
GGCCTGGAAC GTCTCACCTC GTTAACAAAG 

65 CTAGAGGGCC TGGATAGTCT CACCTCGCTA 
AGTAAACTAG AGGGCCTGGA ACGTCTCACG 
CAAATCCGTA AGCTGGAGGG TCTTGATGGT 
OSCAACCAAA TCAGTAAGCT GGAAGGACTA 
GTTTCGGGCA ATGATATTCA AT CT ATT GAT 

70 CAAACTTTAG AAAAftGVGAQ AATCCATGAC 
CTCTCTCCTT ATGATAATCA TTTGCCGGAG 
AAACAGAAAA AGACTTCAGT TGAATATCAC 
CATTCTTCGG GTAAAACAAC ATTTCTTAGT 
AAT ACACAT G TGTTGTCGAT ACATCGAAGC 

75 GGGGGACAGG ACTATTATCA TGGGATTTAC 



'.Q ID HO: 111 

AAGTACCGCT ATTGGCTTTT TCCTTTCTGT 60 

TTTTTATGCG CATCTGACGA TATGACAACC 120 

AAGGCCTATA ACATTGAAAT TCCTGATCTC 180 

AATAGATATT TCAAGCAAGA TTCCTCCGGT 2 JO 

CAGATAGAAA GCATGACTTG GCTTATTGAT 300 

TATAACCAAA TCAGTAAGCT AGAGGGTCTA 360 

CTAAGAAGTA ACCAAATCCG TAAACTAGAG 4 20 

CTTTCTCTCT CCGATAACCA AATCAGTAAG 480 

GCGGAGCTTT ATCTTTTGGA TAACCAAATC 540 

TCCTTAGCAA CGCTTGAACT ATCGGGTAAC 600 

CTCACGTCCT TAGCAACGCT TGAACTATCG . 660 

GAACGTCTCA CTTCGTTAAC AAAGCTTCGT 720 

GGTCTGGAAC GTCTCACGTC CTTAGCAACG 780 

CTGGAGGGTC TGGAACGTCT CACCTCCTTA 84 0 

AGTAAGCTAG AGGGTCTGGA ACGTCTCTCT 900 

CAGATCAGTA AACTAGAGGG CCTGGAACGT 960 

GATAACCAAA TCAGTAAGCT AGAGGGTCTG 1020 

CTTTTGGATA ACCAAATCCG TAAGCTGGAG 1080 

CTTCGTCTAA GAAGTAACCA AATCAGTAAA 114 C 

ACAAAACTTT CTCTCTCCGA TAACCAAATC 1200 

TCCTTAGCGG AGCTTTATCT TTTGGATAAC 1260 

CTTGCTTCCT TAACAAGGCT TAGTCTAAGG 1320 

GACAGACTAA AGGTTTTGAG AAAACTTGAT 1300 

GATATTAAGC TAX'TGGCTCC GATTCTGGAG 14 40 

AATCCATTTG TTGCATCATC AGGCTTGATA 1500 

ATTAAAGCTC TTCTTGAAAA AGAAAAAGAA 1560 

CCATTTTGCA AAGTAATGCT ATTGGGAAAT 1620 

CAATACGATA CAAATTATAC GTATCAGAAA 1680 

AATAACCCTA ATGCGATCTT TTACGACTTT 17 40 

CAAGCCTTTT TTACCACCCA ATCGTTATAC 18 00 
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CTTCTCTTTT GGGATGCTAA GAAGGATCGA AACTTTGTGA GCGTAGATGA TAAAGAATAT 10 60 

CAGACTCTTA ATTTCAATCG CCCCTATTGG TTAGGACAGA TAGCCTATGC CTGCAATCGT 1920 

TGTATGTCCG TTGGAGGAAA TCCTGATGGC AAGGACACAC CACAGACCAC AGACGATACA 1980 

AT TAT CAT TC AGACTCATGC CGATGAAACG GGCGCTAAGC AGCAAACCTT AGGCTGTGCA 204 0 

5 GCCGAGAATG GAGTATTGGA AGAAATCTAT GTATCCTTAG AGCCCAAGGC GAATAGTGCC 2100 

GTACATGCGC TCAACT AT CT GAATGAGCGG GTGCGAGAAG TTGTCGCAAG CAGGAGTAAA 21 60 

TCAATTCAGA TCACAGAAAA AGATAAGGGA TTGTACGAAG CTCTTCCCAC AATCGCCGGT 2220 

GATAATAAAC ACATCCCTAT CTCTCTCGAA GCTCTTGCGG CTCAATTGAA TAAGGGAAGA 2280 

GCTGAAAATG ATCTTTACAC CAT AGAGT AT CTACAGACCG AATTGAACCA GCTTAGTCTG 234 0 

10 CGAGGGGAGG TGCTTTACTA TCGTGAGAAT GAGAAGCTGA ACAATTATGT CTGGTTAGAT 24 00 

CCGGCAGCTT TTGTCCAAAT GATTCATGGA GAAATCCTCC AAAAAGACAA CATCAATAGA 24 60 

GGAACAGTTC CTAAAGACAT TTTTGAATGC AAACT G CAT A ATCTAAGTTC CGGAAGTATA 2520 

TTTGAAGAAG ATGGCCAAAA TGGTAATATG ATCTTGCAGC TATTATTGGA AGAGCTGATC 23 80 

GTATATGAAG ATAAGGACTG CTATGTGATA CCGGGCTATC TCCCTTTGCA TTCCGATGAC 26-10 

15 GAAGCCTATA AATGGCTTAC TTTGGGATTC GAGAGGCCCA ATTTTGTCCT CAAATTCGAA 2700 

CGTTTTATCC CCTTTGGCCT GATCAACCAG ATTATAGCCT ACTATGGCCG GGAAGAAGGT 2760 

GCTCTAAAGC GGTATTGGCG AGATCAGGTC ATCTTCACAG CAGGCCGTGA GATGGATAGG 2820 

CAAACGCTTG AGCAAGAAGA AGAGAAAGAG GGTTTGCCCA AGACGAATGC CGAGGATTAT 2880 

CAGATCTGGA TCAAGCT'CGA CTTTACCGAC TTGGCCATAT CCGTATTCAT CAAAGAGCAG 294 0 

20 AGAAAGACAT CAGCTAAGGA TATGCAGCGG AAAGAGGCTA CTATCCTCAG TGATATGTTG 3000 

GATATGTATT GGAACAATAT CCCTCCGAGG GAGCAAATAG GAGATAAGGA TACGGAGCAA 3060 

ACGAGAAGCA CTATTCGTGA AACAAACAGA AAGAAGAGAC CCATCCAGGA TCTCTACCTC 3120 

TCCTGTGCCC AAGCGGATAA AGATTTGACG GAGTCTCATT AT AT CCATTT GGGCACGCTG 318 0 

GACGATGAAA GCAAGACTAC GGCGAGGATT GCAGCCTATC CGTTGAAGAA CGGCGTTATC 3 2 -JO 

25 GATAAAGAGC GGGTGCGAGA AGTATCGACT CGTCCCTACA AACATCTTTC CGTCAATAAA 3300 

AATCTGGCTA CTGCAAAACA GATCTTTATT TCCTATTCCA AAGAGGATCA GACTGAACTG 3360 

GAGACCTGTC TGCAATTTTT CAAACCCTTG GAGAAGAATG GTCAGATCGA GAT CT ACT AT 34 20 

GATAAGTTGA CTAAGTTTGA AACACCTATT CACCCTGAAA TAACAAAGCG TATTGTCGAA 34 80 

GCCGACTGTA TAATCGCTTT GAT CAGCCAA CGCTATCTGG CCACGGATTA CATCCTGGAT 3z>4 0 

30 CATGAGTTGC CTGTATTTCG GGAGTATAAC AAGACCATAG TGCCGATATT GATCAAGCCT 3600 

TGTACATTCG AAGACGATGA GTTCCTTCGG GAGAAATATT TTGCTCAGAA AGCTCAAATA 3660 

ATCAATCTTG GAAAAGAGGG AAAAACCATT AAAGCTTATG ATAGTATTAC GGCATCAGCC 3720 

CATCGTGATG AAAATTGGGT GGCAGTAGTC AGAGAGTTCA AAGAGAAGAT ATTAAGAATA 3780 

ACAAAACAGG AGGTAAATAC AGATGAA 380" 



35 



50 



[2] INFORMATION FOR SEO. ID NO; 112 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 693 base pairs 

(B) TYPE : nucleic acid 

(C) STRAWDEDMESS : double 

(D) TOPOLOGY: circular 

45 t ii J MOLECULE TYPE: DNA {genomic ) 

UiiJ HYPOTHETICAL: NO 
(ivl AtJTI-SEtJSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGAJUSH: PORY PHYROMOHAS GINGIVALIS 



(ix) FEATURE: 
55 <A) NAME/KEY: misc_f eature 

(B ) LOCATIOH 1 ... 693 

Ui> SEQUENCE DESCRIPTION: SEQ ID HO: 13 2 

60 AAGTTTATGA TGAAAAAAGC ATTTGTTTTC GTACTACTGG TTTGCCTATT CTCCTCGTTC 60 

AGCAGTTCCG CCCAAACAAC GACGAACAGT AGCCGGAGTT ATTTTACAGG ACGAATCGAG 120 

AAGGTGAGTT TGAACTTAGG GGTCCCCCCC GTAAGCACAG AGGTTTGGGG AATGACCCAT 18C 

GATGCGAACG GTCTCCCTTT CGAAATACCT ATCTCTTTCA GTCGTTTCAA CAGCCAGGGA 24 0 

GATATAGCTA CCACTTATTA CAT AGCGAAT AGCGAGGCAA CTTTGAATGA ATGGTGCGAC 300 

65 TATGCACACC CGGGCGGCAT CGTGAGGGTA GAAGGTCGIT TTTGGAAAAT GACTTACAAC 360 

ATACCAACCT ACAATGCAGT CTGCACCCGG ATTACATTCG AAAATCAAGA AATAGAAGGA 4 20 

ACGATCGTCT TGATACCCAA GCCCAAAGTC TCGCTGCCTC ATGTGTCGGA ATCGGTGCCT 4 80 

TGCATCCGAA CCGAA GCCGG GAGGGAATTT ATCCTTTGCG AAGAAGACGA CACOTTTGTG 54 0 

TCTCACGATG GTAACGAAGT AACGATAGGC GGTAAACC1T TCTTGCTCAA TACCAACGTA 600 

79 AAGATTGTGG GGGACGTATC TCAAAAGTAT GCCGTGGGGG TAGGAGAAAT TCGATTCCTG 660 

CAGATTTGTG CCCAAACAGT ATCACAACAA AAA 693 
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{2) I M FORMAT I Ot i FOR SEQ ID MO: 113 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pair*; 
IS) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
<D) TOPOLOGY: circular 

<ii) MOLECULE TYPE: DNA (genomic) 

(111) HYPOTHETICAL: NO 

(iv> ANTI -SENSE: NO 



15 



20 



25 



30 
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40 
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60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PHY ROMOMAS GINGIVALIS 
fix) FEATURE: 

(A) NAME/ KEY: mlsc featur- 

(B) LOCATION 1...1689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 



TCGCTAATCA 
ACAAAAACTA 
ACCCTCTGTC 
TACAATCAGG 
GTAATGACCC 
GACCAAGGTG 
CAAGACTTCG 
GAGCTCATGA 
AACGCGCAGA 
GTGGATATAG 
GCTTTGGCTT 
GTGTTCTCTC 
GAGAAGAAAA 
AATGCCTGGC 
ATCGGTTTCT 
AAAGTGAGTG 
AACAATACGA 
TCTGAATATT 
AAAGGAAAAA 
AGTGAGACCA 
GCCAAAAAAG 
ATTCATAACA 
TACACACCTC 
TATCTGCCGG 
GAAGACATCG 
GCTGTGATTA 
AGAGTAGTCG 
GCTAAGGGTA 
ATCGTGGAA 



ACAATTCAAG 
TGAAGACAAA 
CTCTCGCCCA 
AAAAACTGGT 
ACAGTGGATA 
CCACATACCA 
ATATCGTGGT 
ATAAGCCCGG 
ATGCGAAACT 
CCTCCAACTA 
TCGCTTACAC 
TGAATGGAGG 
TTGACAAGGT 
CGCTAATGGG 
TGTCGAACTT 
AAAGCGACAT 
TCAATGGGGA 
CGGATTGGGA 
CTCCGACTAT 
ACTCGGGGCT 
AAGAGAACGG 
AAGATTTGTG 
AAGTAGACAT 
GCAAACGGAT 
TAATGCAAGA 
GCCTGCCGAC 
CTGAGCCTTC 
CG T AC AT ACT 



AGACTGGAGG 
AGTTTTACGC 
AGCGCAGACG 
ACAAACGAGG 
CGACACCGGC 
AAAGTTGAGG 
AACGGGTAAG 
AGGATATAAG 
CGTGTATAAG 
TCGTTCGCCT 
CGGC1TCAAC 
GCAAAATTTC 
GGATCTCTCA 
TGTGGTATTC 
TGTCGACAAT 
GTCGTTCAGC 
GAGTTGCCAC 
CATTCGGTAT 
GGATGATCTG 
GGGGTATGAC 
TACGAACACG 
GAGCGACACA 
CAATCCGACC 
CCTTTGGTCT 
AGGCAGCATG 
GGCAGCAAAC 
TTTCTCCGGC 
CAAGGTCGTA 



GCTCCTGTCC 
AAATTCGTGG 
ATGGGAGGAG 
ATGAGTG TGG 
AATAGCAATG 
GATTGGGATC 
AATGAATCCG 
AGTAGAGTTG 
CAAGACTTCT 
TCTTCTCTTA 
AATACGCACA 
AATAAAAACT 
TTGGGTAGCA 
GAAATGAATA 
GATCCCGAAT 
CCCAAAATCC 
AACTTCATGA 
GTATATCCCA 
GTGGAAGCTT 
AAGAACGCCA 
CTGAAATACC 
TTTACGTATA 
AAGGGTCTCG 
GATACGCAGT 
AAGCTCTACC 
TGCAAGGCTG 
AACGAATACA 
TCCGATACGG 



GATTGAGTCT 
TGGCGGCTTT 
AT GAT GT CAA 
CGGACAACGG 
TGAAGATCTT 
CATCGGATGA 
ACATCAAGAT 
CGGTCTTCAG 
CCAATGTGCA 
ACAATGGTGG 
AAATAAGTTT 
TACTCTTCAG 
CCTCTGAATC 
AACAAGGGGG 
TTCAGTGGTC 
AAATGTTGCT 
TTACGTACAG 
AGAAATCGTT 
TCCTTACAGC 
ATCACTACCT 
GCTGGGCCAA 
CATCATCTGC 
TGTGCTGGTC 
GGACCCATGC 
CGAATCCGGC 
TTGTTT ACG A 
GGCTGAACGT 
AGCGTTTCGT 



CAAAAAAAAG 
CGCCGTCGCA 
GGTGGTCCAG 
ATGGATCTAT 
CCGCTCCAAA 
TTATCAGTTT 
TTGGTCGGTA 
TCGCGATGCC 
GTTGTACGAT 
CAACCCTTTT 
TGTGGACTAT 
TCAAGATGGA 
CATGGGTCAC 
AAAAAGCGAT 
AGGCCCTATA 
GGACGAGGAT 
CGATTATGAT 
CAAGTATGAA 
TTCGTACCAG 
GATTACATAT 
TTATGACAAG 
CAATGCTCTC 
ATGGGTGGAA 
CAACGGTGTA 
TCAAGAATAT 
TATGCAGGGC 
GCAGCACTTG 
AGAGAAGCTC 



(2) INFORMATION FOR SEQ ID NO: 114 

(i> SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 2350 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : circular 

Ui> MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ivj ANT I -SENSE : NO 

(vi) ORIGINAL SOURCE : - 

(A) ORGAN J SM : PORYPHYROMOUAS GINGIVALIS 
(ix) FEATURE: 

(A) NAME/ KEY : mlac feature 

(BJ LOCATION 1. . ,2358 

(y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 114 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1689 
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ATCGTTTACC 
AACTTCCTCC 
GTCTCTTCGG 
GGTACCACGA 
GTCAGCGGAT 
TCGGCCGACA 
CCCGAACTGA 
TGCTATGGCT 
GACAAATACG 
GCTTTTCCCG 
TCGGGAGCCT 
ACGTTCTGCT 
CGACCGATCT 
GATTCGACTG 
GACATCGGTG 
GCCGTTTCAT 
GTGTTCAAAA 
ATAGAGCGTT 
AAGACGATAT 
GCCAATGGAG 
TACACCGTGT 
TATCCGGAGA 
TCGCTGCCGA 
CTTATCGTCA 
TATGAAGCCA 
AACGACGACT 
TCGCATGCGC 
GGG CAT AAAG 
AATAAGGAGT 
GATCCCAAAC 
AACTATGCCT 
AAGGTTTTTT 
GAGCCGGACA 
CACAAATTGA 
GGTAATGACG 
CCTGACTGTC 
ATTCGGCTCA 
AATCAACCGA 
CATGGAACAA 
CTAAGCAATC 



TTTGTCACTG 
TCTTTTGTTG 
GTAGACGACT 
ATCCGGGAGC 
CGACAGAGAA 
ATAGTTACTG 
GTGGAATACT 
CTTCGCTCAG 
GTATAGATGC 
AAGGAGAAGG 
CTGTTCGATT 
ACAATCCGCA 
CTGCCGGCTA 
CAGGGAAATA 
TCCGATTCTC 
TCGTCAGCGA 
GTGTCGGAGG 
GGGAGGAAGC 
TCTATACCGC 
AATATCCTAT 
TCTCTCTTTG 
AGCAGTTGGA 
GATGGGAGCT 
TCAATGACAC 
TGAAAAAAAA 
ATCTCACCCT 
TGGAATACTA 
AAGACGCAGC 
ATGGTATGCT 
AGGGTGAAAA 
TTTTCGTTCC 
CGGAAAGGTT 
TCGCCTATCC 
CCCGGGAGTT 
ATGCCGGTAC 
CGGGCAGCCC 
ATCCGCAGTA 
CAGATTCCAT 
GGCATATCAG 
GTCCTCGA 



TATGAACCAC 
TCTGTCGCTG 
GACAGAATAT 
AGTATTGCCC 
TCGCTTCCAC 
CATCGGTTTC 
GCTGATGGCC 
TCGAGAATAT 
AGCCGTGACC 
CCATATCCTG 
CTTAAACGAC 
AGCAGTTTTT 
TTGGAAGAAG 
TAAGCAGTAC 
GTTCAACTGC 
AGCCAATGCG 
GAATCCGGCC 
CCTCGGTACG 
ACTCTATCAC 
GATGGGCAGT 
GGACACGTAC 
TATGGTACGG 
GTACGGACAG 
TTGGCAAAGG 
TGCTTCTTCG 
CGGCrTCGTA 
TCTGGCCGAC 
TCTATTCGGA 
GTGTCCATTG 
CTTCGAGCCT 
CCACGATATA 
GCAGAAAGTC 
TTACCTCTTC 
GATAGACAAA 
GATGAGTGCT 
CACCTATACA 
TTATCCTCAG 
TTACATCCAT 
CCATGCCGAT 



AGACGATCAA 
ATAGCGTTTG 
GTCAATCCCT 
AATGGGTTGA 
AAAGATTCGC 
AGCCATGTCA 
ACTTCCGGCA 
GCGCGCCCGG 
GTAACCGAGC 
CTGAACCTGG 
TCCACAGTCG 
CGTCAGTATT 
CAGCCTCCTA 
GACGGCTACA 
GATCAGGGGG 
CTCTATAATC 
AAGGCTTTCT 
GTCCAAGTGG 
CTGCTGATAC 
GGCAAAACGG 
CGCAATGTAC 
ACACTGATCG 
GAGACCCTGA 
GGCCTTCGTG 
GCAGGAGCGA 
CCGCTTCGCG 
TGOAATCTGT 
AAACGCTCGT 
CTGCCGGATG 
AATCCCGGTT 
CAAGGGCTTG 
TTCGATGAAG 
TCCTATTTCC 
CATTTTTGCA 
TGGCTTGTCT 
CTGACCTCGC 
GGGGAGTTGA 
ACGGTTTCTC 
TTGGTGCGCT 



AAACCATGCT 
CTGCCGATGC 
TTATCGGAAC 
TGAGCGTTAC 
GTTGGTGGAG 
ATCTGAGTGG 
CATTCGATCC 
GAGAATACAA 
GGACTGCTTT 
GACAGGCCCT 
TCGGCAGCAG 
TCGTACTTCA 
TGACAGTGGA 
AGCGTGAGAT 
AAAAGATCTA 
TGGAAGCGGA 
CCGCTATACG 
AAGGAGGCAC 
ATCCGAATfcT 
GTAATACGGC 
ACCCGCTGCT 
ACATGTACCG 
CGATGGAGGG 
CTTTCGATAC 
CCCATCCGAT 
AACAGTACGA 
CCCGGTTTGC 
TGGGCTACAG 
GATCATTCCT 
TCCACGAGGG 
CCCGGCTGAT 
GATATTATGA 
CCAAGGAAGC 
ATGCTCCTAA 
ATTCCATGCT 
CGGTATTCCC 
TCATTACGAC 
TTGGCAATAA 
GCGGTCACCT 



GACGAT CCGA 
ACAAAGCTCT 
CGCCAATTAC 
CCCTTTCAAT 
TGCGCCTTAT 
AGTAGGCTGT 
TGATTACTGC 
GGCTGTATTG 
GACCGAATTT 
AAGCAATGAA 
GCTGATGGGG 
GGTGAGTCGG 
AGCCCAATGG 
GAGCGGTGAT 
TGTACGATCG 
GCAAGAAGAG 
CTCTCGCGCT 
ACCGGATGAA 
CCTACAAGAT 
TCACGACCGC 
CTGCCTCCTC 
AGAGAGCGGG 
CGACCCCTCG 
GGCAACGGCC 
CCGTCCTGAC 
CAATTCCGTA 
CCACGCACTT 
ACACTATTAT 
CACTCCTTTC 
CAGTGCTTAT 
GGGAGGAGCA 
TCCGACCAAC 
ATGGCGAACG 
CGGCTTGCCC 
GGGATTCTAC 
CCGAGTTAGG 
CAATACAGAG 
AACACTTCCG 
CCGTTACGAA 



60 
120 
1B0 
2*10 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 

ieeo 

1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2358 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 115 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2442 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: circular 

<ii) MOLECULE TYPE: DNA [genomic) 
(Hi) HYPOTHETICAL: NO 
liv) ANTI-SENSE: NO 



60 



(Vl> ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROMONAS GIHGIVALTS 

( ix } FEATURE ; 

(A) NAME/ KEY : mi £ cofeature 

(B) LOCATION 1...244Z 



65 



70 



75 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



CCCCGTTTTC 
AAATCAAATC 
AAAATAAGAT 
CTCCAAGCTC 
GCCGTAGAAA 
AAAGAAGCTC 
GTAAGCAGCA 
AATATCGACA 
ATCGATCTGA 
GCAGGAGGAT 



ATCCCGGAGG 
AGTCATCGTG 
TCAGCCTCTT 
AAGAGGAAGG 
CGCCCAAGAA 
CCCATGAGGC 
TAGCCTACTC 
TCTTGGACGA 
TAGACAAAAC 
TCGGCCTCTC 



GACACACGCT 
CTTTGGCGGT 
GCAGGCTCTT 
TATTTGGAAT 
AGTCTTT GCC 
AAAGATCTTC 
CGAGCAACTA 
AGCAGGCCGT 
GCTCAATCGC 
CGTTCTGGAT 



CAGCTTTGCA 
GGCACAGCTC 
GTCGTCT6CT 
ACCCTCTTGG 
GTAGCCAACG 
G ACCGTAT C A 
AAATCCTTGG 
GTGACCAACG 
CTTTTGATCG 
GTCCCCGAAG 



GAAATGGGAG 
CGCGATTTTT 
TATTGTTCAC 
CTATCCACAA 
GAGTACTTTA 
GCGGACTCAG 
TCATATACTA 
TACCTGCATT 
TAGGCAACAG 
CTCGCATACC 



AAGAAACCAA 
TATTATGTGC 
CTCTTTTTCT 
GACGGAAAAA 
CTCGGTGGGC 
CGATACATCG 
TGCATCAGGC 
GAAAGACAAT 
GGCTTATTTG 
GGCTACCTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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GCCAACGGAA CTAAGGTGAC CGATGTGGCT AAGTTGGACA ATGATCGCT7 GCTGATGCTC 660 

AAAGAAGGGC AGCTCTTCAT CGGAAAAGAG ACCGATAACC TGCAAGATCC GGCCGCATGG 7 70 

ACAGCCTTGT CTTTGAATTT GCCGATGGGC TCGGTCACCG GTCTGGGCAT TGTCGGGGAA 780 

GACATCTGTT TCCTGCTCGC CGATGGCCGT GTATATGTCG CTGCAAACCA ATCGTTTGAG 84 0 

5 CCGGAGCTAT TGCTCTCTTC CTCCGCCGAT TCACGACTGT ATGTGACGGA TCGTGGTCTG 900 

TTCATCTGTG CCGAGAATCG AATTTATTTC ATAGAAAAAG GTCGCAAAAC GACACAATTT 960 

CCTATAGCCG ACGTCCTTGG TGTCGGTGCC ATGAACGAAA GCAATACGGC ATACATAGCA 1020 

TTGGGAGAAG AAGGTTTGGC TTCACTTCTT CTCGCAGAGG GAAGTACGCC CGAAGCCATG 1080 

CCTGTAGCAT TCGACGGACC GGGGGACAAT GATTTCTACG AGATGCGGTT TAGTCACGGA 114 0 

10 CGTCTGTATC CAGCCAGCGG ACTCT GGGGA ACAAACCTGA TGGGACATGC CGGTATGGTG 1200 

AAGCTATACG ACGGCAACCG ATGGACTAAC TTCGACAAGA AGACCGTACA GGAACAGTTG 1260 

GGCGGCGGAT TCAGTTTCAA TGATGCTATC GATATAGCTG TTTCCAACGG AGACCCCGAT 1320 

CACTTTTTTG TCGGTACATG GGGAAACGGT CTGTTCGAAT TCAAGGATGG CAAAGCGATA 1300 

GCTCGCTATT CGGGAAACGA AACT GCTATC GCAGAATGTA ATCCCGGAGA TGCCCGTGTG 1 4 «1 0 

15 AAAGCGATTG CCTTTGACAA TAAGGGCAAC CTCTGGGGGA CGCTCGGTGC CGTAGGCAAG 15 OQ 

AACATCTTCA TGTACGATCC GCAGAGT AG C ACATGGCATT CTTTCAGCTA TCCGGATGTA 1560 

GCCAATCTGG CCTCCTTCGG CAATATGATT ATCCTACCCA ACGGAGACAA ATGGGTAAAT 1620 

ATCCTTCACC GTAGTGGCGG ATCCACGCGC AAAGGTGTCT TGATCTTCAA CGATCGGGGT 1680 

ACACCGGAAA CGACTTCGGA CGACAGCCAT CTTTACGTCG AGCAGTTTGT CAATCGCCTC 17 4 0 

20 GGGGCAGCCA TAGGACATAA GACTATCTAT GCAATGGCCG TCGATCATAA CGGCTCTGTC 1800 

TGGATGGGAT CGGATATAGG CATTTTCGGC GTCTACAATG CAGCCGGAGT ATTCTCCTCG IB GO 

ACTTCTACCC CTATCGCTGT TCGGCCGGTC GGAGGAGAAG AACCCAATTT GTACTATGTG 1920 

CTGGACAAGG TGACGGTGAC AGACATCGTC GTGGACAAAC TCAAT CACAA ATGGGTTGCC 1980 

ACCCAAGGGA CAGGACTCTA TCTCCTTTCG GAAGATTGCA GTAAGATCCT CGCGCAATTT 204 0 

25 ACCGTAGAAA ACAGCCCTTT GCTTTCTAAC AACATACTAT CCCTGGCCTT AAATGACGAT 2100 

AACGGACTGC TGTACATCGG TACGGCGGAC GGACfGATGA CGTTCCAAAC GGGTACGGGG 2160 

AGTGGATCAG CTTCCGAACT GGACGGCGTC TATGTATACC CCAATCCGCT AAGGCCGGAA 2220 

TATCCCGATG GCGTCACCAT TGCCGGACTG CAAGCCGGCT GTAGTGTCAA AATCACCGAT 2280 

ACCACCGGCA GACTGCTATA CCACACTGAG AGCGTAACCA CCGAAGTCAA ATGGAATGCT 234 0 

30 CGAGGTGCCG ATGGCAATAG GGTAGGTTCG CGCGTATATG CCGTTGCAGT GTACGATCCO 24 OO 

GTATCGAAAA AGT CCA AACT AATTCGCTTC GCAGTGATTC CC 24 4 2 



35 



45 



50 



<2) INFORMATION FOR SEQ ID NO: 116 



(11 SEQUENCE CHARACTERISTICS: 

iA) LENGTH: 3486 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDHESS: double 
40 tD) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

<iii> HYPOTHETICAL: NO 

(iv) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY R0M0N AS GIKGIVALIS 



tlx) FEATURE: 

(A) NAME/ KEY : misc_f eature 
( B > LOCATIOM 1...3486 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 6 

GCTATTTCTC AGATGAAACG AATACTTCCA ATAGTCGCAT TCCTTTCTCT CTTCCTTGCC 60 

CTTGCTTTGC CTGCGAAAGC GCAACGAGCT ATGGGGAAGA CGGCCGACCG TTCGCTAATG 120 

GCTTCGGGAC ATTGGGTCAA GATACGTGTC GATGCAAGTG GAGTGTATCG CCTTACGGAC 180 

60 GAACAGCTCC GTGCCAATGG CTTCTCCGAT CCGTCCAAGG TAGGTGTGTT CGGTTATGGT 240 

GGAGGGGTGC TTCCCGAAGA T CTGAGC CGG ATCACGACAG ACGATTT'GCC TCCGGTACCG 300 

GTACTCCGTC AGGGCAATGC GCTGTATTTC TATGCCGTGG GCCCGGTGAC ATGGTTCTAC 360 

AATCCGGCCA AAACCACCAT GGAGCATACG GTGAATACAT ACAGTACGCA TGGCTACTAC 4 20 

TTCCTGTCGG ATGCTGCCGG AGCACCTTTG CACATGTCCC AATATACGGG TGGAGGTGCG 4 80 

65 TCGGCCGAGG CTT TGATCGA CTACTACGAT GAGCTGATGC TCCATGAACA GGAATTGTAT 54 0 

TCGCCCAAAG AATCGGGACG AGATCTGTAT GGCGAGTCTT TCAGTGCAGT CAATACGCGT 600 

ACGGTCAAGT TCCCTTTGAG GGG CAACACC CGCTCGTCTG GCG AACT CGG TACCGTATTC 660 

TCATACATAG CCAAGGCCAG ATCGGCCGGT GGCGGCCGTG AGATGTCGCT CTCGGCGAAT 720 

GGCATTCTGA TCTTCAGCGA TCCTTTTTCC ATGACATCGA ATGAAGTCTC CAATTCCTAT 7 80 

70 TTGGCCGGCA AGAAGCGTCG TCTCTATCAC AGTACGCCGA TGAACAGCTT GGTCAATGAG 840 

TTGCGCTi'GG ACGCGAACTA TAGCATGACA GGAGATGCGG TCAATCTGGA TTTCATAGAG 900 

GTGGCTACAC AGAACGACCT CCGGTACGAT GGCGCACCCA TGCATATCAG GCGGTTTTCC 960 

AATTTGCCCG TTTTGGGGGG CGAGTCCTGC CGGTTCGTTA TCAGTGAGGT GCCGGAGTCT 1020 

CTGGTGGTTT TGCAGGCCAA TTCTTCCCTG ACAGCATCGC TTGTTCCCGT TAAGACTGTC 1080 

75 GGGGATAAGA CCATTGAGTT CGTGGCTCCG CCGAAGGGTC AGGATCGTAG GACTATCAAT 114 0 
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10 



15 



20 



25 



30 



35 



40 



ACGTTTTATG 
AATCAAAACC 
CTCCTTGAGG 
GTCGTGTTGC 
TACCGCCTCT 
ACCTTCCCGA 
GTAGCTTGGC 
AATTCGACGA 
CCGGCerCGG 
CGTACTCCGG 
GAGAGTCGTG 
ACCGAGACTT 
CGCGCCTTTC 
GCAAAGAAAA 
CATGGCGGTC 
TTCAATTATA 
AGTCAGACGA 
ATGTTCTCGA 
CTTCGGCGTA 
CGATCGGCCA 
TTCTTTCTGA 
ACCGCAATCA 
GAACGGGTAG 
AGTGGCAAGG 
GAGGGAAACG 
GCCGAGGTGA 
TCCGAGCACG 
ATGGGGGTAG 
AATACACCGC 
GAGGTTAATC 
ACGGGTAGCG 
TACAACCTCA 
TTCATGATAC 
TTCAATAATG 
GTGGCTGATG 
CACAATCGCC 
CTTGTGAACA 
AAGTGGGATC 
TGTGTGGTGA 
GGACAA 



CCGTGGACTT 
TGCATGGAGA 
CTGATCGACT 
AGGAACAGGT 
TTGCCAAAAT 
TGCAAATGCT 
AGAAACCGTA 
ACGTAAACAG 
TCAATATCGG 
CCGAAGCTCG 
CCTGGCGTAT 
CCCGTTTGAT 
AGGACGTATA 
AGATGCTGGA 
CTGCCGGATG 
AGCATATGCC 
CCTCGGCAGG 
CTACGCGTGT 
TGTTCGAGAA 
AACAGGGGAT 
TGGGTGATCC 
ACGGGCAGGA 
CTCTGAAGGG 
TTTTCCTGAC 
ATCTCTCTCT 
AAGACGGACT 
AAGGCCGGAT 
ACTTCTCCAT 
CTGAAAT CAT 
CTACTCCTCT 
GAGTAGGGCA 
ATGCATATTT 
CGOCTTTGG C 
CCGTCCATCA 
TGATTCTATT 
CCGGAAGCGA 
GTTTGCCAGT 
TGACCTCCAA 
ACICTCCCGG 



GTCACAGGCT 
GGAAATCCCT 
GGCCACCTAT 
GTTCAACGAG 
GTTCTACGAC 
TCTCTTCGGT 
TCTCCAACAA 
TTATGTGACG 
TTCGCGCAAT 
CATCGCAGTG 
TCCTGCCTGT 
CGATACCGTC 
TCCGCATGTC 
AACCCTTCAG 
GT CGGACGAG 
CATTTGGATT 
GGAGGAGGTT 
CGTTTACAAT 
AGCTAAGGAT 
GCTCAGTACT 
GTCCGTGCGT 
TCCCGAAGGG 
TAAGGTAAGC 
CGTCTTCGAT 
TGTATATTAT 
CTTCGAAACT 
CAATCTTTAT 
CAGAGTCCAA 
AAGCTGCTTC 
GTTTATGGCC 
TGATATTACG 
CACAAGTTCG 
CCAAGGAGAT 
TGACTTTTCA 
CCCGAATCCG 
TTTGAACGTG 
CAAGACCTAT 
ATACGGAGTG 
AGGACAGACG 



TCTGCTCCGG 
GATCTGATCA 
CGTAGAGAGA 
TTTTCGGGTG 
AGATGGAAGG 
GATGGGGCTC 
ACGGAGTTCT 
GATGATTACT 
TATAATATGG 
GACAAGACCA 
TTTGCGGCAG 
AAGCGTTATG 
ATCGAGAACG 
TCGGGTATTA 
CATTTGCTGA 
ACTGCCACGT 
TTCCTCCATG 
ACGCAGAATG 
GGGCGTTATC 
GTTTTCCCCG 
ATGAATCTTC 
CAGTATGGAA 
GATGAAAAGG 
GGCAGAAAGA 
GACTATCCTA 
TCGTTTATCG 
GCTTATAACG 
CCGGGTATTC 
CTCAATGACA 
GAAGTATTCG 
CTTTGTATCG 
GCTACGGATG 
CATACTGCCC 
TTCAGAGTGG 
GTACGCGAGA 
GCCGTGGAGA 
TCGTCTTCCT 
AAGATCGGAA 
GCCTCCATGG 



AGATCCTCGG 
TTGTCTCTAC 
AAAACGGGCT 
GAACTCCCGA 
CAAATGCACC 
ATGACAACAG 
TGCTGACATT 
TCGGCTTGCT 
CTGTAGGGCG 
TCCGATATGA 
ACAACGGGGA 
CTCCTGCCAT 
GGTTGCACAG 
TCCTGCTTAA 
CGCTCAACGA 
GCGACTTTGC 
AGAAGAGTGG 
AGAAGATCAA 
GTACGATGGG 
ATTCGATCAA 
CTACCCACAA 
CTATTAT GCT 
GGACATTCGA 
AAATGACAGC 
ACGTGATGTA 
TACCCAAGGA 
AGAGCACAAA 
CTGATGAGGT 
GTACATTCCG 
ACTTGAATGG 
ATGGCCGTGC 
CACGTGTGGG 
GACTGACGGT 
TAGATGGCAT 
GTGCTACGTT 
TCTATGACTT 
ACGGAGAACC 
ACGGATTCTA 
CCAAGAAAAT 



AGCGG TACCC 
TCAGGCGCTC 
GAAGGTTTTG 
TGCTACAGCA 
TGTGGGAGAC 
GAAGGTCTCC 
CCAAGCCGTC 
GGATGATCAG 
ATTCCCCGTA 
GGAGGATCGA 
CAAGCACGCA 
CATGCCGGTA 
CATTCCGGGT 
TTATGCTGGT 
TATACACAAA 
CAACTATGAC 
CACTCCGATC 
TGGTTTTATG 
CGAGATTATC 
CCAGTTGAGT 
AGTGCAATTG 
CAAGTCTTTG 
CGAGACATTC 
TTTGGAAGAG 
TGCCGGTATT 
TGTGAACTAT 
GGCGGAAGCC 
AACGGAAGAT 
AT CGGGAG AT 
AATCAATATC 
CGACCTGACC 
CACTATTCTC 
TTGGGACATT 
TCCTCCGGAT 
CCGAATCTTC 
CACCGGTCGT 
TATAGAGATC 
CCTCTATCGT 
GATCGTGGTA 



1200 
1260 
1320 
1380 
14 4 0 
1500 
1560 
1620 
1680 
1740 
1800 

leso 

1920 
1900 
2040 
2100 
2160 
2220 
2280 
234G 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
31B0 
324 0 
3300 
3360 
3420 
3480 
34B6 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 117 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2919 base pairs 

(B) TYPE: nuclsic acid 
(CI STRANDEDWESS ; double 
( D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genonuc) 

(ill) HYPOTHETICAL; NO 

(iv> ANTI— SENSE : MO 



60 



(vi| ORIGINAL SOURCE: 

IA> ORGAHISK: PORY PHYROMON AS GINGIVALIS 

(lx) FEATURE: 

(A) NAME/ KEY : misc^Ieaturc 
<B> LOCATIOtl I... 2919 



65 



70 



75 



(xlj SEQUENCE DESCRIPTION: SEQ ID HQ: 117 



TTTTACACCG 
T TACT CAT AT 
ATCGCCCGAG 
TCTGTTAAAC 
TTCGCGATCG 
TATCACCAAA 
GCATGGAATC 
TAT CAT TAG A 
ATCAAAGCCA 
CATGATGAGG 



GAATTGACTT 
TGTCGATTTT 
AGAAAAAGCG 
AGCTTTACGA 
AGGTCGGCAA 
AAGGGTTGAA 
ATCTCGGAAC 
AGGCTCTTTC 
GATCGGCGGC 
CCGAAAAGAA 



GCATATAGAA 
GGTCGGATGT 
TATTACTGCA 
GAACAGCGTA 
ACGGATGCGT 
CGCTGCATTG 
GGATAGCCGT 
CTTGATAGAA 
CCTGAACGGC 
TTTCCTGAAA 



AGCCAAATGA 
GGAAAAAAAG 
TTGCTGTACG 
CAGAACAAGA 
AATATGTCCC 
AATCTAAGGG 
CGTATCGGTG 
TCTTTTAGCG 
ATCGGCAACA 
GCACTGCAAG 



AG AAACTT TT 
AAAAACACTC 
AAAAAGAACT 
ACCTTGTGGG 
AATACACAGA 
ACACCATCGT 
CTTTGGCAGA 
GAAACCAGAA 
TCAATCTTGA 
GTGAGAAAGA 



CCCATTATTA 
TGTAACTGAA 
CCCCACTGAT 
ACAAATGTTA 
TGCGATGCTA 
AGCCGCACAA 
AGCTTCGGAT 
TAGGCCTGCG 
GTTAGGATAC 
ACT CGACAGT 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/OI023 



81 / 490 



CCTCTCGGGC AAGCTATCAA CT AT GCG AAC CTCGGACGTA TCTATCGACA ACGCAAAGAA 660 

TACGACAAGG CTCGTACCTA CTTCCTCTTG TCTCTGGAGC AGAACAATAT GGCAGAGAAT 720 

CTGATGGGTA TCGGACTCTG TAGCATCAAT CTCGGAGAAG TAGACGAAGA AAAAGGGGAT 780 

TAT CAAAAGG CTTTGCAAGA GTATGCCACG GCATACAAAC TGATGGAACA GTTGT CCGAT 84 0 

5 CGATGGCACT GGCTGAATTC CTGTATCCCG ATGGCACGTA TCAATCTCAA ACAAGGTAAC 900 

GAAAGGCTCT ACCAGCATTT CATTTCTTTG GCCGAAGGGA CTGCGAAAGA AATTAATTCG 960 

ACTTCACATC TGATAGAAAT ATACAATCTT CAATACGAGA ATCTCGAGCG TAAAAAAGAA 1020 

TACAAACAAG CCCTCGAAGC ATTCTGTCTG AGCAAGACGT TGAGCGACAG CATGTCCATT 1080 

GCGCACAAGG TCAGCAGCAT ACAAGAAACG CGATTCAACT ACGAACGAAA CAACTCCCAA 114 0 

10 AAAGAGCTTG AAGAAATACA GCAAGTAAGC AAGGCAAAAC AAGAGAAATC GAAGTTTATC 1200 

CTCTTGAGCA CTCTTTTTGC CCTTTTCATC TCGATTCTTT TGATTTCTGT TCTGACATAT 1260 

GCATACCGTC AGGGCAAGAA GCATAACAAG CTGATCAAAG AGACGGATAA ACTTCGCTCC 1320 

GGCTTTTTCA CCGGTATTAC ACACGAATTT CGTACGCCTA TCACCGTCAT ACAAGGTTTG 1380 

AATGAGAAAA TGAGTTCAAG TCCTGATCTC CAAGCATCGG ACAGAACCGA GCTGCACAAG 14 4 O 

15 ATAATAGACA GACAGAGTAG CCATATGCTG AATTTGGTCiA ACCAGCTGTT GGATATTTGC 1500 

AAGATCAGAA GCGGAGTATC CACGCCCGAA TGGCGCAATG GCGACATCGT CTCCTTCGTA 1560 

CAGATTCTCA TCGATTCGTT TGCACCATAC GCACAGGCTC AAGACATAAC CTTGGAGCTA 1620 

CAACCCGAGA GCAAACCTAT TGTCGTGGAC TTCGTCCCCT CCTACTTGCA AAAAATCATA 1680 

TCCAATCTTT TGTCCAATGC CATCAAGTAT TCTTTAGCCG GAGGGAGAGT GGTCATATCT 174 0 

20 CTGGCAAAAA CCAAGAATGA AAAAAATCTG ATCATACGCG TTGCAGACAA TGGCATAGGA 1800 

ATAGATAAAA CTGATCAGCC TCATATCTTC GACATCTTCT ATCGAGGACA CTCCGCTACC 18 60 

GAAAAGCATG GATCAGGCGT CGGACTCTCG TTTACCAATA TACTGGTCGA AAACCTTCGA 1920 

GGTACGATCA AAGTGGAAAG CCAGCCGGGG AAAGGAAGTG CCTTCACCAT CAGTATTCCT 1980 

ACACAAAACC AGTCCTCTTC GGCAGAGATT CTTCCTTGGC TACCCTCCTC CGATGACATT 204 0 

25 GTCATGCCTG TCCACATCGC GCCCGATGAC TCACCGACAT CTCCGATGGT AGCAGOrCTG 2100 

AATCATCGCT TCGAGGACGA ACGTCCGACC ATACTGCTCG TCGAGGACAA TAAGGATATC 2160 

AACCTGCTCG TCAAACTACT CCTTTGCGAT CGCTACAATG TGCTATCCGC CGCAAACGGA 2220 

AAAGAGGGTA TAGCCCTCGC TACCGAGCAT ATTCCCGACA TTATCATTAC GGATATTATG 2280 

ATGCCGATAA TGGATGGGAT AGAAATGACA ATCCGGATGA AGCAATCGCC TCTGCTCTGT 23 4 0 

30 CACATTCCCA TTGTCGCTTT GACGGCCAAG AGTACCGAAC AGGACAGATT GGAAGGAATC 24 00 

AAAAGCGGTG TAGTCTCTTA TCTATGCAAG CCATTCTCTC CGGAGGAGCT TTTGATGCGG 24 60 

ATCGAGCAGC TTCTGAAAGA CCGTGAGTTG CTCAAGAAGT TCTATATGCA AAAACTCATG 2520 

CTGGATCGGA AGCCGGAGGA GGAGCCTCAA CCGAT AGATG ACAGCAGTAT GCAGTTTCTC 258 O 

CTTGCTGCCA AAGATGCAGT GTCCGGTGGA ATCAAACAAA ATCCGGATTT TTCCGCTCAA 264 0 

35 GACTTGGCCG AAAAAATGTG CATGAGTCCA TCCCAACTCA ACAGAAAGCT CACGAGTGTC 27O0 

GTAGGTTGCT CCACCATCGG CTACATACAG CAGATCAAGA TAAAATTGGC CTGCAAGCTC 27 60 

CTTGCCGATG AGAGCAAAAA CATCTCCGAC ATTAGCATTG AGGCAGGCTT TTCGGATCCG 2820 

GCTTACTTCT CTCGCACCTT CAAACGCTAC ATGAACTGCT CTCCCTCCCA ATATCGGCAA 2880 

AAACTCCTTG CCATGCCGGG GAGCGACAAG GAGACAGTT 2 919 



40 



55 



12) INFORMATION FOR SEQ ID NO; 118 



(i> SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDF.DNESS: double 

(D) TOPOLOGY; circular 

50 (ii> MOLECULC TYPE: DNA (genomic) 

(ill) HYPOTHETICAL; NO 
tivj AHTI -SENSE: NO 



Ivl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRCMONAS GINGIVAL! S 



(ixl FEATURE: 
60 (AJ NAME/ KEY: misc_feature 

(B) LOCATION 1...1689 

(xi> SEQUENCE DESCRIPTION: SEO ID NO: 118 

65 CATCATAAAA CATATCAAAC AATGAAAAAG CTTTTACAGG CTAAAGCCTT GATTCTGGCA 60 

TTGGGACTCT TCCAACTGCC CGCAATCGCC CAAACGCAAA TGCAAGCAGA CCGAACAAAC 120 

GGTCAATTTG CAACAGAAGA GAT GCAACG A GCATTCCAGG AAACGAATCC CCCTGCAGGT 180 

CCTGTGCGTG CTATCGCTGA GTACGAACGC TCTGCAGCCG TTTTGGTACG CTACCCGTTC 24 0 

GGTATCCCGA TGGAATTGAT CAAAGAGCTG GCCAAGAACG ACAAGGTGAT TACCATTGTG 30O 

70 GCGAGTGAAA GCCAAAAAAA CACCGTTATA ACCCAGTACA CCCAAAGCGG TGTGAATCTC 3 60 

TCTAATTGCG ATTTCATCAT TGCGAAAACT GACTCTTACT GGACACGCGA CTATACCG3T 420 

TGGTTCGCAA TGTACGATAC GAACAAAGTA GGTCTCGTGG ACTTTATTTA TAACCGCCCT 480 

CGTCCTAACG ATGATGAATT CCCCAAATAC GAAGCACAAT ATCTGGGCAT CGAGATGTTC 54 0 

GGGATGAAGC TCAAGCAGAC CGGTGGCAAC TACATGACGG ACGGATATGG ATCCGCTGTG 600 

75 CAGTCACATA TCGCATATAC GGAGAACTCC TCTCTGTCTC AAGCTCAAGT AAATCAAAAG 660 
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10 



15 



20 



ATGAAAGACT 
ATCAACCAT G 
GTGCCTGACA 
CAGACCTGCG 
CCGTACACGA 
TCCGTGGACA 
GGTGTCAAAG 
CACGAGGTAG 
GCAGGCCCTG 
CCGGTACAAT 
ATGGAATCAA 
GAATACTATA 
GAACCTGATC 
GCTGCCAAAG 
AGTTTGAATA 
GCTGCAATGA 
CAGGCTCCGG 
ATTCTCAAA 



ATCTCGGCAT 
TGGACTGTTG 
ATCACCCTCA 
CATGGGGAAC 
ACTCTCTGAT 
ACGATGCTCT 
GGGCTTCAGG 
CGGATAAGGG 
ATTATAAGAT 
GTTACTATCG 
CAGGTCACTA 
TCTCTGCCGC 
CTTTCAAGTT 
CTCTTCGTGC 
TTGCCGGCAC 
CCAAGGAATT 
GCACATATGT 



CACACATCAT 
GGGCAAGTAT 
GCACCAAGCC 
GAAGTACGAG 
TCTGAACAAC 
GAACGTCTAT 
AACACCTTGG 
CTATCTCTAT 
CGAAGCAGAT 
TATCAATGGT 
TACTTATAGC 
TGACAATAGT 
TACGTGTATG 
ATGGTTCAAC 
ATATCGGATA 
AGTAGCAGGG 
TCTGGTTGTT 



GATGTGGTAC 
TTGGCACCGA 
CTGGAAGATA 
GTATATCGCG 
AGGGTATTTG 
AAGACGGCAA 
TTAGGAACAG 
ATCAAGCACT 
GTCGTCTCAT 
TCCGGTAGCT 
TTTACAGGTC 
GGTCGCAAAG 
AACGAAACCA 
GCCGGTCGTT 
AAGCTTTATA 
ACGAGTGTCT 
GAAGGAAATG 



AAGATCCGAA 
ACAAAATCCT 
TGGCAGCCTA 
CTTTGGCCAC 
TTCCTGTCAA 
TGCCCGGTTA 
ATGCCCTGCA 
ACCCGATACT 
GCGCCAATGC 
TTAAGGCTGC 
TTAACAAGAA 
AGACTTATCC 
ATACATGTAC 
CAGAACTGGC 
ACACCGCAGG 
TCAGTATGGA 
GAATCCGTGA 



CGGCGAATAT 
CATCAGGAAA 
CTTCGCAGCA 
CAATGAACAA 
TGGCCCCGCC 
CGAAATTATA 
TTGTCGTACT 
GGGCGAACAG 
TACTATCTCG 
TGATATGACti 
TGATAAGGTA 
CTTTATCGGC 
TGTGACCGGA 
TGTTTCGGTA 
AGAAGAAGTC 
TGTGTATTCT 
GACAATGAAA 



720 
780 
840 
900 
960 
1020 
1080 

mo 

1200 
1260 
1320 
1380 
14 4 0 
1500 
1560 
1620 
1680 
16G9 



(2) INFORMATION FOR- SEQ ID NO:119 



25 



<i> SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 1311 base pairs 
tB) TYPE: nucleic acid 

(C) STRANDEDNF.SS: double 

(D) TOPOLOGY: circular 



30 



(ii> MOLECULE TYPE: DNA (genomic! 
(Ill) HYPOTHETICAL: MO 



35 



l xv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 



40 



(ix) FEATURE; 

(A) NAME/KEY: mlscfeature 

(B) LOCATION 1. . .1311 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 19 



45 



50 



55 



60 



65 



ACCACAAATA 
ATTGCATTGC 
GGGATTCGCA 
CATTCCATAG 
GGAGAGGAAA 
CTCACTATCT 
GATGTAACGA 
TTGGACTTGA 
AGTTTGGACC 
ACTGAGCTGA 
CTGACGGAGT 
CGGATCACGA 
AATGAGCTTT 
TCGAACAAAC 
CATTCCAATC 
TCTCCCGGC5 
AAGAATATCT 
GACTTCAACG 
GCAGTAGATG 
GAGATCCCCG 
GTCTATAGTT 
GGCACTTATT 



GAAAACCAAA 
TGACGATGGG 
TGTCTGTCAC 
AGAAGAAAGG 
TAACCGTATT 
ATGGTAATAC 
AAAACCCTAA 
CGCAAAACCC 
TGAGTGGCAA 
AGACCGATAA 
TGGAACTCAG 
AACTCGATCT 
CGACCTTGGA 
TTACATCCTT 
AGATCGCAGG 
CAGGCGCTCA 
GTACCGTAAA 
GTGATTCTGA 
CTCCCACTGT 
AGTCTCTTTT 
TCGCGGTAGA 
TGTTCCGTCT 



TACTAATATG 
ACATGCTGTG 
TACAACCAAG 
CATCTGGATC 
CGATGAGGCA 
GACCCGATTG 
TCTGACCTAT 
AAAGCTGCTG 
TCCGGCTTTG 
CAACCCCAAG 
TGCCAATCCT 
GAGTGCCAAT 
TCTTTCCAAG 
GAATCTGTCG 
TGAAGAAATG 
GAGCAAGTTC 
GGATGTGGAA 
CAATATGCTT 
CAGGATATAT 
AGGGCAGGAA 
GTCTCT'l'CGT 
CGATAACTAT 



AAACTTT CAT 
CAGGCACAGT 
GCCGTAGGCG 
GATCTCAATG 
TACCACGAAT 
GGCTGTCGAT 
CTCGCATGCC 
CGAGTTTGGT 
ATCATCCTCG 
TTGGCCTCTC 
CGTCTCAATG 
CCTCTATTGG 
AATTCGGACG 
GGGGTGAAGG 
ACGAAAGTGG 
GTCGTTGTAG 
AAAGd'AAAA 
CCATACGAAG 
CCCAATCCGG 
GCTGCTTTAT 
CAGAACATTG 
ACCACTAAGC 



CTAAGAAAAT 
TTGTTCCGGC 
AAAAAATCGA 
GGGATGCCAC 
ACACGATCGG 
CTACCGGTGC 
CGAAAAATAA 
GCGACTCTAA 
GCTGTGACAG 
TTTGGTGTTC 
ATCTTTGGTG 
T AACACTTTG 
TTGCTTACCT 
GACTGAGTGT 
TGAATGCTTT 
ACCTCAAGGA 
GTAAGAACTG 
GAAGTCCGAC 
TAGGAAGATA 
ACGATATGAA 
ACCTGACACA 
TCATCAAACA 



CTTAGCAATC 
TCCCACCACA 
ATTGTTGGTT 
TTACCAACAA 
GACGCAAACC 
AACGGCTGTC 
TCTGAAATCA 
CGAAATAGAA 
GAATAAGCTG 
TGATAATAAC 
CTTCGGTAAT 
GTGCAGTGAC 
TTGGTGTTCA 
TTTGGTTTGT 
GCCCACACTA 
CACTGATGAG 
GCGAGTATTT 
ATCGAACTTG 
TGCGCTCGTC 
TGGGGTAAAA 
TCTTCCCGAC 
G 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1311 



70 



75 



U) INt'ORJIATION FOR SEQ ID NO: 120 

11) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 
(Dl TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



83 / 490 



(ii) MOLECULE TYPE; DNA (genomic) 

(ill) HYPOTHETICAL: NO 

5 tlV) ANTI-SENSE: NO 

fvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

10 fix) FEATURE : 

(A) NAME/KEY: mi;rc_f ea ture 

(B) LOCATION 1. - . 954 



15 



35 



45 



50 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



AATCATTTGA AGACAAATAT CAAGATGAGA AAAACAATAA TTTTCTGCTT GTTGCTCGCC GO 

CTATTTGGCT GTTCTrGGGC ACAAGAAAGA GTCGATGAAA AAGTATTCTC CGCAGGAACA 120 

AGTATTTTTA GGGGCATCCT TGAAAAGGTG AAAGCACCGC TTATGTATGG AGATCGTGAG 180 

GTATGGGGTA TGGCTCGTGC GAGCGAGGAT TTCTTTTTTA TACTTCCCGT TACGGATGAC 240 

20 CTCACTCCCG TGCTTTTCTA TAACCGTCTT ACAAACGAAC CCTGCTTTGT GTCAGACCAA 300 

GGAATAACTG AGTATTTCAA ATTCGCTCAA G AAGGTG AT T ACATTGAAGT CGAAGGAAGC 360 

TCTGTATTCA TGCCGAATCT TTTGTACTAT CGTTTTTTCC CGACAAGAAT TACCTCCTAT 4 20 

AATGCTCCCA TTGAAGGTGT TGTGAGCAAG ACGGGAAAT C CTGCTTTTAC AATCCCGATG 480 

CTCCCGGGGG TTTCTGATTG CATAGAAATC TCAAACAACC GCAAAGTCTT TCTCACCAAT S40 

25 CAATTAGGGG TTGTAAACAT CACTGACGGG ATGGAACCTC CGATTATTGC CGGAGTCTCT 60O 

GCTTCCTATG GATCTTCCGT CCGGGTGTAT GGTCATGTCT CACAGCGGTG GGACATCATA 660 

GGCCATTGCT ATTTGGATAT CTACCCAACC AATTGCTATC CGCTCAGCAC GAAACCCGTT 720 

GCAGGAGACG ATGAGGTTTT TGTCAAACAA CAAGGCAGGC AAATAGAGAT CG AT AG C AAC 7 80 

AGCCCCATAG TCCAAGTGGT CGTATACGAT CTTGAGGGGA AAAGTGTTTT TCGCAAAAGA 04 0 

30 ATGACCGAAA ACGCTTATAC CCTATCCTTT AGAGCACCCA TGCTCGGCTT TATGACCATC 900 

ATGATCGAAA CACAAAATTC GATTATCAAT AAAAAACTTA ATGTTACACA GCTA 954 



(2) I N FORMAT T OH FOR SEQ ID NO: 121 



li> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 
IB) TYPE: nucleic acid 
fC) STRANDEDNESS: double 
40 ID) TOPOLOGY: circular 



Hi) MOLECULE TYPE: DMA (genomic) 

<iit) HYPOTHETICAL: NO 

(xv) AHTI-SE1ISE: NO 

<vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYR0MONAS GINGIVALIS 



(iX) FEATURE: 

(A) NAME/KEY: misc_ieature 

(B) LOCATION 1...1383 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

AGACGGGCAG TTAAAATCCG ATCACCTCCC CATATCCACT CATTATTTGT ACGTAAATGT 60 

CTTTTTTCAG ACTTTAAATA CTTATATTTA TCCCGAAAAA TTACCCAAGA GAGATTGGGT 120 

CGATTGTCGA TCCGTCTTAA GTCATACAAT CCAATAATCA TTATTGAAAT GAAAAAAACA 180 

GO ACCATTATTT CTTTGATTGT CTTCGGTGCT TTCTTTGCAG CCGTGGGCCA AACCAAGGAC 24 0 

AATTCTTCTT ACAAACCTTT TTCGAAAGAA GATATTGCCG GAGGAGTTTA CTCTCTCCCG 300 

ACTCAAAATC GTGCGCAGAA GGACAATGCC GAGTGGCTTC TTACAGCGAC CGTCTCCACA 360 

AACCAGTCTG CAGATACTCA CTTTATCTTC GATGAGAACA ACCGCTATAT CGCTCGTGAC 420 

ATAAAAGCCA ATGGGGTAAG AAAATCCACG GACTCCATTT ACTACGATGC CAACGGGCGA 4 80 

65 ATATCGCATG TGGATCTTTA TATCTCGTTC AGTGGCGGAG AGCCTGCACT CGACACCCGA 54 0 

TTCAAGTACA CCTATGATGA CGAGGGAAAG ATGACCGTGA GGGAAGTATT CATGCTGGTA 600 

ATGGATCCGA ATACACCTAT CTCACGCTTG GAATATCATT ATGATGCACA GGGCAGACTG 660 

ACCCACTGGA TTTCTTTTGC TTTCGGGGCA GAATCCCAAA AGAATACGTA TCACTATAAT 720 

GAAAAAGGTC TGTTGGTCAG CGAAGTGCTG AGCAATGCAA TGGGGACAAC CTATTCAGAC 780 

70 ACCGGCAAAA CGGAATACAG CTATGACGAT GCAGATAATA TGGTCAAGGC CGAGTACTTC 840 

GTCUTCCAGC AAGGAAAGGC ATGGCAAGTA CTCAAAAGAG AGGAATACAC CTATGAGGAC 900 

AATATCTGCA TACAATATTT GGCTATTAAC GGTACCGACA CAAAGGTGTA CAAGCGAGAC 960 

ATCGAGAGCG ATAAGTCCAT CTCCGCAAAT GTCATTGACA TTCCGTCAAT GCCGGAACAG 1020 

ACCTGGCCTA ATATGTACGG ATTCAACGCA AAGCGACTGA AAGAGACTTA TTCCTCCTAC 1080 

75 GAAGGAGATG TGGCTACTCC TATATTCGAC TATATCTATA CGTACAACGC TCTTACCTCA 114 0 
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ATGGCAACAC CTTCGACAGA AGCTCAGGTA GCAGTCTATC TCAATCCGTC AACGGACCGG 120O 

TTAGTGATTC TGGCCAACGG CATCACACAT CTGAGCATGT ACGACTTGCA GGGTAAGCTT 12 60 

ATCCGTGATT GTGCCTTGAG CGGCGATAAG GTGGAAATGG GTGTCGGATC TTTGACCAAA 1320 

GGGACATACC TGCTTAAAGT GAATACGGAT CAGGGAGCCT TT GT GAG AAA AGTCGTGATT 1380 

CGA 1383 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



|2) INFORI1ATIOM FOR SEQ ID NO: 122 

{ I J SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13b3 base pairs 
IB) TYPF. : nucleic acxd 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: circular 

<ii) MOLECULE TYPE: DNA (genomic) 

[iiij HYPOTHETICAL: MO 

(iv) ANTI— SENSE: NO 

(Vl > ORIGINAL SOURCE: 

{A> ORGANISM: PORYPHYROt ION AS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : mien_t eature 
(B> LOCATION 1...1353 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



ATGGCAAAAG 
CCCGAGATGC 
GAAGGTGTTA 
CTGATGCACC 
ATCGCGGTGA 
CTGAACGAAT 
AAGGAGCTTT 
GTGGCT'ACAC 
TTGGCTCCGG 
GATGCCTTGG 
CTGGGCTTGC 
GGCGTGCl'GA 
GCTACCGACC 
AT GAT TG CCA 
TGCAATGTCT 
ATCGATGGCA 
CGGTGTGACC 
GCTGCACCCC 
AAAAACAAAG 
AGCAACGAGT 
ATTATAGCAT 
GACTTTGCCA 
GAGGGCTTGG 



TTATAAAAAC 
TGGCCGAACC 
TCCCCAAGGT 
ACAAGGCATA 
ATCGCGGTGC 
ACGAGTCATT 
TACTGTCGAG 
CGGATATAGC 
ACTTCGATTT 
CCAAACTCAC 
ACAATGCAGA 
TCAATCATAC 
TGATCGTGAT 
TGACCGGCTC 
TTGCTTCCTT 
ATGTGCTGAC 
AG AT C ACGGT 
GTCTCGATCA 
AGTACGTACT 
ATGACCGCGT 
TCGACATCGA 
CTTGCGAATT 
ATATGCT CT A 



AAAAAAAGGC 
GGCCCAAAGT 
GACGGCTCGT 
TCCGGAGATG 
CAAGCGCAAG 
CCCTGTCGGG 
CGGTATGTGG 
TCCACGCGAC 
CAT CGTTCG A 
GACAGGAAAG 
AATCGTAGAA 
GAAGCCAATC 
CGGACGTTTC 
AGACGCTGCA 
CCCCGGCCGA 
CGGTAAGAAG 
GATCCCCGAA 
GTACAGCATG 
CGATGCCCGG 
TTTCCCGATG 
CAAGATGGAG 
TGTGGATACA 
TAAGGAAATG 



CTTGCACTTA 
CCTACTTACG 
CCGGGGGATA 
AAGTTTACAA 
GTGTTGAGCA 
GATCCGTCTG 
GGTTTTATTA 
ATTTATATTA 
GGAGAAGAAC 
GTGTATGTG<3 
GTACACGGAC 
AATCGGGGCG 
CTGCTTACGG 
GCTCACGGAT 
CTGACAATAA 
CTCTGCGAGA 
GGCGACGATG 
AC CAGAGCTT 
AT CAAGGGTG 
GACATCTATC 
GACTTAGGCA 
TCCAAGATCG 
AAT 



ATCTGAAAGG 
CGGTCGTGCC 
AGGTGCGTGC 
GTCCGGTTAG 
TCGAGGTGAA 
CCCTCTCTGC 
AGCAACGTCC 
CTGCCAACTT 
GCGCCCTGCA 
GCCTGAAGCC 
CTCATCCGGC 
AAACGGTGTG 
GCAAAGCCGA 
ACGTCCGTAT 
AGGAATCTCA 
AGGAGCCTTT 
TGGACGAACT 
ATTTCTCTTG 
GCGAACGTGC 
CGGAGTATTT 
TATATGAAGT 
AGCTGCAGCG 



AAAACCGCTG 
CGACGATTTT 
CGGCTCAGCA 
CGGCGAAGTG 
ACCGGACGGA 
CGAACAGATC 
TTACGACATA 
TACTGCACCA 
GACTGCCATC 
GGGTTCATCT 
AGGTAACGTG 
GACGCTCAAG 
TTTTACCAGA 
TATGCCGGGT 
CGAGCGTGTG 
CCTGTCAGCC 
CTTCCGGTGG 
GTTGCAGGGG 
TATGATCATG 
GCTCAAGGCT 
GGCTCCGGAG 
TATCGTTCGC 



(2) INFORMATION FOR SEQ ID NO: 123 

fit SEQUENCE CHARACTERISTICS: 

tAJ LEMGTH: 585 ba?e pairs 
( D) TYPE: nucleic actd 
(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

till) HYPOTHETICAL: NO 

(iv> AUTI -SENSE: NO 

(Vi) ORIGINAL. SOURCE: 

tA> ORGANISM: PORYPHYROt IOHAS GINGTVALIS 

tlx) FEATURE: 

(A) NAME/ KEY : micc_feature 

(B) LOCATION 1 . . .585 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1 080 
1140 
1ZOO 
1260 
1320 
1353 
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(xi) SEQUENCE DESCRIPTION: SfclQ ID NO:123 
ATGGATAAAG TGAGCTATGC TCTGGGATTG AGCATCGGTA ATAATTTCAA GTCTTCtzrrzr 

5 £££££ SEES! 6 CAAGGTCTGT c£SS££ 

^rrrrl,'' TCTCGTATGA CGAGGCCAAG CGCGAAATAG AGGCGTATTT CATGGATTTr i on 

nnS^^ CTGTCAAACT GAACAAAGAG GCCGGAGAAG AATTCCTCAA GA^T^S 5J2 

CACAAGGAAG GTGTGACGAC CTTACCGAGC GGCTTGCAAT ACGAAGTCAT TAArATrrrn Inn 

GAGGGCCCGA AACCCACCCT TTCGGACACG GTAACCTGTC ATT^^C^ JJSS^tc 1% 

10 AACGGTATCG TTTTCGATAf: rrrrnT^n,. ^^~,i_;r ™CftCGG TACGCTCATC 3fin 



15 



20 



35 



70 



75 



360 
420 



S ™ CAATTAATGC CTGTAGGATC C^T<S 480 

CGAGCcATCT GGCGTATGGA GATCGTGGTG CCGGCGAACA TATCAAArrr c. n 

GGTAGTACGC TCATTTTTAT AATCGAATTA TTGAGTATCA ACAaT TATCAA * C CG 540 



(2) INFORMATION FOR SEQ ID NO: 1 24 

(1) SEQUENCE CHARACTERISTICS - 

<A) LENGTH: 819 base pairs 
(BJ TYPE: nucleic acid 

(C) STRAW DEDNESS: double 

(D) TOPOLOGY: circular 

25 MOLECULE TYPE: DMA (genomic) 

(ili) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

30 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



fix) FEATURE : 

(A) NAME/KEY: raise teaLure 
(BJ LOCATION 1 . . . 619 



SEQUENCE DESCRIPTION: SEQ ID NO: 124 

40 £g££tg^ gcacSa" SSSSS ^? TGGGAG cagt ™gttt tgcaagtgct 

ACTATTCAGG t™^2£ fc^SS^ AATTCAGTAC AGCAACCTCG TGAGGAAGCC 120 
™n~I*~ GG . TTTGTGGAG A ATTGGCAGAG CAAGTTGACT GCATTGGGAC AGGTAATTr-r inn 

sssss sssss ssss ^sssss? esses SEE 

45 I?^S?C ^'^AAGATA GTGATCCCGA SJSS^S 

sssss sssss sssss sss? ssss ™ 

55 (2) INFORMATION FOR SEQ ID NO: 125 

11) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 base pairs 
Rn <B) TYPE: nucleic acid 

OU (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

65 ( J i i ) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALS 
(ix) FEATURE: 

(A) NAME/KEY: raise feature 

(B) LOCATION 1 . - .1662 
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(xl } SEQUENCE DESCRIPTION: SEQ ID NO;lZ5 



10 



15 



20 



25 



30 



ATGCCAAGAA 
GTTTTATTTA 
GAGACGATGT 
GACAATGGAT 
CTGTATCGTT 
GTGCCTTATG 
GATATCAATG 
GGCGTTTTGC 
AAGGATTTCC 
TCTCCTTACG 
AGCGATCACA 
AAGCGTATTT 
AGTACATCTC 
AATAAGAACC 
CGCTATGCGT 
CCTTTGGGAG 
GTGGGTGGAG 
AAGCAAT CTT 
CACTGTATAC 
TTTCAAGATC 
TATGGTTGGA 
CCGCAAGTAG 
AGCCCATATG 
TCCGATACGG 
ATGAAGCTCT 
GGGCACGAGG 
GGGAAAGAAT 
GTAGCGGATA 



TTATGAAATT 
GCCCTCTGGC 
CCACCCTATC 
GGATCTATGT 
CCAAAGACGG 
ACTTCGATGT 
TTTGGACAGT 
TGATGCATCG 
CTAATAATAG 
GTTTGGGGGG 
GCTTCTTGGA 
ACACAAGACC 
CTTCTCTTGG 
TTGATGGCTT 
GGTCTGAACC 
CACTAAGTAT 
AACGCTCCCA 
TCAATTATTC 
GTATTCCGGC 
ACAATCTAAT 
GTTGGCCATA 
CACTCAATCC 
ATGAAACCAA 
AGTGGGTACA 
ACCCCAATCC 
CAGTCGTATA 
ATAAGCTGAA 
CGGAGTATTT 



AAAAATT GCA 
CAAGGCCCAG 
AGGATATTCA 
GATGTTGGAT 
TGGTGCTACT 
CTCGCATTGC 
CATGACAGCA 
CCATGATGCA 
ACTGATGGGT 
CGATCCTTTT 
CTATATTTTT 
CCAAAAACTG 
ATTTAATACT 
CGACATT GGT 
GATAATAATA 
AGAGATCCAA 
TAACTTCCTG 
TCCCGGACAT 
TTTGGCATAC 
GAGATACAGA 
TGTATATGCA 
TACCAATGGA 
ACCACATCCT 
TGCTCTGGAC 
TGCCAAAGAA 
CGACATGCAG 
TGTGCAGTAT 
CGTGGAAAAA 



CTCAGACTGC 
ATGGATATTG 
GAGGATTTTT 
TTCTCTCGTA 
TACCAAAAGT 
GATTTTATTG 
TTCGAATATG 
GATATCAATA 
GTAGCCATCG 
GCT CTCGCT G 
TCGTTAGATG 
ACTATCAATA 
TGGCCACTAA 
TTCATTTCCA 
GAAGAAGACT 
ATGATGTTGG 
AT CACTT ACC 
ACACCGACAA 
GATAAGGAAG 
TGGATCAAAT 
AAAGAAGCTA 
AAGGCTTGTT 
ACTCCTGTAA 
GTGGGGGACG 
TATGTTCTGA 
GGCCGAATCG 
CTGTCCAAAG 
ATCATTGTAG 



TGCTGGCGAC 
GTGGAGACGA 
ATTACAAGAT 
TTTATTTTGA 
TAGGGTCTTT 
TAACGGGAAA 
TAGGTGGTAC 
ATACAGAGTG 
CCTCCAACTA 
TCGCCGTTAG 
GTGGAGTACA 
GAGTAGACCT 
TGGGAGTCGT 
ACTTTGTGGA 
GTGGATGGAC 
ATGACAATTC 
CGGGCCATTA 
AGAAAGATCT 
GCGATCGTTA 
ACGATGACAT 
AAGATAAAAA 
GGGTATGGCA 
TTATTAAACA 
TATTGCAGAA 
TCAACCTACC 
TGGAGAAAGT 
GTACGTACAT 
AG 



TTTTG CCAT A 
TGTAT'l'GATC 
GGCTGTGGCA 
TGATGTCAGG 
GGGGTCTTTG 
GGATGAAGAT 
TATTGGCAAT 
TGTGTACAAG 
CCGTGCGCCC 
TGGCTCCGGA 
CTTTGAGCAA 
TTCATTAGGC 
ATTCGAAATG 
CTATGATCCC 
TGATTTTAAT 
GGATAATACC 
CGTATATCCG 
GGTCTTTAAA 
TCTGACTACT 
TAACTCTTTT 
GAGGCGCCGT 
TACTCGCAAG 
TTTCCTATGG 
GGAGGGTAGC 
CAAAGAAGGG 
TTCATTTTCA 
GCTGAAAGTT 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1662 



35 



40 



45 



(2} INFORMATION FOR SEQ ID HO; 126 

< i ) SEQUENCE CHARACTERISTICS : 

I A) LENGTH: 1650 base pair* 
( D) TYPE: nucleic acid 
(CI ST RANDEDNE SS : double 
{D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

tiii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE; NO 



50 



55 



60 



65 



70 



75 



tvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

< ix ) FEATURE : 

(A) NAME/ KEY : misc_feature 
IB) LOCATION 1...1650 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO; 126 



ATGAAATTAA 
CCTCTGGCCA 
ACCCTATCAG 
ATCTATGTGA 
AAAGACGGTG 
TTCGATGTCT 
TGGACAGTCA 
ATGCATCGCC 
AATAATAGAC 
TTGGGGGGCG 
TTCTTGGACT 
ACAAGACCCC 
TCTCTTGGAT 
GATGGCTTCG 
TCTGAACCGA 
CTAAGTATAG 
CGCTCCCATA 
AATTATTCTC 
ATTCCGGCTT 
AATCTAATGA 



AAATTGCACT 
AGGCCCAGAT 
GATATTCAGA 
TGTTGGATTT 
GTGCTACTTA 
CGCATTGCGA 
TG AC AGO ATT 
AT GAT GCAGA 
TGATGGGTGT 
ATCCTTTTGC 
ATATTTTTTC 
AAAAACTGAC 
TTAATACTTG 
ACATTGGTTT 
TAATAATAGA 
AGATCCAAAT 
ACTTCCTGAT 
CCG GACATAC 
TGGCATACGA 
GATACAGATG 



CAGACTGCTG 
GGATATTGGT 
GGATTTTTAT 
CTCTCGTATT 
CCAAAAGTTA 
TTTTATTGTA 
CGAATATGTA 
TATCAATAAT 
AGCCATCGCC 
TCTCGCTGTC 
GTTAGATGGT 
TATCAATAGA 
GCCACTAATG 
CATTT CCAAC 
AGAAGACTGT 
GATGTTGGAT 
CACTTACCCG 
ACCGACAAAG 
TAACGAAGGC 
GATCAAATAC 



CTGGCGACTT 
GGAGACGATG 
TACAAGATGG 
TATTTTGATG 
GGGTCTTTGG 
ACGGGAAAGG 
GGT GGT ACT A 
ACAGAGTGTG 
TCCAACTACC 
GCCGTTAGTG 
GGAGTACACT 
GTAGACCTTT 
GGAGTCGTAT 
TTTGTGGACT 
GGATGGACTG 
GACAATTCGG 
GGCCATTACG 
AAAGATCTGG 
GATCGTTATC 
GATGACATTA 



TTGCCATAGT 
TATTGATCGA 
CTGTGGCAGA 
ATGTCAGGCT 
GGTCTTTGGT 
ATGAAGATGA 
TTGGCAATGG 
TGTACAAGAA 
GTGCGCCCTC 
GCTCCGGAAG 
TTGAGCAAAA 
CATTAGGCAG 
TCGAAATGAA 
ATGATCCCCG 
ATTTTAATCC 
ATAATACCGT 
TATATCCGAA 
TCTTTAAACA 
TGACTACTTT 
ACTCTTTTTA 



TTTATTTAGC 
GACGATGTCC 
CAATGGATGG 
GTATCGTTCC 
GCCTTATGAC 
TATCAATGTT 
CGTTTTGCTG 
GGATTTCCCT 
TCCTTACGGT 
CGATCACAGC 
GCGTATTTAC 
TACATCTCCT 
TAAGAACCTT 
CTATGCGTGG 
TTTGGGAGCA 
GGGTGGAGAA 
GCAATCTTTC 
CTGTATAGGT 
TCAAGATCAC 
TGGTTGGAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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10 



25 



55 



70 



TGGCCATATG TATATGCAAA AGAAGCTAAA GATAAAAAGA GGCGCCGTCC GCAAGTAGCA 1260 

CTCAATCCTA CCAATGGAAA GGCTTGTTGG GTATGGCATA CTCGCAAGAG CCCATATGAT 1320 

GAAACCAAAC CACATCCTAC TCCTGTAATT ATTAAACATT TCCTATGGTC CGATACGGAG 1380 

TGGGTACATG CTCTGGACGT GGGGGACGTA TTGCAGAAGG AGGGTAGCAT GAAGCTCTAC 14 40 

CCCAATCCTG CCAAAGAATA TGTTCTGATC AACCTACCCA AAGAAGGGGG GCACGAGGCA 1500 

GTCGTATACG ACATGCAGGG CCGAATCGTG GAGAAAGTTT CATTTTCAGG GAAAGAATAT 1560 

AAGCTGAATG TGCAGTATCT GTCCAAAGGT ACGTACATGC TGAAAGTTGT AGCGGATACG 1620 

GAGTATTTCG TGGAAAAAAT CATTGTAGAG 1650 



<2J INFORI1ATIOM FOR SEQ TD HO: 127 



II) SEQUEUCR CHARACTERISTICS: 
15 (A) LENGTH: 1170 base pairs 

( B ) TYPE: nucleic acid 

(C) STHANDEDNESS: double 
<D1 TOPOLOGY: circular 

20 (ii) MOLECULE TYPE: DHA {genomic} 

{iil> HYPOTHETICAL: NO 

<lv> AUTI-SEHSE: MO 



(vl> ORIGINAL SOURCE: 

(A) ORGANISM; PORY PHY ROMON AS CINGIVALIS 



( i.X ) rEATURE : 
30 (A) NAME/ KEY : misc_ feature 

(B) LOCATION 1...H70 

(xi) SLQUEUCE DESCRIPTION: SE0 It) NO; 127 

35 ATGAAACGAT TACTCCCCTT TCTCCTTTTA GCAGGACTCG TACCCGTAGG AAACGT GTCT 60 

GCTCAGT CAC CCCGAATCCC TCAAGTGGAT GTACACACTC GCATCGCAAG AAATGCCCGT 120 

TATCGACTGG ACAAGATCAG TGTCCCGGAT TCTCGTCAGA TATTCGATTA CTTCTATAAA 180 

GAAGAAACGA TACCCACTAA AATACAAACG ACCACAGGAG GTGCAATTAC AAGCATCGAT 24 0 

TCGCTTTTCT ATGAAGACGA CAGGTTGGTT CAGGTGCGCT ATTTTGACAA TAACCTTGAA 300 

40 TTAAAACAAG CGGAGAAGTA TGTATACGAC GGTTCTAAGC TGGTCCTTCG AGAAATTCGC 360 

AAGTCGCCGA CAGACGAAAC GCCAATAAAG AAAGTTAGCT ATCACTATCT CTGTGGCAGC 420 

GATATGCCTT TTGAGATTAC GACAGAGATG AGCGATGGCT ATTTTGAAAG CCATACGCTT 4 80 

AACTATCTGA ATGGAAAGAT TGCCCGAATA G AT AT CAT G A CTCAACAGAA CCCATCGGCC 54 0 

GAATTGATCG AAACGGGTAG AATGGTATAT GAGTTTGATG CCAATAATGA TGCTGTACTG 600 

45 CTTCGTGACA GTGTATTTCT TCCTCTTCAA AACAAGTGGG TAGAAATGTT TACT C ACCGT 660 

TATACATACG ACAATAAGCA TAATTGTATT CGTTGGGAAC AAGACGAATT CGGCACCCTC 720 

ACCCTTGCCA ACAACTTCGA ATACGACACC ACTATCCCTC TGTCGTCTGT ATTGTTCCCC 780 

ACGCATGAGG AGTTCTTCCG TCCTCTTCTT CCCAATTTTA TGAAGCATAT GCGTACGAAG 84 0 

CAAACGTATT TCAATAACTC CGCAGAAGGC TTGTCAGAGG TATGCGATTA CAACTACTTC 900 

50 TATACCGATA TGCAGGGTAA TGCACTGACC GATGTTGCCG TGAACGAATC GATCAAGATT 960 

TATCCTCGTC CTGCCACGGA TTTTCTGCGT ATAGAAGGTT CGCAACTGCT TCGCCTTTCG 1020 

CTATTCGACA TGAACGGGAA GCT CATCAGA GCTACCGAAT TGACAGGCGA TTTGGCCATT 1080 

ATCGGAGTTG CATCTCTTCC GAGAGGCACT TACATCGCAG AAATAACTGC TGCAAACAGC 1140 

AAAACCATAC GTGCAAAAGT ATCGCTCAGA 1170 



t2) I M FORMAT I ON FOR SEQ ID MO: 128 



(i) SEQUENCE CHARACTERISTICS: 
60 <A> LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 
{C) STRAHDF.DNESS: double 
(D) TOPOLOGY: circular 

65 (ii) HOLECULC TYPE: DMA (genomic) 

UiiJ HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 



fvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



(i>:) FEATURE: 
75 (Al NAME/ KEY: misc feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



„ji j *yH nTM ,JL ItZ 



W.J 'O £ « FuJ IL3 



WO 99/29870 



PCT/AU98/01023 



88 / 490 



i&] LOCATION 1. . .1233 
(Xi> SEQUENCE DESCRIPTION: SEQ ID NO:128 



10 



15 



20 



25 



ATGAGACAGC 
GTCGGTTGCA 
TCTTTGGACG 
ATGGCACAGG 
AATCATTACA 
TTTGTGGATT 
GGATTTTGGG 
GTCTACTTTT 
AAATTCTCCT 
CTGCCCGAAA 
GGACACAGCC 
GTGGTCGGCT 
ATCATAGTAT 
CATTATCTGC 
CCGAAGCCCT 
GTTTACCGCA 
GAACTGGTCG 
CGTCCGGGAG 
GCTGTCGGAC 
TATCCGCTCA 
TTGCTGAACG 



ATTTATCTCT 
GAACAGTCCG 
GCTTCCGACC 
AGGGATTGAG 
GCATGGCTAC 
CGCTACTGGG 
GCGGCGAGCC 
GGGT AGGAT C 
CCACCGTTCC 
AG GAG CG AC C 
AAACGCCCGA 
ATTTCCGCAA 
CCGATCACGG 
CTGCGGACAG 
CCTATACCGA 
AGGGGGAGGT 
TGATTCCGGA 
CGGCACATGG 
CCGATTTCCG 
TATGCAGGCT 
GCCTGATCCG 



ATTTCCTTTT 
ACAAACACCT 
GGACTATACC 
CGGGTCGCTC 
GGGGCTTTAC 
CATCTTTCGT 
GGTTTGGAAT 
CGAAACGGCT 
GTT TCGTGAC 
GCGCTTGCTC 
AAGCCCGCTG 
GCGGTTGCAC 
TATGGCCACG 
TTTCCTCTAC 
GCGAGCCTAT 
GCCCAAGCGT 
CATAGGCTCC 
CTATGACAAC 
TCCGGGCAGT 
GTTGGGTATA 
AGACAAACGA 



ATCTTGTTTC 
AAGCAGTCGG 
GATCGGGCAC 
CAACCATGCT 
CCCGATCATC 
ATATCCGACC 
ACGGCCGCAC 
GTGAACGGAA 
CGTGCCGACT 
ATGTGGTACA 
ACACTGGCAA 
TCTCTGCCCA 
TACGAAAATG 
ATGGCCACCG 
GAGATCCTGC 
TTGCGCTGTG 
AOCGTCTTTT 
CAAGCACCGG 
AGGGTGGAAA 
GAGCCTGCAC 
CCA 



TGCTTCTTGC 
AACGGTACGT 
GTACACCGGC 
TCCCCTCGCT 
ACGGTATCGT 
GAAAAGCCGT 
GCCAAGGCAT 
ATCGGCCGTG 
CCGTCATCGC 
TCGAGGAGOC 
TGGTAGAGCG 
TAGCCGCACA 
AGAAATGTGT 
GGGCCTTCAC 
GGGCCATTCC 
GCACCAATCC 
TCGCAATAAA 
AAATGCGGGC 
ACCTGCCGAA 
CCAACGAT GC 



CTTCTCTTAT 
CGTAGTCCTG 
GTTGGATCGG 
TACATTTCCC 
AGCCAATGAG 
GGAGACCCCC 
CCGTACCGGT 
GCGGTGGAAA 
GTGGCTCGGA 
GuATATGAT C 
GTTGGACAGT 
GACCGACTTC 
CAATCTGTCG 
CCACTTGTAC 
ACATATATCG 
TCGTTTGGGC 
TGAAGACGTT 
TTTACTCCGG 
TATCACCATC 
GGACGAAACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
12C0 
1233 
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(2) INFORMATION FOR SEQ ID HO: 129 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 
<B) TYPE: nucleic acid 

(C) STRA1IDEDNESS: double 

( D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: HO 
(iv) ANTI-SENSE: NO 



45 



(vi) ORIGINAL SOURCE; 

iA) ORGANISM: PORYPHYROMONAS GINGIVALIS 

{ ix ) FEATURE : 

(A) NAME/ KEY; misc_£eature 
(B } LOCATION 1. , .738 



50 



55 



60 



65 



Ixi} SEQUENCE DESCRIPTION : S£Q ID HO: 129 



ATGAAAGTAG 
GCCACGTACA 
TGTTGCGGCC 
CGATTCGAAG 
GCTTTCGTGA 
GCAGCCAAGG 
CCCTCCCGAT 
CATCTGTCCA 
GAGATGGTGC 
GGCGGTATGT 
CTGGATCACA 
CATATGCAGG 
ATTTTAGCAG 



GTTTGTTCAT 
AACTGCTGAA 
AGCCTATGGC 
AGCTGTTCGA 
AAGAAAACTA 
TTCGGGATAT 
TCGCCCATAA 
CCCCCAGTGA 
AGGGCATAGA 
ACTCGGTGGA 
TATCCACAGG 
GAGTGATAGA 
CAAACTTA 



CCCCTGTTAT 
GAGTTTGGAC 
CAATGCCGGA 
GTCGTAT GAT 
TGATCATATC 
ATGCGAGTTC 
GGTGAGCCTG 
AGTGCACCGA 
GGTATTCGAG 
GGAGCCGGAG 
TGCGGAGTAC 
CAGAGAGAAA 



GTCAATGCAG 
ATAGATGTCG 
TTCGAACAGA 
GTAGTCGTAG 
CTCAGACCGA 
TTGCACGATG 
CACAACAGTT 
CCGTACCACA 
CCCAAGCGAA 
GTATCCACCT 
ATCACAGGGC 
TTGCCGATCA 



TGTATCCGGA 
ACT ACCCG AT 
AAGCTCAAAA 
GGCCATCGGC 
CAGGACATGT 
ACCTGAAGAT 
GCCACGGTGT 
ACAAGGTGCG 
•rAGACGAATG 
GTATGGGGCA 
CGGACAGCTC 
AGACAATTCA 



AGTGGGTATC 
GGATCAGACA 
GCTGGCTTTG 
CAGTTGCGTT 
CTGCAAGTCG 
CACCAGCCTC 
GCGCGAACTG 
CCGGCTATTG 
CTGCGGTTTC 
TGACAAGGTG 
GTGCCTCATG 
TGCAGTAGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
738 



(21 INFORMATION FOR SEQ ID NO: 130 



70 



ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDNESS: double 
t D) TOPOLOGY: circular 



75 



t 1 i } MOLECULE TYPE: DNA (genomic) 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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89 /490 

(Hi) HYPOTHETICAL: HO 
(lv) ANTT -SENSE: NO 

<vi> ORIGINAL SOURCE : - 

|A> ORGANISM: PORYPHYROMOHAS GINGIVALIS . " . . 



(ix) FEATURE: 
^ _ (A) NAME/KEY: nUac feature 

10 (B) LOCATION 1...738 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 130 

ATGGATATTG TAAGTATGGC CGATAAAGCT CTTGTAGTGG AGATGAGAGA TGTGACGCTC 60 

XO TGTCAGGAGG AAAACGTCAT TTTTCAAAAT TTGAATCTGA CCCTTTCCGC CGGAGACTTC 120 

GTCTATCTGA TAGGCTCAGT GGGATCGGGG AAGAGCACTT TGCTGAAGCC TTTGTATGCT 180 

GAGGTGCCTA TCTCTGCCGG TTATGCCCGC GTGATAGATT ATGATCTGGC AAAGTTGAAA 240 

CGGAAGCAGT TGCCCTATCT GCGCAGGAAT TTGGGCATTG TGTTTCAGGA TTTCCAGTTG 300 

CTGAACGGAC GTACTGTTGC GGAGAATTTG GATTTCGTTT TGCGAGCTAC GGACTGGAAA 360 

ZU AACCGAGCCG ATCGCGAGCA GCGTATCGAG GAGGTTTTGA CCCGTGTGGG AATGTCTCGG 420 

AAGGCTTATA AGAGACCGCA CGAACTGTCC GGAGGGGAGC AACAACGTGT GGGTATAGC^ 4 80 

AGAGCTTTGC TGGCGAAGCC TGCGTTGATC CTGGCCGACG AACCCACAGG CAACCTCGAT 540 

TCGGTGACCG GATTGCAGAT CGCTTCTCTG CTCTACGAAA TCAGTAAGCA GGGCACTGCA 600 

GTACTTATGA GCACGCACAA CAGCAGCCTG CTGTCGCATC TGCCGGCACG GACATTGGCC 660 

<£0 GTTCGTAAGA ATGGCGATGC CTCCfCTTTG GTCGAGCTGA GTGCAGATGC TGTTTCAAGA 720 

AAAAATACGG AAATAGAT 7 3g 



(2) INFORMATION FOR SEQ ID NO: 131 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 

< i i i > HYPOTHETICAL : NO 

(iv> ANTI-SENSE: NO 

Ivi J ORIGINAL SOURCE: 

tA) ORGANISM: PORYPHYROMOMAS GINGIVALIS 

t ix ) FEATURE : 

(A) NAME/KEY: misc_f catur= 

(B) LOCATION 1...723 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 131 



ATGGCCGATA AAGCTCTTGT AGTGGAGATG AGAGATGTGA CGCTCTGTCA GGAGGAAAAC 60 

GTCATTTTTC AAAATTTGAA TCTGACCCTT TCCGCCGGAG ACTTCGTCTA TCTGATAGGC 120 

TCAGTGGGAT CGGGGAAGAG CACTTTGCTG AAGGCTTTGT ATGCTGAGGT GCCTATCTCT 180 

&b GCCGGTTATG CCCGCGTGAT AGATTATGAT CTGGCAAAGT TGAAACGGAA GCAGTTGCCC 240 

TATCTGCGCA GGAATTTGGG CATTGTGTTT CAGGATTTCC AGTTGCTGAA CGGACGTACT 300 

GTTGCGGAGA ATT TGGATTT CGTTTTGCGA GCTACGGACT GGAAAAACCG AGCCGATCGC , 360 

GAGCAGCGTA TCGAGGAGGT TTTGACCCGT GTGGGAATGT CTCGGAAGGC TTATAAGAGA 420 

CCGCACGAAC TGTCCGGAGG GGAGCAACAA CGTGTGGGTA TAGCCAGAGC TTTGCTGGCG 480 

AAGCCTGCGT TGATCCTGGC CGACGAACCC ACAGGCAACC TCGATTCGGT GACCGGATTG 540 

CAGATCGCTT CTCTGCTCTA CGAAATCAGT AAGCAGGGCA CTGCAGTACT TATGAGCACG 600 

CACAACAGCA GCCTGCTGTC GCATCTGCCG GCACGGACAT TGGCCGTTCG TAAGAATGGC 660 
GATGCCTCCT CTTTGGTCGA GCTGAGTGCA GATGCTGTTT CAAGAAAAAA TACGGAAATA 
GAT 



(2) INFORMATION FOR SEQ ID HO: 132 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base paire 
(E) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

75 (ii) MOLECULE TYPE: DNA (genomic) 



720 
723 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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90 / 490 

{ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: PORYPHYROMONAS GINGIVALIS 
(Ix) FEATURE: 

«A) NAME/ KEY: misc feature 

( B ) LOCATION 1 . . . 696 



PCT/AU98/01023 



(X!) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

15 CTTTCCGCC^ ^rnS^S TCAGGAG ^ AACGTCATTT TTCAAAATTT GAATCTGACC 
SSSSS S™** 5 ™ 6 ^CGGGGAA GAGCACTTTG 

GATCTTGGCAA AGTTGAAACG SSSSES IS^ CCGGTT ATGCCCGCGT GATAGATTAT 

" ~ TTTCAGGATT S££552S GAACGG^AOGT ^^TTT GGGCATTGTG 240 

20 CGAGCTACGG ACTGGAAAAA £££SS ^JS" 6 ** TTTCG ^™ 300 

CGTGTGGGAA Trrrrrv=Paa ^™T^ CGCG AG CAGC GTATCGAGGA GGTTTTGACC 360 

caaS?^ gSSg^^ GG ^ ATAAG agaccgcacg AACTGTCCGG AGGGGAGCAA 420 

f CCCACAG^S SSS? ?™ CCTG «™TCCT GGCCGACGAA 430 

„ AGTAAGCAGG SSJSE? A^^I^ OTCTCTGC T CTACGAAATC 54 O 

25 CCGGCACGGA CATTGGCCGT tS^aI^ CA GCAGCCTGer GTCGCATCTG 600 

GCAGATGCTG S^S aS^ CCT ™GT CGAGCTGAGT 660 



<2) INFORMATION FOR SEQ ID MO: 133 



(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 657 base pairs 
<BJ TYPE: nucleic acid 
qc tc) STRAHDEDNESS: double 

03 <D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

. (ill) HYPOl'HETICAL: NO 

(iv) ANTI -SENSE : NO 

(Vi ) ORIGINAL SOURCE ; 
45 tA) ORGANISM: PORY PHYROT10NAS GINGIVALIS 

( ix ) FEATURE : 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1...657 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

£££££ SS££S£ SSI TTCAGAACAG AAGAAATAGA GACGGTAGCC 
GGATGCGGTA AGTcScTC^ rS^S^ GGCGAATTTA TCGCCATAAT GGGGCCTTCG 
55 ATCTACAAGC TCGATG^^rr* t?£^™ TC ^^CCmC TCGACAATCC CACTTCCGGT 
CGTAAGGGCA AT^TCG^ct5 C^JSSEr^ ** OTCC «* AAAAAGACAG GACTGCCGTC 
AGCGAGAACG T^A^T^C 2?2S2£S AGCTTCAACC TCATCGAAGA GATGACGGTA 
CGAGTGGAGG AGGC^C^r S25SS£ T ° TGGGTGTGA AGGCCTCCGA GCGGAAAGAG 
CAGCTOTCCG Sat^SS? ^^^C ATCAGCCACC GGGCCGGCCA CTTCCCCAAT 

60 aagScIt^c ?cgSg^g^ SSSSS 6 gctatcgccc gtgccgtggt ggccaat^g 
atggaactgc ^gag^ct ^^ ACr ccaaa aa^ agccgatgtc 

CACCACGATG «S SS^SS SSSS SSST* 



<2) INFORMATION FOR SEQ ID NO: 134 



Ul SEQUENCE CHARACTERISTICS - 

(A) LENGTH : 1785 base pairs 
7n *B) TYPE: nucleic acid 

<0 STRANDEDNESS: doubl- 
(D) TOPOLOGY: circular 

di) MOLECULE TYPE : DNA (genomic) 

75 (ill) HYPOTHETICAL: HO 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
657 
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(iv) ANTI— SENSE : HO 

tVi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

lix) FEATURE: 

(A) NAME /KEY: nusc_f ea ture 
IB) LOCATION 1...1785 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 



AT GAAAGAAT 
ATCTTGTTCT 
GCAACGGGAG 
TCTTTCCCTG 
GTATCGCTCT 
GGTATCTTCC 
CGCGCGTTGC 
ACCCAAAAGG 
AT GTTGGGGC 
TTCGGCGTGA 
ATGCTCAACA 
TGGGACAAGA 
ATGTTTGCCG 
GTGGATGAGC 
GGAGAGAAAG 
AACAAAACGA 
ATAATAAAGA 
ATCAAGGCAG 
GGAGGTAGTG 
AAGCCTATCG 
GCAGCCAACA 
ATGTTCCCGA 
ACATCCAAGT 
CTCATCCAAC 
CGCAACCGCA 
AAAGCTCTTG 
GCCGCGAAGC 
AACTTCTTCG 
ACCATTCTCT 
CCTTCGGGCA 



TTTTCAAAAT 
GTATCTTTCT 
GAACCATTCC 
AGATCGTATC 
CACAGGCAGT 
TCGATCTGGA 
AGGATTTCAA 
GTTACTACCT 
TTATCGGGAT 
AGATGGAGAT 
GGATGAGCGA 
TCACATCCGA 
ACAAAGGCGA 
TGGCTTACCG 
ATGAACTTCG 
AAGGCAGTCG 
AGCCGTTCGA 
CAG CCGATGA 
CTTTCACTTC 
TGGTCTCCAT 
GTATCGTGGC 
ACTTCCCGGG 
ATGCAGACTT- 
GCTACATAGA 
CCAAGGCACA 
CACTCGGTTT 
TGGCTCAGCT 
AAGAGTTGCT 
CCGATCCGGA 
TACAGGCACG 



GTTTTTCGCC 
ATTTATCTTT 
GAAGATCGAA 
GGCCAATCCC 
CGAAGCCATC 
CAACCTTTCC 
CATGTCGGGC 
CTCCAGTATT 
TGCGACCCAA 
CTTCAAGGTA 
TGCCAATCGC 
T ATT G CAG AG 
AATGTTCGGT 
TACCGATGTG 
GTTCGTATCG 
GATCGCCGTT 
CACTGACGGT 
CGATGATATC 
CGAACAGATA 
GGGCGACGTA 
AGAGCATACG 
CGTAGCCAAG 
GGGCAACACC 
GCAGGGCTAC 
GAT CGACAGC 
GGTGGATGAG 
CGGTGGCAAC 
CTCCTCATCA 
AATAGAAGTT 
TCTCCCCTAT 



TCGATCCTCG 
TTCGGCATCG 
GCAAACTCCA 
TGGAGCATGC 
GGCCAACCCA 
GTCGGTATGG 
AAGTTCGTCG 
GCAGACAAAC 
ACAATGTTCT 
GGCACCTACA 
GAACAAATCA 
TCGCGCAAGA 
CTTCICCCAGA 
GAGAAAGAAC 
CTTTCTCAGG 
CTCTTTGCCG 
AGCTCCATCA 
AAAGCCGTAG 
TGGAAGCAGG 
GCAGCCTCGG 
ACTCTGACCG 
AAGATAGGAG 
TTCGCTCCGA 
GACCTCTTCC 
ATCGCTCAAG 
CTTGGAGGTT 
TACAGCATAG 
GCAGCGGATA 
CTGCGCGAAC 
TACTTCATGC 



GGGTTATAAC 
TAGCCGGTAT 
TCCTACATAT 
TCACAGGCAA 
AAAATAATCC 
CATCGGCAGA 
TATCCTATGC 
TCTACCTCAA 
ACAAAGATGC 
AGGCAGCCGT 
CCACATACAT 
CGGCAATGGA 
AAGCGGTGGA 
TCAAAAAGAT 
TTCTGGCCAA 
AAGGTGAAAT 
CACAAGAACT 
TACTTCGTGT 
TAGCCGATCT 
GCGGATACTA 
GCTCCATCGG 
TGAATATGGA 
TGACGGTCGA 
TCACTCGCGT 
GCCGTGTATG 
TGGACACAGC 
AGTATGGCAA 
TGAAGTCTGC 
TCCGCTCCAT 
CGTAC 



GGCAGGAATC 
TGCCTCCAAG 
ANACAATTCT 
AGACGAGTCC 
CAACATAACC 
GGAATTGCGT 
CGACAGATAC 
TCCGAAAGGA 
CCTCGACAAA 
AGAGCCATTC 
AAACGGGCTT 
TTCCGTGAAA 
GATGAAGCTC 
GTCCCAACGC 
TGGCCCGATG 
AACGGAAGAA 
CGCCAAAGAA 
CAATTCTCCG 
CAAGGCCAAA 
CATAGCCTGC 
CATATTCGGC 
CGTCGTACAG 
AGATCGTGCC 
ATCGGAAGGC 
GCTCGGCGAC 
TATCAAACGG 
GACCAAGCGC 
CATCCTGAGT 
GCCGCCCCGT 



45 (2) INFORMATION FOR SEQ ID NO: 135 



(1) SEQUF.NCE C1IARACTEKISTICS: 

(A) LENGTH: 17 67 base pair' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GIHGIVAI.TS 
( ix ) FEATURE : 

(A) NAME/ KEY : raise feature 

(B) LOCATION 1 .17 67 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 

ATGTTTTTCG CCTCGATCCT CGGGGTTATA ACGGCAGGAA TCATCTTGTT 
CTATTTATCT TTTTCGGCAT OGTAGCCGGT ATTGCCTC^A Irr^ll^ 
^? ATCG ™ C ™*W CATCCTACA? C^S^ 
2SSS£ TC CCTGGAG ^ GCTCACAGGC AAAGACGACT C^raS 
rlrt^^ TCGGCCAAGC CAAAAATAAT CCCAACATAA CCG^AT^T 
GACAACCTTT CCGTCGGTAT GGCATCGGCA GAGGAATTGC GTCGCGC^T 

SSSSSS TTGCACACAA ^ CCTAT GCCG ^ ACAC^S 
CTCTCCAGTA TTGCAGACAA ACTCTACCTC AATCCGAAAG GAATGTT GGG 



CTGTATCTTT 
AGGAACCATT 
TGAGATCGTA 
CTCACAGGCA 
CCTCGATCTG 
GCAGGATTTC 
GGGTTACTAC 
GCTTATCGGG 



60 
12C 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
"720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1785 
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ATTGCGACCC 
ATCTTCAAGG 
GATGCCAATC 
GATATTGCAG 
GAAATG1TCG 
CGTACCGATG 
CGGTTCGTAT 
CGGATCGCCG 
GACACTGACG 
GACGATGATA 
TCCGAACAGA 
ATGGGCGACG 
GCAGAGCATA 
GGCGTAGCCA 
TTGGGCAACA 
GAGCAGGGCT 
CAGATCGACA 
TTGGTGGATG 
CTCGGTGGCA 
CTCTCCTCAT 
GAAATAGAAG 
CGTCTCCCCT 



AAACAATGTT 
TAGGCACCTA 
GCGAACAAAT 
AGTCGCGCAA 
GTCTTGCCGA 
TGGAGAAAGA 
CCCTTTCTCA 
TTCTCTTTGC 
GTAGCTCCAT 
TCAAAGCCGT 
TATGGAAGCA 
TAGCAGCCTC 
CGACTCTGAC 
AGAAGATAGG 
CCTTCGCTCC 
ACGACCTCTT 
GCATCGCTCA 
AGCTTGGAGG 
ACTACAGCAT 
CAGCAGCGGA 
TTCTGCGCGA 
ATTACTT CAT 



CTACAAAGAT 
CAAGGCAGCC 
CACCACATAC 
GACGGCAATG 
GAAAGCGGTG 
ACTCAAAAAG 
GGTTCTGGCC 
CGAAGGTGAA 
CACACAAGAA 
AGTACTTCGT 
GGTAGCCGAT 
GGGCGGATAC 
CGGCTCCATC 
AGTGAATATG 
GATGACGGTC 
CCTCACTCGC 
AGGCCGTGTA 
TTTGCACACA 
AGAGTATGGC 
TATGAAGTCT 
ACTCCGCTCC 
GCCGTAC 



GCCCTCGACA 
GTAGAGCCAT 
ATAAACGGGC 
GATTCCGTGA 
GAGATGAAGC 
ATGTCCCAAC 
AATGGCCCGA 
ATAACGGAAG 
CTCGCCAAAG 
GTCAATTCTC 
CTCAAGGCCA 
TACATAGCCT 
GGCATATTCG 
GACGTCGTAC 
GAAGATCGTG 
GTATCGGAAG 
TGGCTCGGCG 
GCTATCAAAC 
AAGACCAAGC 
GCCATCCTGA 
ATGCCGCCCC 



AATTCGGCGT 
TCATGCTCAA 
TTTGGGACAA 
AAATGTTTGC 
TCGTGGATGA 
GCGGAGAGAA 
TGAACAAAAC 
AAATAATAAA 
AAATCAAGGC 
CGGGAGGTAG 
AAAAGCCTAT 
GCGCAGCCAA 
GCATGTTCCC 
AGACATCCAA 
CCCTCATCCA 
GCCGCAACCG 
ACAAAGCTCT 
GGGCCGCGAA 
GCAACTTCTT 
CTACCATTCT 
GTCCTTCGGG 



GAAGATGGAG 
CAGGATGAGC 
GATCACATCC 
CGACAAAGGC 
GCTGGCTTAC 
AGATGAACTT 
GAAAGGCAGT 
GAAGCCGTTC 
AGCAGCCGAT 
TGCTTTCACT 
CGTGGTCTCC 
CAGTATCGTG 
GAACTTCGCG 
GTATGCAGAC 
ACGCTACATA 
CACCAAGGCA 
TGCACTCGGT 
GCTGGCTCAG 
CGAAGAGTTG 
CTCCGATCCG 
CATACAGGCA 



540 
600 
660 
720 
780 
640 
900 
960 
1020 
lOflO 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1767 
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(2) INFORMATION FOR SEQ ID NO: 136 



30 



(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 939 base pairs 

( B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

( D) TOPOLOGY: circular 



lii) MOLECULE TYPE: DMA (genomic) 
35 (iii) HYPOTHETICAL*. MO 

[ivj AMTI— SENSE : NO 



40 



(vi) ORIGINAL SOURCE: 

(A J ORGANISM: PORYPHYROMONAS GINGIYALIS 



45 



50 



55 



60 



65 



Ux) FEATURE: 

(A) NAME /KEY ; ml sc_f eature 
(D) LOCATION 1. . . 939 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 



ATGAGAGCAA 
ATCGGACACG 
GAAGCTATCC 
TT6GTGGAGA 
ATCAAATGCC 
GAATGCTTCC 
CTCCAACCAC 
ACGGGGTTGC 
ATGGTGGAGCJ 
AAGTCTTACC 
CTCAAGCACG 
ACCTTCGATT 
GATGCTCCCG 
GCATGCTCCA 
ATCTCCCAAC 
GAATATATAC 



ACATTTGGCA 
CTCAGAGTCG 
GACAGATGAG 
CAGGAGCCGG 
ACAAATCGTG 
GCAGCTATTC 
GCTATCGTCC 
AACGCTGTGG 
TGTATGAGCT 
CCGGGTCCAA 
AAGCTTACTT 
CTTTGGCCGA 
TGGATTTCCC 
TCTCCAAACA 
GCTATGGCAT 
CCCAAGAGGG 



GATACTTTCC 
AAACCGTACA 
CCGCTACAAT 
AGCCAGTACA 
GACCGGCAAG 
GGCCGCTCGC 
CCTGTTCAAA 
CTATGCCACC 
ATATGCTTTG 
CAAAAAATCC 
CAGCTACGGA 
AGAGTTCGAC 
GATCGAAAAG 
CACACAGCAC 
CCGGATGAAG 
CGATATACT'G 



GTTTCGGTTC 
TACGAGGCTT 
ATACCGGCAA 
CTGGCCAGCG 
CGCACCTATC 
GAATCGTATG 
CTCGACAGAG 
AATCGGGGCT 
GATCGCGAGA 
CAT CAAACG A 
CTGCTCTACA 
ATGAGAGCCT 
GGCGATGTGA 
GTAGTGCGTG 
AACCTCTACA 
CGCTTGCGC 



TCrTTTTCTT 
ATGTGAAACA 
GCATCACCAT 
TACACAACAA 
GTACCGACGA 
AAGATCATTC 
AAGACTATCG 
ATGCCAATCT 
AGl'ACCCCTC 
CCAAGCAGAA 
TCATAGCCAA 
CCAAACTGGC 
TCTATCTGGA 
TGGGCGATTC 
AGCTCAACGA 



CGGGACAGCG 
GTACGCCGAC 
AGCACAGGCT 
TCACTTCGGG 
TGCGCCGAAC 
CCGATTTCTG 
GGGCTGGGCT 
GCTGATCAAG 
ATGGTTCCAC 
GCAGAGCGGA 
GCAAGGCGAT 
CAAATACAAC 
GAAAAAGCAC 
GAT GCACAGT 
CAAGGATGGC 



60 
120 
1B0 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 
840 
900 
939 



(2) INFORMATION FOR SEQ ID NO: 137 



70 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1569 base pairs 

( B) TYPE: nucleic acid 

(C) STRAW DEDNESS: double 
t D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DNA (genomic) 
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(Hi) HYPOTHETICAL: NO 
(iv> ANTI- SENSE: NO 



10 



15 



20 



25 



30 



35 



40 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

[ Ix ) FEATURE : 

(At NAME/ KEY: misc feature 
ID) LOCATION 1...1569 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 



ATGGACGGAC 

AAAGTAGAGG 

TATCATAAGC 

ATCTACAAAC 

AATGATCGTA 

CGTCTTGTCG 

TTGTTGAGCC 

CATCAGAAGG 

ACGAACATGG 

GGTTTGGCTA 

GGTCGTTCCG 

CATCTGATGC 

AACGAAGGAG 

AAGGTGGAAG 

AATTCGCAGC 

GGTATGGATG 

GAAGCCAATG 

CCTATCACGC 

GATGAAGCTT 

GTGGTTCATC 

ACGGATGGTA 

GGGAAAAGCA 

TGGAAAGAAC 

GAAAACAAAG 

GTAGAAGCCA 

GAGGCTCCTT 

CAACAACGT 



GTCGATATTC 

CTGCGACACA 

TGGCAGGGAT 

TGGACGTTGT 

TCTATAAGAC 

AAGAGGGCAG 

GTATGTTACG 

AGGCCGAGAT 

CCGGTCGTGG 

T CAT T GG T AC 

GCCGTCAGGG 

GCCTCTTTGC 

AAGTGCTCGA 

AGAACAACTT 

GTGAAGTCAT 

TACTCAATAC 

ATTTCGAAGG 

AAGAAATATT 

ACAAGTCTTT 

AGGTATTCGA 

AACGTGTCTA 

TCATCAAAGA 

ATCTGCGTGA 

AT CCA CT ACT 

TGAACCGTAA 

CCCAAGAAGA 



GGATCGCCTC 
GACATTTGCA 
GACCGGTACT 
AGTTATTCCG 
GGCACGTGAA 
ACCTGTACTT 
CTTGCGTGGC 
TGTAGCTCAG 
TACCGACATC 
GGAAAGGCAC 
TGATCCCGGT 
CACAGAAAAG 
AAACAACATG 
CGGTATCCGT 
TTATACCCGT 
CATATACGAC 
CTTCAAGGAA 
CAGAGGTAAG 
CCAACGTAAG 
GACCCAAGCC 
TAACATAGGA 
ATTTGAGAAA 
GATGGACGAG 
TATCTATAAA 
GACCGTAGCG 
GCTGGAACAC 



CATC AGG CT A 

ACTATCACTT 

GCTGAAACTG 

ACAAACAAGC 

AAATATGCAG 

GTCGGTACTA 

ATCCAACACA 

GCCGGTCAGA 

AAGCTCTCTG 

GAATCCAGAC 

TCGTCCATAT 

ATTGCATCAT 

CTGAGTAAGT 

AAACATCTGC 

CGCCGTCATG 

GTATGTAAGG 

GATCTGATGC 

AAAGCAGAAG 

ATGGATCTGA 

GCCGTGTACG 

TGCAATTTGC 

GCTATCGTAC 

CTTCGTAATT 

CTCGAATCTT 

ATCCTAATGC 

AGGCSGCAAA 



TCGAAGCCAA 

TGCAGAACTA 

AAGCGGGAGA 

CTATCGCCCG 

CAGTTATCGA 

CTTCGGTGGA 

ATGTACTCAA 

AAGGAACTGT 

CCGAGGTTAA 

GAGTGGACAG 

TCTATGTTTC 

TGATGGATCG 

CCGTGGAGCG 

TTGAGTACGA 

CTTTGATGGG 

CTCTGATTGA 

GTGCACTCGC 

AGCTGACCGA 

TCGCAGAAGT 

AGCGCATTCT 

GTGAAGCGGA 

TGCATACTAT 

CCGTTCAGAA 

ACGAACTGTT 

GTGCTCGGAT 

TAGAAATCCG 



AGAGCATGTG 
TTTCCGCATG 
GCTTTGGGAC 
TAAGGATATG 
AGAGATTGTA 
AATATCCGAA 
TGCCAAATTG 
TACCAT CGCA 
GAAAGCCGGG 
ACAGCTTCGT 
CCTTGAAGAT 
TTTAGGTTTC 
TGCTCAAAAG 
TGATGTAATG 
AGAGCGTATC 
CAATTATGCA 
GATAGAATCT 
TATGCTTTTC 
GGCCCACCCT 
AATCCCCATT 
TGAAACTCAA 
CGATGAGTCT 
TGCCAGCTAC 
CCGCAAGATG 
ACCGGTACCG 
ACATGCAACC 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1569 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 138 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 
fB> TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPF.: DMA (genomic) 

(111) HYPOTHETICAL: NO 

(iv) ANTI— SENSE : WO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GIWGIVALIS 

(lX) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1125 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 138 

ATGAATTTCT TAAAAAAAGA ACCGTTTAAA ATATTCTCTA TGATTTATCT 
ACAATAACAA ACCGTGCCGG TACAGAACGC GCCGTGATCA ACTTGGCTAA 
S?^ TGGTC ATCGCGTATC ATTAGTCAGC GTTT GTACAA AAGAAGGAGA 
CAAGTAGAAA AAGGAATAGA AGTACACCAT CTCGGAATTA GGCTTTATGG 
GCCCGCAAAA CAGTATATTT CAAGGCTTAT CGAAGGATAA AAGCCCTATA 
GAACCGGTTT TATTGATAGG GACTAATATT TTTATCAATA CAATTTTGTC 
AACAGAGGCA GAATATTTAC GATCGG AT GC GAACATATCT CTTATGATA'" 
ATTACAAAAC GCATAAGGGG GTTTCTGTAT TCAGGGCTTG ATGCCGTTG"" 
AAAAGAGATC AGCAATCGTT CGAGGCAATC TTACGTGGAC GCTCTAAAG" 
CCCAATCAAG TTTCATTTAC TACAGTCCAA AGAGATGCTA CTACTCACAA 



GCTGTTAGAT 
CAACCTGCAT 
GCCTTCCTTC 
CAATG CATT A 
CAAGAAGCGT 
TCAGATCAGT 
TGCCCGCCCT 
AGCACTGACA 
ATATGTCATA 
ACAAATGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



O V S Hi Ji k~'ci ab u sr. n" *» ^ , ,^L^ . , 



10 



WO 99/29870 PCT/AU98/01023 

94 / 490 

GCGATTGGCA GGCTTACCTA CCAGAAGGGT TTTGAATTCA TGATAGAAGA TGCATCACGA 660 
GTGCTGCGAG AAAGGCCTGA TTGGAAGCTT ATCATAGTCG GAGATGGCGA AAATGAATCG 720 
ATGCTACGTA AAGAAATTGC ATCTCGCAAT ATGGAGTCGC AAATAGAAAT ACATCCATCT 180 
ACACCGGAAA TTCGCAAATA CTACGAATCA TCTGCTATTT ATCTAATGAC GTCCCGTTTC 84 0 

GAAGGACTAC CAATGGTACT TCTCGAAGCA GAAGCATATG CACTACCTAT AATCTCATAC 900 
GATTGTCCGA CCGGCCCGAG GGAACTGATC GAAAACGGTC GCAATGGTTT CCTTGTGCCA 960 

ATGGAAGCAC ATGAAGACTT CGCGGATAAG TTACGCTTAT TGATGGATGA TGAAACTCTT X020 

CGTAAGAAAA TGGGACAAGA ATCAGAGTTG ATGGTCAAAT CCTACTCTCC GGCAAATATC 1000 

TATGAATGTT GGAAGAAACT ATTCGTCGAA ATCGGCTACA TGAAT 1125 

(2) INFORMATION FOR SEQ ID NO: 13 9 



(1) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1086 base pairs 

(Bl TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

20 . <ii> HOLECULE TYPE: DMA (genomic) 

Uii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 



55 



65 



(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOWAS GINGTVALIS 



ilr.) FEATURE: 
30 (A) NAME/ KEY: miscj eature 

(B) LOCATION 1...1086 

ixi) SEQUENCE DESCRIPTION: SEQ ID MO: 139 

35 ATGATTTATC TGCTGTTAGA TACAATAACA AACCGTGCCG GTACAGAACG CGCCGTGATC 60 

AACTTGGCTA ACAACCTGCA TGCCAATGGT CATCGCGTAT CATTAGTCAG CGTTTGTACA 120 

AAAGAAGGAG AGCCTTCCTT CCAAGTAGAA AAAGGAATAG AAGTACACCA TCTCGGAATT 180 

AGGCTTTATG GCAATGCATT AGCCCGCAAA ACAGTATATT TCAAGGCTTA TCGAAGGATA 240 

AAAGCCCTAT ACAAGAAGCG TGAACCGGTT T TATTG AT AG GGACTAATAT TTTTATCAAT 300 

40 ACAATTTTGT CTCAGATCAG TAACAGAGGC AGAATATTTA CGATCGGATG CGAACATATC 360 

TCTTATGATA TTGCCCGCCC TATTACAAAA CGCATAAGGG GGTTTCTGTA TTCAGGGCTT 420 

GATGCCGTTG TAGCACTGAC AAAAAGAGAT CAGCAATCGT TCGAGGCAAT CTTACGTGGA 4 80 

CGCTCTAAAG CATATGTCAT ACCCAATCAA GTTTCATTTA CTACAGTCCA AAGAGATGCT 54 0 

ACTACTCACA AACAAATGTT GGCGATTGGC AGGCTTACCT ACCAGAAGGG TTTTGAATTC 600 

45 ATGATAGAAG AT GCATCACG AGTGCTGCGA GAAAGGCCTG ATTGGAAGCT TATCATAGTC 660 

GGAGATGGCG AAAATGAATC GATGCTACGT AAAGAAATTG CATCTCGCAA TATGGAGTCG 720 

CAAATAGAAA TACATCCATC TACACCGGAA ATTCGCAAAT ACTACGAATC ATCTGCTATT 780 

TATCTAATGA CGTCCCGTTT CGAAGGACTA CCAATGGTAC TTCTCGAAGC AGAAGCATAT 840 

GCACTACCTA TAATCT CATA CGATTGT CCG ACCGGCCCGA GGGAACTGAT CGAAAACGGT 900 

50 CGCAATGGTT TCCTTGTGCC AATGGAAGCA CATGAAGACT TCGCGGATAA GTTACGCT T A 960 

TTGATGGATG ATGAAACTCT TCGTAAGAAA AT GGGACAAG AATCAGAGTT GATGGTCAAA 1020 

TCCTACTCTC CGGCAAATAT CTATGAATGT TGGAAGAAAC TATTCGTCGA AATCGGCTAC 1080 

ATGAAT 108 6 



(2) INFORMATION FOR SEQ ID NO: 14 0 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 
60 (B> TYPE : nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



70 |vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

(lx) FEATURE: 

(A) NAME/ KEY; mi ec_ feature 
75 (B» LOCATION 1...1920 
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(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 140 

ATGGGAAAAA TCATTGGAAT TGACTTAGGC ACAACGAACT CTTGTGTCTC TGTATTGGAA 60 

5 GGTAACGAAC CTATCGTTAT TACAAACAGT GAGGGCAAGC GCACAACGCC CTCGGTAGTG 120 

GCTTTTGTGG ATGGTGGCGA GCGTAAGGTG GGCGATCCGG CCAAGCGTCA GGCCATCACC 180 

AATCCGACCA AGACGATATA CTCTATCAAA CGCTTCATGG GCGAAACTTA CG ATCAGGTT 240 

TCCAGAGAAG TGGAGAGAGT GCCATTCAAG GTAGTACGTG GGGACAATAA TACTCCGCGC 300 

GTAGATATAG ACGGTCGTCT CTATACGCCG CAGGAAATTT CGGCCATGAT CCTTCAGAAC 360 

10 ATGAAGAAGA CGGCCGAAGA CTACCTCGGT CAGGAAGTAA CGGAGGCCGT GATCACTGTG 4 20 

CCCGCATACT TCAACGACGC TCAACGTCAG GCAACGAAAG AAGCAGGAGA GATCGCCGGC 4 80 

CTGAAAGTTC GCCGTATTGT GAACGAGCCT ACGGCAGCTT CTCTGGCCTA CGGTCTGGAC 54 0 

AAGTCCAATA AGGATATGAA GATCGCTGTC TTCGACTTGG GTGGCGGTAC CTTCGATATC 600 

TCTATCTTGG AATTGGGCGA CGGCGTTTTC GAAGTGAAAT CGACCAACGG TGATACGCAC 660 

15 CTCGGAGGAG ACGACTTCGA CCACGTGATC ATTGACTGSC TGGCAGAAGA GTTCAAGTCT 720 

CAGGAAGGTG TGGATCTTCG CCAGGATCCT AT GGCT AT GC AGCGTCTGAA AGAAGCTGCC 780 

GAAAAAGCCA AGATAGAGCT CTCCAGCACT TCATCTACGG AGATCAACCT CCCCTATATC 94 0 

ATGCCGGTGA ACGGCATCCC CAAGCACTTG GTGATGACGC TTACAAGGGC TAAGTTCGAG 900 

CAGTTGGCCG ATCGTCTGAT TCAGGCATGT GTGGCACCCT GCGAAACGGC CTTGAAAGAT 960 

20 GCCGGTATGT CACGTGGCGA TATCGATGAA GTGATTCTCG TAGGTGGTTC CACACGTATT 1020 

CCTGCTATTC AGGAGATTGT GGAGAAGATC TTCGGTAAGG CTCCGTCCAA GGGTGTGAAT 1080 

CCCGACGAAG TGGTAGCTGT GGGTGCCGCT ATTCAAGGCG GTGTTCTGAC CGGTGAGGTA 114 0 

AAGGATGTCT TGCTGTTGGA CGTTACCCCC TTGTCGCTC'3 GTATCGAGAC TATGGGAGGC 1200 

GTGATGACTC GCTTGATCGA TGCCAATACC ACTATCCCGA CGAAGAAGAG CGAAATCTTT 1260 

25 ACCACAGCAG TGGACAATCA ACCTTCGGTA GAGATTCATG TACTTCAGGG TGAGCGTTCT 1320 

TTGGCTAAGG ACAATAAGAG CATCGGCCGT TTCAACTTGC ACGGTA'l'TGC TCCGGCGCCC 1330 

CGTCAGACAC CGCAGATCGA AGTAACGTTT GACATCGATG CCAACGGTAT CCl'GAATGTA 144 0 

ACGGCTCATG ACAAAGCT AC CGGCAAGAAG CAGAATATCC GCATCGAAGC CTCCAGCGGT 1500 

TTGTCCGATG AT GAG AT CAA GCGCATGAAG GAAGAGGCGC AGGCCAATGC CGAAGCAGAT 1560 

30 AAGAAAGAGA AAGAACGTAT CGACAAGATC AAT CAGGCCG ACAGCATGAT CTTCCAGACG 1620 

GAAAAGCAGT TGAAGGAGTT GGGAGACAAA TTCCCGGCCG ACAAGAAGGC TCCGATCGAT 1680 

ACCGCTCTCG ACAAACTGAA AGAAGCACAC AAAGCACAGG ATGTAGCTGC TATCGATACA 17 40 

GCCATGGCCG AACTGCAAAC CGCTCTTTCC GCAGCGGGCG AAGAGCTTTA CAAGAATGCC 1800 

GGAGCAGCCC AAGGTGGCGC ACAACCCGGT CCGGACTTCG GCGGTGCTCA AGGTCCCTCT I860 

35 GCCGGTGATC AGCCCTCTGA CGACAAGAAC GTCACAGACG TAGACTTCGA GGAAGTGAAG 1920 

(2) INFORMATION FOR SEQ ID NO: 141 

40 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: circular 

45 

(ii) MOLECULE TYPE: DNA (genomic) 

(iiij HYPOTHETICAL: HO 

50 (iv) ANT1-SEHSE: NO 

Ivi) ORIGINAL SOURCE: 

(A) ORGAHIS11: PORYPHYRCHOHAS GIHGIVALIS 

55 (ix> FEATURE i 

(A) NAME/ KEY: mlscj eature 
(0) LOCATION 1...1347 



60 



65 



(Xll SEQUENCE DESCRIPTION: SEQ ID NO: 141 



ATGCGCTACG ACTTAGCTAT CATCGGTGGA GGGCCGGCCG GTTATACGGC TGCCGAACGT 60 

GCTGCCAAAG GTGGCCTGAA AACCCTCCTA ATTGAGAAGA ATGCTCTCGG TGGTGTATGC 120 

CTCAACGAAG 6ATGTATACC GACCAAGACG CTACTCTACT CGGCCAAAGT GCTACATCAA 100 

ATTGCTACGG CATCTAAATA TGCAGTAAGT GGAACGGCCG ATGGACTTGA CCTCGGCAAG 240 

GTGATTGCCA GAAAAGGTAA AATCATTCGC AAGCTGACTG CAGGCATCCG TTCACGCCTG 300 

ACAGAGGCCG GAGTAGAGAT GGTGACGGCA GAAGCTACCG TAACGGGATG CGATGCAGAC 360 

GGCATCATCG GCATTACTGC GGGCGAAGCA CAGTACAAAG CTGCCAACCT GCTACTATGT 420 

ACCGGTTCGG AGACGTTTAT TCCACCCATC CCCGGAGTGG AGCAGACAGA GTATTGGACA 4 60 

AACCGTGAAG CTCTACAGAA CAAAGAGATT CCGACCTCTC TCGTCATCAT CGGTGGTGGA 54 0 

70 GTGATCGGAA TGGAGTTCGC TTCTTTCTTC AACGGTATCG GTACGCAAGT GCACGTGGTG 6O0 

GAGATGCTGC CGGAAATACT CAACGGTATC GATCCCGAAC AT GCAGCT AT GCTACGCGCT 660 

CACTATGAAA AAGAAGGAAT CAAATTCTAC CTCGGGCACA AAGTAACATC GGTTCGCAAC 720 

GGAGCTGTTA CGGTAGAATA CGAAGGAGAA AGCAAAGAGA TCGAAGGAGA ACGTATCCTG 780 

ATGAGTGTGG GACGTCGCCC CGTGCTGCAA GGATTCGAGT CGCTCGGATT GGTGCTTGCC 84 0 

75 GGCAAAGGTG TAAAGACTAA TGAGAGGATG CAAACTTCCC TGCCCAATGT CTAT GCTGCA 900 
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GGTGATATTA 
GTAGATCAGA 
GTGTACACCA 
GGACGTGCCT 
AACGAACAAG 
GCACACCTCA 
ACCGGCATGA 
ATCCTAAAAG 



CAGGCTTCTC 
TTTTGGGCAA 
ATCCCGAGGT 
ACACTGTTCG 
GCAATGGAGA 
TTGGCAATCC 
CGGATCGACA 
AAACTCTCCC 



GCTTTTGGCA 
AACAGACGAA 
CGCCGGTGTG 
TCGCCTTCCT 
GTGCAAACTA 
GGCCGGCGAA 
AATCGAACGA 
CGGAGGT 



CATACGGCTG 
ACGATGACCT 
GGAGAGACGG 
ATGGCCTTCT 
CTACTTGATG 
CTCATCGTAA 
ATCATATTCC 



TACGGGAAGC 
ACCGTGCCGT 
AAGAATCGCT 
CCGGTCGATT 
AAGAGAACCG 
CCGCTGCCAT 
CTCATCCGAC 



AGAGGTAGCA 
ACCAGGTGTG 
TCGCAAAGCA 
TGTAGCAGAA 
CTTGATCGGA 
GGCCATCGAG 
TGTAGGCGAA 



960 
1020 
1060 
1140 
1200 
1260 
1320 
1347 



(2) INFORMATION FOR SEQ ID WO: 14 2 



15 



(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 2823 base pairs 

(B) TYPE r nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY; circular 



20 



til) MOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAL: MO 



25 



(iv) ANTI-SENSE: NO 



(vll ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRCMONAS GIHGIVALIS 



30 



(ix) FEATURE: 

{A) HAHE/KEY: misc feature 
( B ) LOCATION 1 . . . 2023 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 



35 



40 



45 



50 



55 



60 



65 



70 



75 



ATGGAATTGA 
CTTTCTGCCC 
GACAACGGAT 
TTTATCGCAC 
TTCTTGGAAC 
\ TATCTCGAAA 
. AAGACGGAAT 
TTGCTTATCC 
1 GAGCGCGGTG 
GAGGCTATAC 
CTGATGGACG 
TATCGTCCCG 
AACAAGATCA 
TATACGCCGG 
ACTACCACGC 
TCGATATTCG 
OGTCTGTCCG 
TCTAACTTCA 
GAGGGTGAAG 
TTCGGTATCA 
AATCAATACA 
TACTTCACCG 
TTTGCTCCTC 
GTGAAGAATG 
GAAGCAGACT 
GACGAAGTCT 
GAGAAGAAAG 
TACCTCAAGA 
GGTATCCTCT 
GGTGGCTTGG 
TCCGTATCTC 
GAT AT GO AAA 
GAAGCGTTCA 
CCGATGGCTG 
AAACCCATGA 
AATGAGCGAT 
GCCAACATGA 
GATAAGATGA 
GAGAAGGAAA 
TATACGCTCA 
ACGGCTACCG 
GAGCAATATC 



AAAGATTTTT 
AACAGGCTCA 
TGACTTATTT 
AAAAGGTAGG 
ACATGGCTTT 
CGATCGGTGT 
ATACGATAAT 
TGCATGATTG 
TGATCCAGGA 
TTGCCAAGG C 
TCGTGCTCAA 
ACCTGCAAGG 
AAGAACTCTT 
TAGAGGACAA 
AGCTCTCCAT 
GACTTGTGGA 
AGATTACTCA 
TGTACATCAC 
CGGAGAAAGC 
CCAAAGGCGA 
ACGAAAGAGA 
ATGGCGGCTA 
AGGTTCCTCT 
CTGTCGTTAC 
TCCTCGCTGC 
CCGACCAAAA 
ATCAGAAGTT 
AGACCGATTT 
CCGGAAAGCA 
GCAACTTCGA 
CCTCTTTGTC 
CTTTCTTCCA 
AGGCCACACA 
CGCTTATGGA 
AAQCTGCTGA 
TCGCTGATGC 
AG C CATT GAT 
ATAAGGCTCA 
TGGATACTCC 
AGAACAGTCT 
TTCGCGAGAA 
CTCAGCCCAA 



ATCACTTGGT 
GCCACTCCCT 
CATCCGTCAC 
TT CT ATTCTT 
CAACGGTACG 
ACGTTTCGCT 
GGATGTGCCG 
GAGTAACAAT 
AGAGTGGCGT 
TATGCCGGGT 
CTTCAAGCAT 
TCTGGTGATC 
CAAC GACGTT 
CGATGAGCCT 
CAGCTTCAAG 
AGACTATATG 
CAAGCCTAAC 
CCAGACTAAG 
GAT GAACG C A 
ATACGATCGT 
CAAGCGTAAG 
TATCCCGGGT 
GGAAGCATTC 
CCTCACCGGT 
TTTCAAAGCT 
ATTGATGGAG 
CGGTACCACG 
CAAATCAAAC 
TGCTCCCAAC 
TGCTATCCAG 
TCTGCTCAGT 
GTTGATCTAT 
GGAAAAGTTG 
CTCTATCCGT 
CGTGGAGAAA 
CGGCGACTTT 
CGAAACTTAT 
GCTACCGGCT 
TTCGACTACT 
CCTGCTGGAA 
GGAAGGCGGT 
GGCTCTGATG 



CTTCTGCTTG 
ACAGATCCGG 
AACGAGAACC 
GAAGAAGATA 
AAGAACTTCC 
CAGAACCTGA 
ACTACACGTC 
ATTACCCTCG 
GCTCGTCGCG 
AATAAATATG 
GATGAGCTGC 
GTGGGAGATA 
CCTGCTCCCG 
ATCGTAGCCA 
AGCGACCCCA 
AAACAGGTGA 
GCTCCTTTCC 
GACGCATTCA 
TTGGTGGCAG 
GCACGCACGA 
AACAATGCTT 
ATTGAGGTGG 
AATCAGGCTA 
CCTTCAAAGG 
GCTCGTCAGC 
AAAGCTCCTA 
GAACTTACCC 
GAAATCCTGA 
CAATCTGTGA 
CTGGATAAGG 
GAAGGTCTTT 
CTCCAAATGA 
TACAATAACT 
CATACCATGT 
GTAAA'JTACG 
ATGTTCTTCT 
CTTGCTTCAT 
GCCCGTTCGG 
ATATTCGATG 
GTCTTCTCAG 
GCATACAGTG 
C AG AT CT ATT 



TGGGATTCAT 
CTGTTCGTGT 
CGAAAGATCG 
GCCAGTCCGG 
CCGGTAAGAA 
ACGCTTCTAC 
AGGGAATCAT 
ACGGGCATGA 
AT GC CAAC CT 
CA GAACG CAT 
GCAACTATTA 
TCGATGTGGA 
TGAATCCAGC 
TTGCTACCGA 
CTCCTCAAGA 
TCACTACAGC 
TCAGTGCAGG 
ATTTTGTTGC 
AGATAGAAAG 
ATGTGCTCAA 
ATGCCAATGA 
AATATCAGAC 
TTGCCCAAAT 
CTGAAGCCAA 
AGAAAGTAGA 
AGGCCGGAAA 
TTAGCAATGG 
TGAGTGCTCT 
TGAATTCGTT 
TGCTGACAGG 
CGGGCAAAAC 
CTGCTAACCG 
TGAAAAATCA 
ACGGCGATAA 
ATCAGGTAAT 
TTATCGGTAA 
TGCCCAACCT 
GAAAGATCGA 
TCGTGTCCGG 
CCGTAATGGA 
TGGCTGCATT 
TCCCCACGGA 



TCCGATGAAG 
CGGTAAGTTG 
TGCGGATTTC 
TTTGGCTCAC 
CTTGATCAAC 
CGGATTCGAC 
CGACTCCTGC 
GATCGACGAG 
TCGTATGTTC 
GCCCATCGGT 
TAAGAAATGG 
CTATGTGGAG 
AGAGCGTATC 
TGCTGAGGCT 
AGTGCGAGGA 
CGTGAATGAG 
AGCTTTCTTC 
CACGGTTCGT 
CCTCCGTCAG 
GCGATACGAG 
ATACTCCACC 
GGTGAATGCT 
GATCGATCCG 
GATTCCGAGC 
AGCCAAGAAA 
GATCGTTTCC 
CATCAAAGTA 
CAGCCCGGGT 
CATGAACGTG 
TCGCTCTGCT 
GACTGTAGAA 
CAAGGATCCC 
GGAAGCCAAC 
TCCGATGATG 
GGCTTTCTAC 
TCTGGATGAA 
CAAGCGTGGC 
TTGCAAGTTC 
AAATGTGGAA 
TCAGGTGTAC 
CGGCGGTCTC 
TCCTGCTCGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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GCCGAGGAAA TGAATGCTAT CGTTTTTGCT 
AATGTGGAAT ACTTTAAGAA GACTATCGAA 
CGTGAGAATC GTTTCTGGCT CGAAGCCATG 
ATCACAGACT ACGAATCCGT ACTGAACGGT 
GCAGACCTCT TGAAGCAGCA GAATCGGGTT 
CAA 



GAGTTGGAGA AGCTTGCCAA 
AACCTGAATA AGCAGCACAA 
AAGGCGTCTT TCTTCGAAGG 
CTTACTCCTG CT GAATTGCA 
GTTGTCATCA TGGCTCCTGT 



GGAGGGCCCC 
AGAAAGTCTG 
AAATGACTTC 
AAAGTTTGCG 
TGCAAAGGCT 



2580 
2610 
27 00 
2760 
2820 
2823 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(21 INFORMATION FOR SEQ ID NO: 143 

U) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 2052 base pairs 
( B J TYPE: nucleic acid 
IC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

fiii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIIIGIVALIS 

FEATURE; 

(A) HAHE/KEY: miscjeatur- 

(B) LOCATION 1...20S2 



Ui) SEQUENCE DESCRIPTION: SEQ ID IIO:14 3 



ATGAGTAAGA 

TTCCTGTACA 

ACGCAGAAGC 

CGCGTAACGG 

GGGATGACCG 

GAGTTTCTTG 

TTTTACTCGG 

GATGCTACGG 

GACAAGGCTG 

CTCAAAAAAG 

ATCATTTTCG 

AATCAGATCA 

GACTATAAGG 

CACCTCAATG 

AACAACTTGG 

GATGAAGTAC 

CCGGATATTC 

ATCTCGTCTC 

CGCCCCACAT 

ACGGATGAGA 

CACAAGTACA 

GGACAGGTAG 

GCATCCGACA 

AGCCTGCTGG 

AACAATCTGA 

ACTCTCGTGA 

GCTTTCGAAT 

CGCCGTATGC 

GATTCGTACA 

GAGAAAGAAT 

GCGGAAGAGG 

GCCACGAAGG 

AACGATCAAC 

GCTCTGCTCG 

CAGCGTCTTC 



AAGGAACAAT 

GCGACCATGA 

TGAAAACGCT 

TCAGCGTGGA 

AAGAGGAGGT 

AAAAGTACAA 

CTTTCATGGT 

CGGTGAAATG 

ACCGTGGCAC 

AAAAGATAGA 

GCAAGAAGCA 

ACGACACACA 

AATTTTACCG 

TGGACTATCC 

ATCTGCAGCG 

AGGGTATCGT 

CCCTCAACGT 

AT AT C ACCAA 

TCGAGGAGAA 

AGTTCTATGA 

CGTTCGACGA 

TGTATCTCTA 

AAGGCTACAG 

AGCAAAAGTT 

TCCGCAAGGA 

AGCTGTTCGA 

CGCTCGGAGC 

GCGATATGGC 

ATCTGGTACT 

CGGTAGAGCC 

CCAAGCTGCT 

AAGCCAAGGA 

TGACCAAATA 

GAAGCCGATT 

TC 



CGGGGTAACG 

GATATTCCTG 

TACATCCGTC 

TGAAGTGGCA 

GGAGAAGTAC 

AGACGACAAG 

GTCCGAGCGA 

GAGCTGCGAC 

CGACATCGTG 

GGGGCTCCTC 

GGAATGGAAA 

TCCTGCCTGG 

TTCGCTCTAT 

GTTCAATCTG 

CAACAAGATT 

GCCGGACT TC 

ATCGCGCTCC 

GAAGGTGGCA 

ATGGGATAGT 

GCGTGCAGCC 

ATACCGAACG 

TGCTACGGAC 

CGTGATGCTG 

GGAGAAGACA 

GGAAAGAGCC 

AGCACGCCTG 

CGAAGGTGAA 

ACAGCTGCAG 

TAATACCGAT 

TTCGCTCACA 

CGATGAGGAA 

GAACAACGCC 

TGCTCAGGAC 

GCTGACGGGA 



AGCGACAATA 

CGTGAGATCG 

GGCGA ATTCA 

CGCACGATCA 

ATCAATCAGA 

GCCGCCATTA 

GTGGACGTGA 

GGATCGCCCG 

ATGCACATCG 

GGCAAATACT 

GACGGCAAGA 

ACCAAAAAGC 

CCCATGTCCG 

ACAGGTATCC 

CAGCTCTACT 

CTCACCCTCC 

TAT CTGCAG A 

GACCGTCTGG 

CTGAAGCTCT 

AAATTCTTCC 

CTCGTCGAAG 

AAGCATGGAC 

TTGGATGGTC 

CACTTTGTCC 

GAAGTGAAAC 

CCACGGGACG 

GCCATCCTTA 

CCGCGAATGA 

CATCCGCTCA 

GAGCTTAGAG 

AAAGGGAAGA 

GTCGAACAGG 

AACGAGCTGA 

GAGGCTTTGG 



TATTCCCCGT 

TCTCCAATGC 

AAGGCGAGAC 

CGGTCAGCGA 

TTGCTTTCTC 

TCGGCCACTT 

TCACGCGCTC 

AATACACGCT 

ATGAGGAGAA 

GTAAGTTCCT 

TGCAAGATAC 

CTGCCGACCT 

AAGAGCCTCT 

TCTATTTCCC 

GCAATCAGGT 

TGCACGGGGT 

GCGATGCCAA 

AAGAAATTTT 

TCGTCGAATA 

TTTTCACCGA 

GTGTACAGAC 

AGTACAGCCA 

AGTTGGATCC 

GT GT C GAT AG 

TGTCCGATAC 

AGAAGAAGCA 

TCACACAAGC 

GCTTCTACGG 

TCGACAGGGT 

CGAAAATCGC 

AACCGGAGGA 

CCAAAACCGA 

TAGGTCAGCT 

CCGAATTCAT 



CATCAAAAAA 

CGTGGATGCT 

GGGTGACCTC 

CCGCGGCGTA 

CACTGCGGAA 

CGGACTCGGA 

TTTCCGAGAA 

CGAACCTGCG 

TAGCGAGTTC 

TACCGTGCCG 

GGACGAGGAC 

CAAGGACGAA 

CTTCTGGATC 

GAAGATCAAA 

TTACGTCACC 

CATCGATTCG 

TGTGAAGAAG 

CAAAAACGAC 

CGGTATGCTG 

TATGGACGGA 

GGATAAGGAC 

CGTGAAACGT 

GCATATCGTG 

CGATACGATC 

GGAGCGCGCC 

CTTCAATGTA 

CGAATTCATG 

CGAACTCCCC 

ACTCTCCGGT 

CGAGCTGAAA 

AATCCCTGTT 

AGGCAGT AT C 

CATCGACTTG 

TCGTCGCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
102O 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2052 



70 



75 



(2 J INF0RIIATION FOR SEQ ID NO: 14 4 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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<ii> MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

O 

tiv) ANTI-SENSE: NO 

(vx) ORIGINAL SOURCE: 
1Q <A} ORGAt ^SM: PORY PH YROMONAS GINGIVALIS 

(ix) FEATURE: 

<A) NAME/KEY : misc_ feature 
(B) LOCATION 1 1392 



15 < xi > SEQUENCE DESCRIPTION: SEQ ID NO: 144 

otSSS?? ^g^ta GACGATGATG TGGCAGTCTG TGCCGCACTG 60 

CGTC1 GGTGC TCAAGCGAGC GGGCTATAAT CCCGTTATAG CCAACAGTCC CGACGAAGCT na 
20 IJSSS™* TGCGGAATCC TGATGGCGGC TGTAAGCCGG CTGTgSct GATG^AT^ 
tttcgacctc cggcagggaa GGATTGGAAC TACTCGAGAA gatgcagata 
£S5S5£Z SSSSSJ ACTGATGACG GCTTGGGCTT CGATTCCACT GGCA^GGaS 
SEESE TCGGCTCCTT 



30 



180 
240 
300 
360 
420 



, v-wv_aaw\im tsis^AAtaGCAT GGGACAACGA TCGGCTCrPT 

SSSSSS" ATACGGCCTT SCATCTGGCT GCTCCCTCA'3 CTBTGGCeAA TOW^SS 

25 ?«^™ CA ? C AAAGCTACAG TCCAACAGAA ?^£5S2? 4^0 

«»cc«»ta s<o 



600 
660 



- _™rrrrz*~ lui^mm AtrfSAACGGAT ACGCCGCATA 

ATtaCCTCTGT GCTGATCACG GGCGAGAGCG GTACGGGCAA AGAGTTGATA 
GCCGAAGCTC TGCACCGTGG GAGCAAACGA GCCTCAGCCC CATTCGTCAA GGTCAATTTG 660 
GGTGGGATTC CCGAAAGTTT GTT CGAAAGT GAGCTGTTCG GACATAAGAA 2*^25? 7?n 
llrr^™ GAAAGGAC ^ TTCGAGCTGG CTGA^S 780 
SSSS*^ ^? CGAACT ACCGGTCGGC AACCAAGTAA AACTGCTGCG AGT GCTACAG 

?SacgI ^^ AGAGC otctccc *- c GAGTGGACAT CCGTGTGGTA 

ATGCTTCCTT GGAGCGAATG GTAGCCGAAG GACGTTTCAG AGAGGACCTC 
1*7*1^ T CAACCTGAT ACATCTGCAT CTGCCTCCGC TGCGTGAGCG TCAGGAGGAT 
ATACAGCTGC TGGTGGAAGC CTTCAGTGAA GCCTTTGCCC AATCGAACGG ATTGCCCCAT 
CGCGAACTGA flAAn^^ra^^ TATGCGACGT ATCTGTGCCA TGCOCCTACC G^GCAATGTA 1140 
rrrn^ZZr AAAACGTAGT GGAGCGTACG CTATTGCTCT CGGGATCGAG AGAAATCAGT 1200 
gSScg ISSUES ^ GGTTCGCAG GTGACGGCAG CAGACCACTC CGACGAACGG 1260^ 
gSaOT^GTG CT^r^^ ^ GCTATC CG ^AGACGC TGACTAAATA CAACGGCAAC 1320 
40 aH^G^AC ^ GCACGAGC CTTGGGATTG AGCCGGGCAG CTCTTTACCG GCGAATGGAG 1380 

1392 



840 
900 
960 
1020 
1080 
1140 



(2) INFORMATION FOR SEQ ID NO: 14 5 

45 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(Bl TYPE: nucleic acid 

(C) STRANDEDMESS; double 
5 q tD) TOPOLOGY: circular 

iil) MOLECULE TYPE! DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 Uv) ANTI-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

60 (ix) FEATURE: 

(A) NAME/ KEY : mise feature 

(B) LOCATION 1. . .7S0 

gg {xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 

SIS^S? 6 * TAAAGAACCT CCACGCCACA GTACAGGGCA AAGAGATATT GAAAGGAATC 60 

^1™™™ TCAATGCCGG AGAGATTCAT GCTATCATCC GGCCGAACGG ATcSg^AA^ 120 

ACA?5cI15g SSSPEE *™ GGG *™ CCCTCCTTTG AACTCACGGA Ho 

70 WctS?^ wIS5££ 52I CGAACTC GAACCGGAAG AACGTGCACA CCTCGGACTC 24 0 

G^AG^S IS^SSS SSIf 5AGATC CCGGGC GTCA GCATGGTGAA TTTCATGAGG 300 

GCAGCTGTCA ATGAACATAG GAAAGCGATC GGAGCAGAAC CCGTATCGGC AAGCGACTT- l^n 

GCGTGCCATT GTGGAGCTGG AC^CAAAT^ "i «S 

T^SISi* AAGGCTTCTC CGGTGGAGAA AAAAAGAGGA ACGAAATCTT CCAAATGG^T rfflft 

75 ctcSS™ T ATTTT GG AC GAAACCGATA GCGGGCTCGA TATC^AcIS S40 

CTCCGCATCG TAuCAGGCGG GGTAAACCGA CTCCGCTCTC CGGAGAATGC TGCTATTGTG 600 
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ATCACACACT ATCACCGTTT GCTCGAGTAC ATCAAGCCCG ACTTCGTACA CGTCCTTTAC 
AAGGGGCGCA TCGTCAAGTC GGGAGGAGCC GAGCTGGCTC TCACGCTCGA AGAAAAAGGC 
TACGACrGGA T CAAGG AAG A GATAGGAGAA 



660 
720 
750 



10 



15 



20 



25 



30 



35 



40 



45 



50 



(2) INFORMATION FOR SEQ ID HO: 14 6 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 1383 base pairs 
(S) TYPE; nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circuJar 

tii) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

{VJ ) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix> FEATURE: 

(A) NAME/ KEY : miccf ea twre 
tB) LOCATION 1...1383 



txi) SEQUENCE DESCRIPTION: SEO ID MO; 1<1 6 



ATGGCTAAGG 
CAAGGCAACT 
CCGGCATCGG 
AGACTCTTAC 
TTCGACCGCG 
CCGGGAATCG 
ACGCTCTATG 
GGGCAAGCCA 
CGTGCAGAAG 
GAGGAAATGG 
CTCAAATACT 
GGTAGCATAG 
GGGGATAAGC 
TCCGAGCTGG 
GAACATCTCA 
GAGGG CATTC 
AATCCGCAGC 
CTGGAGAAAC 
GGTATCAAAA 
AGTCTGGACA 
GAGATACGTC 
AAAGAGATAT 
CGGCTCGTGC 
GAA 



AGAAAACGAT 
GCAATGCCTG 
GCAAGCATGC 
AGGATGTGGA 
TACTGGGTGG 
GTAAGTCCAC 
TGTCGGGCGA 
TGAATGGGTG 
AACTCACACC 
AAAGCTCGGC 
GCAAGACTAC 
CCGGACCGAA 
ATCATCTCTA 
GGATATACGA 
TCACACGCAA 
GCCCGATACT 
GTTCGGCCAC 
GTGCCGGCTT 
TAGCCGATCC 
TCGTTATCCC 
CCGTGAGCCG 
TGGTACCGGC 
CGGTCAGAAA 



CTACGTCTGC 
TGGAGAGTGG 
AGCCAAGAGT 
GT CCGGCG AT 
AGGAATTGTC 
GCTTATCCTC 
AGAAAGTGCC 
CTACGTATAC 
CGATCTCCTC 
CGGCAGCGTG 
GGGTATCCCC 
GGTGCTGGAG 
CCGGATACTC 
GATGCGGCAG 
TAGGGAAGAC 
CATCGAAGCG 
GGGCTTCGAT 
CAAGCTCATA 
GGCTACGGAT 
GCCGGCCGTA 
CATCGAGCAG 
CGATAATTTC 
GGTGGAGGAA 



CCTTCGTGCG 
AACTGCATTG 
TTTATGCCTC 
GAAGAGCGTA 
AAAGGAGCAT 
CAGACGGTGC 
CGACAACTGA 
TGCGAAACGA 
GTGATAGACT 
GGGCAGATCC 
GTCATCGTCA 
CATATAGTGG 
CGAGGACAGA 
GACGGTCTGC 
CTCAGTGGCA 
CAGGCTTTGG 
ATTCGGCGGA 
CAG AAG GAT G 
CTGGCCGTTA 
TGCATGACGG 
CG CAT AAC GG 
CGGCAGGAGG 
GCCTTCCCCC 



GAACCAAATA 
ATGAGGAGAA 
GGG AG CAG G A 
TTCGCCTCGG 
TTGTCCTGCT 
TGCGTCTGCC 
AGATGCGCGC 
ATATAGAGAG 
CTATACAGAC 
GCGAATGTGC 
TCGGACACAT 
ATACGGTGCT 
AGAACCGCTA 
GTGGCGT GGA 
TAGCCATAGC 
TCAGCTCGGC 
TGAACATGCT 
TGTTTCTGAA 
TCTCGGCAGT 
GCGAGGTCGG 
AAGCGCGTCG 
ATGCCGGCCG 
ATCTGTTCTC 



CGCCAAATGG 
GGTGCCGGCA 
•CAACCGGCCA 
CGATGAAGAG 
TGGCGGCGAG 
GCAGT TGCGC 
CGAACGACTG 
GATACTCTCC 
GGTCTATACC 
CGCCTTACTG 
CACCAAAGAA 
T CTCTTCGAC 
T GGCAGT ACT 
GAATCCGAGC 
CGTAGCGATG 
CAT T T ATGCC 
CTTAGCCGTA 
CATTGCCGGA 
GCTGGCGTCG 
ACT CTCCGGA 
CATAGGGTTC 
CTTCGGTATT 
GAAAGGAAGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1363 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 147 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 
(D) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 

til) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(lv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMGHAS GINGIVALIS 

(i>:) FEATURE: 

(A) NAME/ KEY : nusc_f ea turo 

(B) LOCATION 1 . . .813 
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txi) SEQUENCE DESCRIPTION: SEQ ID MO: 147 



ATGAATAGCA GACATCTGAC AATCACAATC 
TTGGGCGGCT GCTCCGTAGC CCAACAAGAT 
ACTTCGGCGT GGA'l'ACAACG TTCGGCCGAA 
ATCGCTACGG AAAGAGTGGA CGCTCTACCG 
GCCATCGTAA CGGACATAGA CGAAACCATT 
GCTCTCAGCG GCAAGGATTA TGATGAAGAG 
GCCGACACAC TGGCAGGAGC TTTGTCTTTC 
GTCTTTTACG TCACCAACCG CAGAGACAAT 
CGTTACGGAT TCCCCTTTGC CGATGAAGAA 
AAAGAACCCC GTCGGCTCAA AATACAAGAA 
AACTTGGGCG ACTTCCACCA CTTCTTCAAT 
CTGGGCCTGA CAGCCGGGSA GTTTGGCCGG 
GCATCTTGGG AACCGGCATG GTACGGCGGG 
GCACTTAAAC AACTGCACTC ACAGAACAGC 



ATTGCCGGCC TCTCCCTCTT TGTACTGACA 
ACGCAGTGGA CTCTCGGCGG AAAGCTCTTT 
TATCAAGCGC TTTGCATTCA GGCATACAAC 
GCAGAACGTA AACAAGGAGA TAGGCCTTAT 
TTGGACAATA CGCCTAACTC CGTGTATCAG 
ACTTGGGGGA AATGGTGTGC ACAGGCCGAT 
TTCCTCCATG CAGCGAACAA GGGGATCGAG 
CTGCGCGAAG CAACTCTTCA GAACCTT CAG 
CATTTGCTTA CGACCCATGG GCCATCCGAC 
CAGTATGAAA TAGTATTGCT CATAGGAGAC 
ACGAAAGAAG AGTCCGGACG CAAACAGGCT 
CACTTCATCA TGCTGCCCAA TCCCAACTAC 
AAGTATCCGC CACTGCCCGA AAGAGACAAA 
AGA 



(2) 1 1 J t'ORi iATI ON FOR SEQ ID HO: 148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRAW DEDH ESS : double 

(D) TOPOLOGY; circular 

(il) MOLECULE TYPE: DMA (genomic) 

(1HJ HYPOTHETICAL: WO 

tiv) ANT I -SENSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; PORY PH YROMONAS GINGIVALIS 

lix) FEATURE: 

(A) HAME/KEY; miscf eature 
(81 LOCATIOH 1...12S1 

(Xi) SEOUENCE DESCRIPTION: SEQ ID WO; 148 

ATGAGCACCA ATATAGATGT ACAACAGATC AAACAGCGTT TCGGCATCAT CGGTAGCAGT 
CCGCTGATGG AACATGCCAT ACGAGTGGCA GCACAGGTGG CTCCTACCGA CATGTCCGTC 
CTCGTGACGG GGGAGAGCGG TTCGGGGAAA GAGTTCTTCC CACAGATAAT CCACTACTAC 
AGCGCCCGGA AACAT CAT AG CTACATTGCA GTCAATTGCG GAGCCATCCC CGAAGGAACC 
ATCGATTCCG AGCTGTTCGG AC ACCG C AAA GGTTCCTTTA CCGGAGCCGT ATCGGATCGC 
AAGGGGTACT TCGAAGAAGC ATCCGGCGGC ACGATCTTTC TGGACGAAGT GGGCGAACTG 
CCTTTGCCCA CGCAGGCGAG GCTGCTGAGG GTGCTGGAGA CGGGCGAGTT CATCCCCGTA 
GGAGCCAGCC AGTCGCAGAA GACGGATGTC CGTATCGTAG CGGCGACGAA TGTGAACCTC 
AAGGAGGCGG TAGCGAACGG GAAGTTCCGG GAAGACCTCT TCTTCCGGCT CAATACGGTA 
CCGATCGAGG TGCCTGCGCT GCGTATGCGA CCGGACGACG TGCCCTTGCT TTTTCGCCGA 
TTCGCCGCCG ACAGCGCCGA GAAGTATCGG ATGCCTCCGC TGCGCCTATC GGACGAAGCC 
CGTACCATAT TAATGCGTTA CCGCTGGCCC GGCAATGTGC GAGAGCTGCG CAATATAACC 
GACAGGCTGA GCATCCTGGA GGAGGAGCGG ACGGTATCGG CAGAGACCAT CACTCGCTAC 
CTGGACGCTG AGGGGATGCA AGACCTCCAC CCCGTCGTGA TCCGACGGAA CGAAACGACC 
GAAGCGGACA AACAAATCCC CCATTACGAG CGCGAAATCA TCTACCAGGT GCTATACGAT 
ATGAAGAAAG AGATAGCCGA TTTGAAGGGG ATGATGAACC GCCTGGCGCA CCACGAACAG 
CCCTCATGGC CTGTAGGGTC GGACGTCTGG GGCAACGACG ACAAGCGCAC CGCAGATCCG 
AAGTGGGGCG TCAGCACGCA CAAGGCCCCC ATCGCGAACG CGGCAGAACC CGTGGAGCCC 
ATACAGGAAG CCAGCGAATA CACCGAGGAT CCGGTTTCGC TGGAGGAGGT AGAGAAGAAA 
ATGATTL'CCC TTGCATTGGA ACGCCACGGC GGAAGGCGCA AGCAGACAGC CGAGGAACTG 
AAGATTTCGG AGCGGACACT ATACCGTAAA ATCAAGGAGT ATGGACTGGA A 



(2) INFORMATION FOR SEQ ID WO: 149 

(i) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 1806 base palrr 

(B) TYPE: nucleic acid 

(C) STRAW DEDN ESS : double 

(D) TOPOLOGY; circular 

<ii) MOLECULE TYPE: DNA (genomic:) 
(ill) HYPOTHETICAL: HO 
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{ivj ANTI -SENSE: NO 

{Vij ORIGINAL SOURCE : 

(A) ORGANISM: POR Y PHYROMON AS G1NGIVALIS 

<lx) FEATURR: 

(A) NAME /KEY: miscjsature 

(B) LOCATION 1...1806 

<xl } SEQUENCE DESCRIPTION: SEQ ID 110:149 



ATGATCCTAC 
GCGCGAAGTC 
GGCCATCGTC 
GAGATCCGAC 
GCAGAAGCCT 
CGCGAAGAAG 
GATGGCGTGC 
TTCTGGAAAG 
GCAGAGAAGA 
GAGCTGATGC 
TACTTCCGAG 
CGTCCCCCTT 
GTAGAACGCC 
GCTACCGCTG 
GCTCCCGTAG 
GAAGAAGATG 
AATGGCATGG 
CTGCAGGGTG 
CCGGCTAT CA 
CAGGGAGGTC 
TTCAATCCCC 
AAACCTTTTT 
GCCTTGGGAG 
GGAGCCTATC 
TCACTCAGTT 
GGCAAAGCAG 
AATCGGGATT 
ATAATAGACA 
GATCGACTGC 
TCGGCCGAAT 
GGAGAA 



TCTTCGGCGG 
CGTTTTTCTA 
TGACAGGAGC 
TGATCGTGGA 
CTGAACAAAC 
GTATCGTCTG 
AGCGTCTGCT 
AGCGCACCAC 
ACGGCTTTCC 
AAGCCGTTCG 
AAAAGATAGA 
TGCCTCCTTC 
TCGTGCCGGG 
CAGTAGTAGC 
AATTGCCTTC 
CTGTCGTATC 
CGGTATGCGC 
AAGGGGTGGG 
ACCTCGTACC 
TGGATATTAC 
G ACT C GG CAT 
CGGCCGAAGC 
CCAATCATAT 
CGGCACTCAT 
GTGTAGCTTC 
TGAAACTCGC 
TCCTGCACGA 
GCCTGAATTT 
TACGAAAGAT 
TAGAACTCCT 



TACTACGGAA 
CTCCACCAAA 
CATGACGGTT 
CGCCGCTCAT 
AGGTATCCCC 
GTGTGCAAAC 
GATGCTCACA 
CTTTTGCCGC 
TGCGGAGCGC 
CCCCGATGCC 
AGCTGCCCGA 
ATTCATTCCC 
ATTCTTTTCA 
AGCCATGTAC 
GGGCGAAATA 
CGCAGTCCTG 
TACGATCAGG 
GGTAGTGACG 
TCGACGAATG 
GATTAGCGTA 
ACGGGACGGC 
GTTCGTTGGT 
CGTCCTCAAT 
TCCACAGGCC 
CTTCCCTTCT 
CGAAGGCTAT 
ACTGGCTCGT 
GGCTCGTGAG 
TGCCGAACGA 
GCTGATCGAT 



CGCCGTGCCG 
GGCAATCTGC 
GCCGACATGG 
CCTTTCGCCG 
GTAGTAAGAT 
TACGATACGG 
GGAGTGAATA 
ATATTGAAGC 
ATCGTTTTCT 
ATTATCACAA 
CGGATGGGCA 
GTAGGCGGGC 
CTCCGAAGCG 
CGATTGATGG 
GTCAGTCTGC 
AAAGATGCAG 
CTCAATCCCG 
CTCCCCGGCC 
ATGACAGCAG 
CCCGAAGGCC 
ATCTCTATTA 
GCCATCCGTA 
TCGGGAGCCA 
TTTGTGCAGT 
GTCCGTTCGG 
CTGGATACGC 
CAGGCAGGTT 
CTATGGACTA 
TCTTGGGAAA 
GAGTCCGGAG 



CAGCTCGCGT 
AAGAGATCCA 
TTTCGTTTTG 
AAGAATTGCA 
ACGAGAGACA 
CTGCCGAGCG 
CGATCCCCAA 
GAGACGAATC 
TCGAACCGCA 
AAGAAAGCGG 
TCCGTATATA 
CTGTCGGTTT 
GATTCACTAC 
GGCTTGGCTC 
CCATAGCGGA 
GTGATGATCC 
AACATGAGGA 
TCGGTCTGGA 
AGGTACGCCG 
GAGAGGCTGC 
TCGGAACATC 
AGCAAGTGGG 
AGAGTGAGCG 
ATGGCAATTT 
TAACGGTAGG 
ACAGTAAAAA 
GTTCGGAAGA 
TGCCGAGTGC 
CTTGCCGCCC 
CGATTCGTTT 



GCTGGATGAA 
GAGTAGCCAC 
TCGGAAAGAA 
CGCTTCAGTG 
ATACCCTCCA 
GATGCTTGGC 
GCTGGCTGCT 
GGTTGCTTTG 
TGCGGACGAG 
AGAGAGCGGT 
TGCCGTCGTA 
GAGACCGGCG 
CGGCACCACA 
TCTCGCCGAA 
AATTCGAGAG 
GGATGTGACC 
AGTCCGCTTC 
GGTCGGAGGT 
ACT CTATGCG 
TACCCAGACA 
GGGAGTCGTG 
TATTGCCACC 
TTATGTAAAA 
CGTCGGCGAA 
AATCATGCTC 
GGTAGTGATG 
CATCCATGCC 
GGAGGACAGC 
ATCGGTACCA 
TCGTATCGGT 



(2) INFORMATION FOR SEQ ID HO: 150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 
(G) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

tii) MOLECULE TYPE: DHA I genomic) 
(ill) HYPOTHETICAL: NO 
<iv> ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROI 10NAS GINGIVALIS 

( ix ) FEATURE : 

[A» NAI IE/KEY: rolsc_f eature 

(B) LOCATION 1...1329 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

ATGTTGAGGA CTTTCCGAAT CGGTGGTATT CACCCCCCCG AAAACAAGTT 
AAfjCCCGTAG AGGTGTTGCC TATCCCCTCA CAGGTAGTCA TCCCTCTTGG 
GGTGCACCGG CAACTGCCAC GGT CAAG AAA GGGGATGAAG TTAAGGTCGG 
GCTCAGGCCG GAGGATTCGT ATCAGCTAAT ATCCACTCAT CTGTGTCGGG 
AAGATCGATA ACGTATACGA CTCAAGCGGC TAT CCCAAGC CCGCAGTCTT 
GAAGGTGACG AATGGGAAGA GGGCATCGAT CGCTCACCAG CCATCGTCAA 
CTGGATGCAA AAGAAATCGT AGCCAAAATT TCTGCAGCCG GTATTGTGGG 



60 
120 
180 
240 
300 
360 
420 
430 
54 0 
600 
660 
-720 
780 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1806 



GTCGGCAGGC 
TOAGCACATC 
GACTAT CATT 
TAAGGTGCTG 
GATTAGCGTA 
AGAATGCAAT 
TCTTGGCGGT 



60 
120 
180 
240 
300 
360 
420 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



o .\3 »LJ» -tun- 



WO 99/29870 PCT/AU98/01023 



102/490 



GCTACCTTCC CTACCCATGT GAAGCTGTCC 
ATCATCAACG CCGTAGAGTG CGAGCCTTAT 
CACGGCGAAG AGATCATGAT CGGCGTGAGT 
GCCGTCATCG GAGTTGAGAA TAATAAGAAA 
5 ACTGCATATC CGGGCATAGA GGTAATGCCG 
AAGCAGCTGA TCGATGCAGT GATCCGCAAG 
ACAGGTGCCG TAGTACAAAA CGTGGGTACG 
AACAAGCCTC TGGTCGAGCG GATCGTGACG 
AACCTCCTCG TTCGTATAGG TACTCCTATT 

10 CCGGAGAATA CGGGCAAGAT CATCGGCGGA 
CCGGATGTGC CTGTGACCAA AGGCAGCTCC 
GTTCGCAAGC CTATGCGCGA CTGTATCCGA 
GGACTCAATC CGGCTTTCCT TATGCGCGAC 
AAAGGCAACG TGGTTGACTG TATCGAATGC 

15 CGTCCTCTGC TGGATTATAT CCGCCAAGCC 
CGTAAGCAA 



CCTCCTCCGG GCAACAAAGC TGAGATUCTG 4 SO 

CTGACGAGCG ACCATGTCCT TATGCTGGAG S40 

ATCCTGATGA AAGCCATTCA GGTAAACAAG 600 

GATGCTATTG CTCACCTCAC CAAACTGGCC 660 

TTGAAGGTGC AATATCCTCA AGGCGGTGAG 720 

CAGGTAAAAA GCGGTGCCTT GCCTATCAGC 70O 

GTATTCGCCG TGTACGAAGC AGTACAGAAG 84 0 

GTTACAGGAA AAAAACTGTC TCGTCCGTCT 900 

GCGGCTTTGA TCGAAGCAGC AGCTGGCTTG 960 

GGTCCGATGA TGGGACGCGC TCTGCTGTCA 1020 

GGAGTATTGA TTCTCGATAG AGAAGAGGCA 10B0 

TGCGCCAAGT GCGTCGGAGT GTGTCCGATG 114 0 

ACCTTATATA AGAGCTGGGA AACAGCGGAA 1200 

GGTTCGTGCA GCTTCACCTG TCCGGCCAAC 1260 

AAGAAGACfG TGATGGGTAT CCAAAGAGCA 1320 

1329 



20 



(2) INFORMATION FOR SEQ ID NO--1S1 



(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS : double 
25 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 



30 



35 



(iii) HYPOTHETICAL; NO 

<iv) AUTI- SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMQNAS GIHGIVALIS 



(ix» FEATURE: 

(A) NAME/ KEY : miscjeature 
tB) LOCATION 1...1437 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 



ATGAAAAGAA TACAACTAAC TCTTATCGCT 
CAAAATGCTT ACGAGGGAGT AATTT CAT AT 
GTTGTACTGA ATGGTGCGGC AGATATGAGT 

45 ATTGTTACGC CTATTCTTCG TTCAGAAGAT 
GTCATTACAG GCCGCAATAG AACAAAAGCT 
TTGCCCCAAG CAAAACATGC AGCTCAATAC 
TTTGCTTTTA CAGGAGAACA TGCTTATGCA 
CGTGAGGAGG TACGAGGTTG TGCTAAATGC 

50 TTTGATCCAC TCTTCAATCC GGCAGAGGCT 
GAAGAAGTGG AAAAACAGCG AGAGTCCAGC 
AAGGCAGATG TCCTTCCTGA GTATCGCAAC 
TTTGTAAGCA CCGTTAAGGC TAATCCAAAC 
TTTGCTTCTC CCGAGGCTTC AATAGCCCAC 

55 AGACTCGCGG AAGAATTGGT GCGTAAGTAT 
TTCGGCGGTG AAGATTGGAA GGGGCTGAAA 
CGTGACCGCG TATTGGAGAT AATCAACTCC 
CAGGCTCTGA AGCAACTTTC GTCTTATCGT 
CGTCGCAATA CGATAACCAT GGGGTAT AT C 

60 GAAATCATTA AGACTGCTCC GAAAGAACI'T 
TCTTATCCTG AGGGGCACCA AGAGCGTTTG 
CCTGAAAGTG TAACGGGCCG AATCAATTTG 
CAACAGGCAA TTGCTCTGTT GAGTCCGATT 
GGAGCTGCTT ATGCTCGTAC GGGAGATTTT 

65 GTTGCAGAAG GAGATGCAAA TGCGCAGCGC 



CTCTTCGCCG CTGTTGCCGG TTTGGTCGCT 60 

AAAATTTCGT TGGACAAAAC CGGAAACAAG 120 

AATTT AAAGC TCAAGAGCAC TCAGATGATC 180 

GGTACCAGCC GGGTGGAATT TCCTTCGGTA 240 

CTCAAGCGTG AAATCGCATT TAG TT CGGCT 300 

ATTCGCCGTC ATAATGGGAA GAG CG A GC AG 360 
TCATGGATGA TGGATGCCAA GTTTGTGGTT 4 20 

CCTGTAGGTC TCTCGAtffAA TATTGTTCCT 460 

CCTTATTTGT TGGCACACAT TACTCCGGCA 540 

TTCGATGCTT ATATCAACTT CAAAGTCAAT 600 

AATAAGGCGG AGTTAGAGAA AATCAAAGAA 660 

TATTCGGTCA ATAAAATGAT CATCGAAGGG 720 

AATAAGGCTT TGTCGGAGCG CCGTGCTAAA 780 

GGCAAAACAT TGCCGAATAT AACCACTGAA 840 
CTGGCTATCG AAAAGAGTGA TATAGCCGAT 900 
GATAAATATG CCGATGATGA TGCACGTGAA 960 

TATATCTTGG ATCAGATCTA TCCGAATTTG 1020 

GTTCGTGATT ATACCCTCGA AGAAGCTCGT 1080 

AGTGAGGCCG AAATGTACCG TGTGGCAATG 114 0 

TTTGCTCTGA ATACGACCCT TAAGTATTTC 1200 

GCTGTAGCCG CTTTTAATGG TGGAGACGTT 1260 

CAGACAGAAA AGGGTGTAAG CAATATCCTT 1320 

GCTCGTGCCG AAACCTTCTT CCGTAAGGCC 1380 

AACCTCGATA TGCTGCTTGG CAAAAAG 1437 



<21 INFORMATION FOR SEQ ID NO: 152 

70 (!) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 114 9 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



AU» & 



WO 99/29870 PCT/AU 98/01 023 

103/490 

{ix> 1IOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

5 |iv) AHTI-SEHSE: NO 

<vU ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYKOMONAS GIHGIVALIS 

10 (ix) FEATURE: 

(A) NAME/ KEY: miscf eat u re 
tB) LOCATION 1...1149 



15 



20 



30 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO:152 



ATGGCAGAAA AAAGAGACTA TTACGAAGTC CTCGGTGTAT CGAAGAATGC CACCGACGAT €0 

GAACTGAAAA AAGCATATCG CAAGAAGGCT ATCCAATACC ATCCTGATAA GAACCCCGGT 120 

GACAAGGAGG CCGAAGAGCA CTTCAAAGAG GTAGCTGAAG CCTACGACGT ATTGAGCGAT 180 

CCGCAGAAGC GCAGTCAATA TGACCAGTTC GGCCATGCCG GATTGGGCGG AGCTGCCGGT 24 0 

GGAGGTTTCA GCGGAGGCGG TATGTCCATG GAGGATATTT TCAGTCGCTT CGGTGATCTA 300 

TTCGGTGGGT TCGGCGGTTT CGGCGGATTC TCCGATATGG GCGGTGGCAG TCGCAGACGT 360 

GTTCGCAGAG GGTCTGACCT GCGAGTACGA GTGAAGCTTT CTTTGGCCGA TATAAGTAAA 420 

GGTGTGGAGA AGAAAGTGAA GGTAAAAAAG CAGGTAGTGT GCAGCAAATG TCGTGGCGAT 480 

GGCACGGAAG AAGCCAATGG CAAGACTACC TGCCAGACCT GCCATCCAAC CGGCGTGGTT 540 

Zt) ACACGTGTGA GCAACACTTT CCTTGGGGCC ATGCAGACCC AGAGCACTTG TCCCACTTGC 600 

CACGGAGAGS GTGAGATCAT CACGAAGCCA TGCTCCAAGT GTAAGGGCGA AGGTGTGGAG 660 

ATCGGCGAAG AGGTGATCTC ATTCCACATC CCTGCCGGTG TAGCCGAAGG AATGCAAATG 720 

TCCGTGAACG GCAAGGGAAA TGCCGCGCCC CGAGGAGGCG TGAATGGCGA CTTGATAGTC 780 

GTGATCGCCG AGGAACCGGA TCCGAATCTG ATCCGCAATG GCAACGATCT GATATACAAT 840 

CTGCTTATAT CCGTTCCGTT GGCTATAAAA GGAGGTAGTG TGGAAGTGCC GACGATAGAC 900 

GGACGAGCCA AGATCCGCAT CGAGGCGGGG ACACAACCCG GCAAGATGCT GCGTTTGCGC 960 

AATAAGGGGT TGCCCAGCGT AAACGGCTAT GGCATGGGAG ACCAACTGGT GAATGTCAAT 1020 

GTCTATATCC CCGAATCGAT CGATGCCAAA GATGAGCAGG CTATCGCAGC GATGGAAAAC 1080 

TCGGACAGCT TCAAACCTAC CGATGCTGCT CGTAAGGATA TAGACAAGAA ATACAGAGAG 1140 
JO ATGCTGGAT 



(2) INFORMATION FOR SEQ ID NO: 153 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 87 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



45 



50 



60 



70 



(ii> MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : HO 
(iv) ANTI -SENSE: NO 



(vi] ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

55 (i X ) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1. . .879 



ixl) SEQUENCE DESCRIPTION: SEQ ID NO: 153 



ATGAAAAAAC TGATTTTAGC GACTTTGGGA CTTATGGCCA TTGCCATGCT CTCATGTTCA 60 

AGCAACAACA AGGATTTGGA GAACAAAGGG GAGGCTACTC TTTTGGTAAC GTTTGGTAGC 120 

T CCl'AT AAAG CTCCACGCGA AACCTATGCG AAGATTGAGA AGACTTTTGC CGCACCTTAT 100 

S^™?*** ^ G ATAAGCT G GACATACACG TCTTCTATTA TCCGAAAGAA ACTGGCTCAG 240 

CAGGGTATTT ATATCGATGC TCCGGATGAG GCTTTGGAGA AATTGGCTCG TCTGGGTTAT 300 

AAGAAGATCA ATGTACAGAG TCTT CATGTG ATTCCCGGCC GAGAATATGA TGAGATGATC 360 

GACTTTGTCA ATAAGTTTAA GGCAGCACAT AGTGATATTA CTGTGAAGGT AGGGGCTCCG 420 

CTTTTCGATA CCGATGAAGA TATGCGCGAG GTGGCAGAGA TCTTGCACAA GCGT TTTCAG 4 80 

CAAACGATAG AGAAAGGTGA AGCTATTGTA TTCATGGGAC ACGGCACCGA GCATGCTGCC 54 0 

AATGACAGGT ATGCCCGTAT CAATAAGATC ATGAAGAACT ATAGCAAGTT CATGATCGTC 600 

GGAACCGTCG AGTCCG AT CC CTCTATCAAT GATGTTATTG CCGAACTGAA AGAAACCGGT 660 

GCCACGGCCG TAACAATGAT GCCGCTGATG AGTGTGGCAG GCGACCATGC TACGAATGAT 720 

ATGGCCGGAG ATGAGGACGA TAGCTGGAAG ACGTTGCTGA CCAATGCCGG CTACACAGTT 780 

TCTATAGACA AGCTGGACAA TGGCAATTTC TCAGCTCTTG GAGATATAGA AGAGATCCGG 84 0 

/O AATATCTGGC TCAAGCATAT GAAAGCCACC TCTGCTCGC a7Q 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



.„}1 ' rli "3 n .,JL TL™ «"* Cji 1.1 



WO 99/29870 



PCT/AU98/01023 



104/ 490 



(2) INFORMATION FOR SEQ ID HO:154 

5 U) SEQUENCE CHARACTERISTICS: ^ - ■•;,>^Cliv:^ 

(A) LENGTH: 10S0 base pair- ' ' -7^ 

(B) TYPE : nucleic acid *" V^' * 

(C) STRAHDEDNESS: double • , ( r^-~s±-7&4>$V^k± 
tQ < D > TOPOLOGY; circular * . ti-'^r **; * - 



15 



.20 



(ii> MOLECULE TYPE: DMA (genomic) 
<iii) HYPOTHETICAL: NO 
tivi ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM; POR Y PH YROMOMAS CINGIVALIS 
Ux) FEATURE : 

(A) NAME/ KEY: misc featur- 

(B) LOCATION 1. . .1069 



60 

120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



25 <Xi) SE QUENCE DESCRIPTION: SEQ ID NO: 154 

T^T^St SS?S?r TCTGTCGC ^ GTATCCTGGC TGCGCTGGGA 
AGCGAGATCT TT^l^r ^rJZZlZ* GCTCGGTGGC TATTCCGATG 

TACAATATCC TATGGAAATC Crr^^i CGTCCCGAAG GAGGAGAAGC ACTCGTGCAC 
30 GGTTTATCCG TTAGTOGCTT rrS^S GAAGCCCTCA CGGCTGCTTT TGCCGGCGCA 
TCCGTTCTCG GCA^CAG^C C^rJrnrv Jf^TCTTTC GCAATCCTTT GGCCGGTCCG 
GGCTCGCTGG GA^GAG^r ^22^°* TTGGGTGTTG CTTTGGTCGT TCTGCTGAGC 
ATAGCCGCTG C^GOTC SSSSS? 5IS GGTTATA TGGGCGAGGT GGCCATGAAT 
oc GTGCGCAGCC AC^TTaSS SS?^ J CGTT ^TGT CAGCACCAAG 

35 GTCATCGGGG TATTCAAGTT GGCGTTATGA TCGGATATGT AGCCACTGCC 

GGGTTGGGCA GCTTTTCCCG ^rS™ GAAG AAGAT A TTCGGGCATA CGTAATTTGG 
TTGATCTTTA TrawSS? Z»^ CGGAC T CGCAACTGA GTTTCTTTGC CATTCTGATG 

SSSS a™ CAGT tgaatct ^t ATTGCTGGGA 

TCCGCCGGTT TG CT C AT C GCT ACTCGTCGGG CACGGCTGCT CGTGATCTCT 

40 ATGGCTGTGC Ir^Z^ GCCT ^TGCG GTCCCATCCG CTT TTT GG G G 

CCTGCTACC? CTT^GaTt^ Mr^JZ^ TTTCACAC ^ CGGATCATCG GATCCTGATG 
CCGGGGTTTG AGGGGGCTTT GCAATATCAT TGCTCGTATG 960 

ATCGTCACCG TTTTGTTCCG SEES?" }020 

45 

(?) INFORMATION FOR SEQ ID NOH55 

(i) SEQUENCE CHARACTERISTICS* 
cn (A) LENGTH: 2271 base pairs 

OU (B) TYPE: nucleic anid 

(C) STRANDEDNESS: double 

(D> TOPOLOGY: circular 

55 (ii» MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: HO 

60 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y P H YROMONAS GINGIVAXIS 
(ix) FEATURE: 
cc HAME/KEY: misc feature 

00 (B) LOCATION 1 ... 2271 

(r.l) SEQUENCE DESCRIPTION: SEQ ID NO:155 

70 cSgSc* 2££S5K£ JSn™ SSSE^ ^cgtcttct 

CCTCTTGCCG GTGCTCGAAT cS^^n S^ GTGTGT CGGATG <^GA AACCGGAGAG 120 
GGCGGACATT TTOA^AT^ GA^CTG^? SSS?* TAGTAGCC GG TGCCGATGCC 180 

sssss si iii F = = ssee 240 

AC*™ ££££ TCCTTACCGC SSSSS 



300 
360 
420 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



105/490 



10 



15 



20 



25 



30 



35 



40 



45 



GCCTCTTTCT 
TTCGGCCCCA 
CTCATCCTTA 
CGTATTTCTC 
TACGGATCCG 
CTGAGTGCAT 
CTCGATATAA 
GGCTGGCAGA 
GAGGAAACGT 
TCCTATTATG 
AT CTTCACTC 
GCTTCACTCG 
GTCTACGATC 
ATCAACAACC 
AACGATCAGA 
AAACTGACCG 
ACCGACAAGG 
GATTGGTTCA 
ATGACGCCTA 
GCTAACGGCT 
ATGGGTTCGC 
TATGCTTTGG 
AATGAACTTC 
CAGGGAATCA 
GATGTCCTAT 
GTGGAAGCTA 
AATGTGCACG 
GGCCGTATTC 
CGACTCGCCA 
GGTATAGACA 
GTAACGCCGG 



CGGCTCCTAC 
ATCTGATGGG 
TCGATGGTAA 
CTGATCAGAT 
ATGCCATCGC 
ATACGTCACA 
ACATCGGTAA 
ATAGTCCGTT 
ATAAGAAAAC 
CAACTAACAA 
CGACTTTTTC 
GTACGAACTA 
GCTTCCGTTT 
AAGGTCAGAC 
TCCGATACAC 
GCGGTTTGGA 
CAGATGCTTC 
ATATGACAGC 
AGGTATCCAT 
ATAAGACTCC 
ACAATCTCTA 
GCTTGGAGTA 
GCAATCTGAT 
AGAAAACCAA 
GTGATGCCTC 
AGAATCTCCA 
CCGATTGGGT 
AGAGCGAGCG 
CAT C G CAT CG 
ACCTGTTTGA 
GACGTACTTT 



TTCCGAGGCC 
CTCTTTCATG 
GCGTGTGTAC 
CGAACGGATC 
CGGGGTAATC 
TCGCATATCG 
GTTCAGTAGC 
CGAAATAAAA 
TTTTCGTGCA 
TCTTAGCTTC 
CGAAAAGAAG 
TCTTTTCCCC 
CGGATATTTG 
CGAGCAACCC 
GGCAGAGGCT 
GTATTTCCGT 
CACGCTCTCT 
CGGTTTCCGT 
ACTCGCCAAG 
CACGCTGAAA 
TCTCGGCAAT 
CAATCAAGGC 
CTCCTTTATG 
GCAGTATGCC 
TATCGGTl'GG 
GACGGATGAG 
TCACTACTGG 
Tl'ACTACAAA 
TTTCGCTCAT 
CTACGTGGAT 
CTTTGCTCAA 



TTATTGCAGG 
CAGCTGAACG 
GGCGATGTAG 
GAACTGGTGA 
AATGTGATCA 
AAGTACAACG 
AATACCAACT 
AAGAAAAAAG 
CAGGAAAATC 
AGCGGAAATG 
GCCTATGACA 
AATGGTCTGC 
TATCATGACA 
ACATTCTTTC 
CGCGGTGTAT 
GAGGAATTGA 
GCTTATGTAC 
CTGGTACACC 
TATGGGCCGC 
GAGCTTTTTG 
GCGGATCTTA 
CCTATCTCGT 
GATATACCGA 
AACATAGGAA 
GGTATCAAGT 
TGGCTGGAAG 
GGTCAGTATA 
GACGGCAATG 
TTCCGCCACA 
GATCGTCCTA 
ATAGCGATTC 



GGCTGAGTCC 
GCCTTAGCAG 
GCGGTCAGGC 
AAGGTGCTTC 
CAAAAAAGAA 
ATCGGCAAAC 
ATTTCTTCTA 
GATCCGGCGA 
AGGGTGTAAG 
TGCAGTACAA 
TGGATTATCG 
ATACGCTTTC 
AGGACAGCAG 
CGGGTCAGCT 
TTACACTGCC 
TCTCTCCCTA 
AAGATGAATG 
ATCAGGAGTT 
TGAACTTCCG 
CACGGAACGA 
AGCCACAGAT 
TCAGTGCAAC 
CCTCACCCGA 
AAGCTCGCAG 
TAGGAGCCGG 
GAGCTGCACG 
GACTTGGCGT 
CTCCGGACTA 
TCATCCTGGA 
TGGGTGTCAA 
GATTCAACAA 



GTCTTTTGAC 
TAAGTATATC 
CGATTTGAGT 
GAGTTCGCTC 
TACGAATCtiA 
CAATACTTCG 
CCATACGGAT 
ACCGGTCTTG 
CCAATCGCTT 
TAAACGTCAG 
TGCTTTGACG 
TTTCGATGCC 
TGAGAGCCTG 
ACG CAAT AAA 
TTATGCGCAG 
TAATTTGATT 
GAAACCGCTC 
CGGTACACGA 
CGCTACGTAT 
ACTCACCACT 
GTCGGATTAT 
GGTTTATGAC 
GCACGAAGCT 
CCGCGGCCTT 
ATACAGCCTC 
TCATCGTGCC 
GAGCCTTTTC 
TACCTTGTGG 
TGGAACGCTC 
TTATGCTACC 
C 



(2) INFORMATION FOR SEQ ID NO:lS6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL; NO 
(lv) ANTI -SENSE: NO 



480 
540 
600 
660 
720 
180 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
204O 
2100 
2160 
2220 
2271 



(Vi) 



50 



55 



60 



65 



70 



75 



ORIGINAL SOURCE: 
(A) ORGANISM: PORYPHYRQMOHAS GINGIVALIS 



<ix) FEATURE: 

(A] NAME/ KEY: misc featur- 

(B) LOCATION 1. . . 993 

|xi> SEQUENCE DESCRIPTION: SEQ 10 NO: 156 



ATGACGGACA 
ATCGCAGTGG 
GGACAGATAG 
CTTAGGGTAT 
CCCGACGTAG 
AACGCCAAGG 
CAGAAAGCTC 
TATGACCAGG 
GCCATCGCTA 
CGCGAAGACA 
GTGGAGTCGT 
ATATTCCCCA 
ATCGGCGATA 
GGAGCCGTTC 
ACATTCATCA 
TACGACCTGA 
CTACGCCCGG 



ACAAACAACG 
TGACGATCGT 
AAGTGACCGA 
CCGAGGGACA 
CGGCTAAGAT 
CTCTCAAAGG 
AGGCCGGCGT 
GAGTGGTACC 
CGCAAAAAGC 
AGCTGGCAGC 
ACATCAACGA 
AAGCCGGCGA 
TGTGGGCCAG 
TGGAGACTGT 
AGAACATGGG 
AGACCTTCGA 
GTATGTCCGT 



TAATATCGTA 
TGGTTTTTTC 
ATACCGAGTG 
GCAGGTGCAG 
GGAGCAGGCA 
AGCACGCAGC 
AGCCATAGCG 
GGCTCAGAAG 
GGCCGAAGCC 
TTCTGCCCTC 
AACCTACCTC 
ACTCGTAGGT 
CTTTGCCGTT 
GGTGCCGGCT 
TACCTATGCT 
GGTAAAGGCC 
GATCATACGC 



TTCCCGGCGT 
ATGCTCAGAC 
TCCAGCAAAG 
GCCGGCGATA 
AAGGCTGCCG 
GAACAGATAC 
ACCAAGACAC 
TTGGACGAAG 
CAGTACAATA 
GTCGATAOAG 
ATCGCCCCAC 
ACCGGCGCAC 
CGTGAGGATT 
CTGAATGAAG 
GCCTGGAAAG 
ACCCTTGCGG 
AAG 



TTCTCCTCTT 
CGGCCGAGGA 
TGCCCGGGCG 
CCCTCGCTGT 
AAGCAGCTGC 
AGGCAGCCTA 
ACCAGCGCGT 
CCACTGCCCA 
TGGCTCGCAA 
CGAGAGGAGC 
GGGCAGGCGA 
CTATCATGAA 
TCCTCAGCAG 
AAAAAGTACG 
CGACCAAGAC 
ATAAAGACAA 



GCTGGGAGTC 
GATTATCCAA 
CATCAAGGAA 
CATCGAAGCC 
ACAGGCTCAG 
TGAGATGTGG 
GCAGAACCTC 
GCGCGATGCG 
CGGTGCCGAA 
CGTCGCCGAG 
AGTGTCGGAG 
TATCGCCGAU 
CATGACCATG 
CTTCAAGATC 
AACAGGGCAG 
GGCACAAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
993 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



U^D ^'-ij ilVM „M , ii iD 3J1 iLJi O: a™', iL;H 4^ 



WO 99/29870 PCT/AU98/01023 

106/490 

(2} INFORMATION FOR SEQ ID HO: 157 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 801 base pairs , ^ 

IB) TYPE: nucleic acid 
(C) STRAUDEDNESS: double 
(Di TOPOLOGY: circular 

- , ■ . ^ i 

(ii) MOLECULE TYPE: DNA (genomic) r '^' 
(iii) HYPOTHETICAL: NO 
(lv) ANTI -SENSE; HO 
15 <vi) ORIGINAL SOURCE: 



10 



20 



30 



60 



(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 
CB> LOCATION \ . . . 801 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 157 



?c S£2JS TG TCAGTAATTT TTTGTTCGTC TCTTTTTCGG TTTT GCTTTT TGCATCATGC 60 

25 CGTTCCCAGC GAGAAAAGGT CGTTTACCTG CAAGATATCC AAACTTTTAA TCGGGAGATT 120 

ArSa^^ ^I5^ GT AAAAATTGAG AAGGACGATG TGCTGAACAT CCTTGTCAGC 180 

i^SS^S CGGAGCTTTC AACGCCCTAC AACCAAGTGT TGACCACTCG TGCACTGGCC 240 

^^£227 ATGGAACGAA CTCGAACGAA GGCTTCCTGG TCGATTCGAA AGGGTACATC 300 

™™T^ A TTTTAGGCCA GAT CTATGTA GAGGGCCTTA CTCGTACCGA ACTGGAGAAG 360 

SJ^ACAGA AGAGCATTAT TT CCAGTGGA TTTATCAAGG ATCCT A CGGT AACGGTGCAG 420 

^1^*— TCAAGGTGTC GGTTTTGGGA GAGGTGAATC ATCCGGGTTC GAT GTCGGT A 4 8D 

AAAGGAGAGC GAATAACTCT TTTGGAAGCG AT CGGAATGG CCGGAGACCT GACAAT CT AT 54 0 

^^?^^ CG ATCGGGTTTT TGTGATTAGA GAAACCGATG GGCATCGCGA GGTTT TCCAG 600 

35 CT fSrZl^ GAAAGGCCGA CTTGCTCGCA AGCCCCGTGT ACTATCTGCA TCAGAACGAC 660 

flSI^™ 1'° CGAA CGAC 'AAGAAA ACACAGATGA GCGAGATCAA CCAGAATAAT 720 

SSSSSS r TTACCTCC ACTTTGGTAT CCATTTCCAC GCTGACGATT 780 

40 (2) INFORMATION FOR SEQ ID NO: 158 

tl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1707 base pairs 
A - (B) TYPE: nucleic acid 

4iJ (C) STRANDEDNESS : double 

IV) TOPOLOGY; circular 

(ii) MOLECULE TYPE: DNA (genomic) 

50 Uii> HYPOTHETICAL: NO 

(iv> ANTI - SENSE : NO 

_ (vi) ORIGINAL SOURCE: 

55 < A > ORGANISM; PORYPHYROMOtlAS GINGIVAL IS 



(ix) FEATURE: 

(A) NAME/ KEY: raiscj eature 

(B) LOCATION 1...1707 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

?IS AAAA S A CCA *TCTGTT TTTATCTCTG CTGGTGATCT TTATCACCGG TAGTTTTATG 60 

fir A ?TSnSSI G CACAGAAGTC CAAGACGAAC AAACTCACCG AAGAAGATCG GAGCCGCAAT 120 

^l^ AC AGTCGATGGA TGTGCTTAGC AATATTATCG GTAACGTCAG GCTGTATTTC 180 

?ZS? A J A S5 A TAAGTATCAA ACATATGACT CGGCGTGGTA TAGATGCGAT GTTGGGCGGG 24 0 

S^n^I A I A ^ AATA CATTCCTTAC GAGGAAATGG ATGAACTGAA ATTGATGACT 300 

ATCCAGA^Ar* ^^ GT CGGAGCTATC ATATCGCAGC GCCCGGATAG TGCTGTGATT 360 

70 iJ^SSSS^ CTATGGAAGG TATGCCCGCA GACGAAGCAG CATTGATAGC AGGCGACCGC 420 

5£I2£S I™™™** AAATCCACCA CACCGAAAGT AAGCCAAGCA 480 

I A SSS GGTAC TGT TGCAAAG GTGACAGTAA TGCGCTATGG CGAAACCAAA 54 0 

SErS2£ ACGTCAAAAA GTGATTATGA ATTCCGTCAC TTACAGCGGA 600 
ATGCTCGATo GCTCGATAGG ATATATCCGC TTGAACAACT TTACGGACAA AACTfiraraR 
75 CGGCCTTGTT GGATCTTCGT GACAAACAAG GAGCGAAAGG 
75 GATTTAAGAG GCAATGGTGG CGGACTGATG CAGGCTGCTA TCGAGATAGT CAATCTGTTC 



660 
720 
780 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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WO99/29S70 
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107/ 490 



10 



15 



GTCCCTAAGG 
TTTCGCACAT 
CAATCGGCAT 
CTGATGGGAC 
AACCGCGTGA 
CGTTTGGACT 
CACAAAATCT 
GACATCGAGG 
GACGTTTTCG 
GATTTTTCCA 
TTTGACTATG 
GAAGGCTACC 
AACCTGTCGC 
ATTGTCACTC 
GTAGTCAAAG 
GCAGCTCCGA 



GCAAAGAGGT 
TGACTGAACC 
CTTCCTCGGA 
AAAAGAGCTA 
TCAAATTGAC 
ACAGCCGCAC 
TTTACACTGC 
TCAAACAAGA 
ATTTCGTCAC 
TAACGAACGA 
ATCGCCAGAG 
TGCCGGAAGC 
GTGATCTGCT 
GCTATTATTA 
AAGCTATTAA 
AAGCAGAGAA 



GGTAACGACC 
GATCGACACG 
GATTGTAGCC 
TGGCAAAGGG 
TACGGCCAAG 
CAATCGGACA 
TGCCGGAAGA 
TACAGCTGCG 
AGGTTATGTG 
GGACTATGCA 
TGGCAAGATG 
CAACfCGGAG 
ACGATTCAAA 
TGAGCGAGGC 
GCTGCTGAAG 
TAAAGGG 



AAAGGTCGCA 
AAACTCCCGA 
GGAGCACTGC 
CTTGTACAAA 
TACTACATCC 
GGTATGGCAA 
CGTGTAGAAG 
ACATTACTTT 
CTCAAGCATA 

gctttctgca 

CTTGACAAAC 
CTTAAAGCAC 
AAGGAGATAA 
AGTATCCGCC 
GACCATCCGG 



TTGGAGAGTC 
TAGTAGTCCT 
AGGATATGGA 
CGACTCGTCA 
CAAGCGGACG 
CGGCCATTCC 
ATGCAGGAGG 
ATTATATGGC 
AAACGATTCC 
AGATGATGGA 
TGGAGGAACT 
TACGCGAAAA 
CAAACTATCT 
AGAGTTTGCC 
AACAAATTCG 



GGCGTCCGTA 
GATCGATGGA 
CAGGGCTGTA 
GCTACCATAC 
TTGTATTCAG 
TGACAGTCTG 
AATCCTGCCT 
CATCAATAAT 
CAAGCCGGAG 
AGAAAAGAAA 
GGCTAAGATA 
GCTAAAACCC 
CAACAATGAG 
GGAAGATAAG 
ACAGATCCTT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1707 



20 



25 



(2) INFORMATION FOR SEQ ID MO:159 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 3 base pairs 
(8 ) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



30 



(ii) MOLECULE TYPE: DMA (genomic) 
Mil) HYPOTHETICAL: HO 
(Iv) ANTI-SENSE: NO 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(V±) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIOUAS G1NGIVALIS 

lix) FEATURE: 

(A) NAME/ KEY : mi sc_ feature 

(B) LOCATION 1. . .2943 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 159 



ATGCAAAACA 
TACCTGTCAT 
GTTGCCGGAA 
Cl'GAAAGAAG 
GTTATCTTGA 
CCCAACTTCA 
GATATTTTCG 
GGTTCGGGTG 
CTGCTCAAAG 
ATCGATGCTT 
CTTGTCGAAC 
GCCAACCTAC 
GCCAATGATC 
CCTGCAACTA 
GATTCTGCTA 
TCTCTGCTTA 
ATGGCTCAGA 
GTGCTTTTCC 
CTCTACGAAC 
GTAGTGACTT 
TCGATGACGA 
GGACGGGCAA 
GAGATCACGG 
CTTGCCAACG 
GTGATTGGTC 
GCTTTGGTTA 
GCAAACGGCG 
GCCGTGCTGA 
GCCAACGTAC 
CGTGCCGTTA 
ACT ATT ATT A 
ACTACGTTGA 
GTCTTCGAGA 
ACTCGCAATC 



AAGGATTTGT 
TCTCTTTCGT 
TGGCCTATCT 
CTCAAGCCCA 
AACTTAACGC 
ACAAAGCTCT 
TGAAGGAATA 
ACCTTCGCGA 
AAAAATATAA 
TCGGTGTGGT 
TCCCCGGAGT 
AGTTCTGGCG 
GTCTGAGCGA 
CTGACT'CTGT 
CTGTAGCACA 
CTCCCGTGAA 
TATCTGAAAT 
TCTGGGGTGC 
TCTATGCTAT 
CCGCCAAGAG 
TGAATGAAGA 
TCGCTATCGT 
G CGGTCGCTC 
TACTCAACTC 
CTACGCTGGG 
TCCTGATGTG 
CATTGATTGT 
CCCTCTCGGG 
TTATCTTCGA 
CGGATGGTTA 
CCGGTATCAT 
TTATCGGTCT 
AACTGGCGAA 
TCCTT GTCAA 



GATTGTTATC 
TACGAACCGT 
TGATTCCATG 
GCAAATTGGT 
AAGCGATCTG 
GGAGAATGCT 
TCGCAAGCTC 
CCAGATTACC 
TAGTGCTGTA 
TGCACCTAAT 
GAAAGACCCT 
TACATACAAA 
ATTGGCTATG 
AGCTGCAACA 
AAAAGAGGCC 
TCGTGGCGGT 
GCTCCAGCAA 
TAAAGCAATC 
TCGTACCAAT 
TGATATCCAA 
AGGTGCTCGT 
TTTGGATGGT 
TCAGATCTCC 
CGGTAAAATG 
TGCCGAG'1*CC 
TTACATGTGT 
AAACAGCTTC 
TATCGCAGGT 
GCGTATCAAA 
TGGCAACGCT 
CCTATTCCTC 
TATCGCTTCT 
AAAAGGTCGT 
TCCCTCATAC 



ACATCGGCTC 
TACGAAAAGA 
TCGAATGAGA 
CTTGGCCTTG 
CTTCGTAACC 
GCCAAGAGCA 
GATCCCAACG 
GCAAAGTCTA 
GAAGCTTCGT 
TTGCAGCGAT 
GAGCGTGTTC 
TTCGAAGAGG 
AACAACACGG 
GCCGATTCTG 
AAGGATGCTA 
GCAGTAGTGG 
GCTCACGATC 
GAAGACCCCG 
CGTACGGGAG 
AATGACTTCG 
AAATGGGCGC 
GTGGTTTATT 
GGGCACTTCA 
GATGCTACGG 
ATTAAAGCCG 
CTGGCTTACG 
TTCACATTGG 
TTGGTGCTGA 
GAAGAGCTTC 
TTCTCTGCCA 
TACGGGACGG 
TTCATTACGG 
TTGGATAAGA 
AACATCTTGG 



TGGCCATCAT 
AGGCTAAGGC 
AGGTCTGGTT 
ACTTAAAGGG 
TCTCTAACAA 
CGGAGCAATC 
GTCGCTTGGC 
CGGATGCAGA 
TCAATGTGCT 
TGGAAGGGCA 
GTACCCTTTT 
TCAGCGGAGA 
ATGCTACCCC 
CTGCTGTACA 
CTCGTAAAGA 
GTGTGGCTCG 
TGAAGGTTAC 
AAACCAAAAA 
ATCCTGATTT 
GTCGTTCCGA 
GTATCACAAA 
CTGCTCCGAA 
CCGTGGAGGA 
TAAGCATCGA 
GATTCTTGTC 
GTTTCTTGCC 
GCGTATTGGC 
CGCTGGGTAT 
GTGCCGGTAA 
TCTTCGACTC 
GGCCGATTCG 
CTGTCTTCTT 
TTACATTCAC 
GTAAGCGCAA 



CTGTGCGTTT 
GATGGGCGAT 
CGGCTACACG 
GGGTATGAAC 
AAGTTTGGAT 
CGACTTCATC 
CGTTATCTTC 
CGTAGTGCGT 
CCGTGCTCGT 
AGGGCGTATC 
GCAACGCAGT 
CTTGATCGCT 
GGAAACAGAG 
AGCTGTAGCT 
CGCACTCTTC 
TCGTGCTAAT 
ACGTGAAGAT 
GGAGACCGAC 
GGGAGGTGAT 
ACCGATCGTT 
GGATAACGTG 
CGT GAATG AT 
GGCCGGTGAC 
ACAGGAAAAC 
GTTCCTGCTC 
GGGTCTTATC 
TTCTTTCCAT 
GGCTGTGGAT 
GACTCCGATT 
GAACGTTACG 
CGGTTTTGCC 
GACTCGTATC 
TACGAGCATT 
GACCGGCTTT 



60 
120 
IflO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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ATCATTCCGG 
AGGGGTATTG 
TCCGAAGCCG 
ATCGGTACTG 
GAAGAAACTG 
CAGCAGCCTA 
AGTATGTCGA 
ATGGCCATTT 
TCTGTGGCCG 
CCCTTCACCA 
TCGCTCAATG 
AACAGAGATC 
AATACGTCTT 
ATGCGTAGTT 
TTTGTTGCTA 
AAA 



TGATTATCAT 
AATTCTCCGG 
TTCGTTCGGC 
AAGGGACAGA 
AAGCAGAGAT 
CTGCTGATCA 
GTGACATCAC 
ATATTCTGAT 
CTACTACATT 
TGGAGATCGA 
ACACCGTGGT 
GCTATCAGGT 
TGACTACGTT 
TCACGTTCTC 
CACCCCTTGC 



CGTTTTGGGA 
AGGACGTAAC 
CTTGTCTTCT 
GGTGCGTATA 
TACTGACAAA 
GTTCTTGGAC 
GAGAGGTGCT 
TCGCTTCCGT 
CTGCATTATT 
TCAGAACTTC 
TGTATTTGAC 
GATCAACGAT 
TATCGTTATG 
GATCCTGCTC 
CTACGAGATC 



CTTATAGCTT 
TACGTAGTTA 
CCCCTGCAGG 
TCTACGAACT 
TTGTATCAGA 
AATATCATTA 
ATTTGGGCTG 
GACATTTCTT 
GCTCTGTATG 
ATCGCTGCTA 
CGTATCCGAG 
GCCCTTAATT 
TTGGTAATCT 
GGTATCGTTA 
CAAAAGCGCA 



CATTTACAAT 
AATTCGACCA 
AAAAGGTATT 
ATAAGATCCA 
GCCTGAAAGG 
GCTCTCAGAA 
TGCTGTTATC 
TCTCT GCCGG 
CGTTG CT GTG 
TTCTGGCTAT 
AGACGATGAA 
CAACATTGGG 
TCATCTTTGG 
TCGGTACATA 
AGCTCAACAA 



CGGTCTCAAT 
GCCTGTATCT 
GGTTACCTCC 
GGAGGAAAGC 
TTTCTACACC 
AGTAAGTCCC 
GATGATCTTC 
GGTATTCGTA 
GAAGATTCTG 
CATCGGTTAC 
ATTGTACCCC 
TCGAACATTA 
AGGTGCTACG 
CTCTACGCTC 
AGCAGCTAAG 



2100 
2160 
2220 
22AO 
2340 
2400 
2460 
2520 
2S80 
2640 
2700 
27 60 
2820 
2880 
2940 
2943 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 160 

(i) SEQUF.NCE CHARACTERISTICS; 

(A) LENGTH: 3051 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 



30 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



35 



40 



45 



50 



55 



60 



65 



70 



75 



<VU ORIGINAL SOURCE: 

(A) ORGANISM: P0RY PHYROHONAS GINGIVALIS 

tlx) FEATURE; 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...3051 

(xij SEQUENCE DESCRIPTION: SEQ 10 MO.160 



ATGAAAAGAA 
CAGAATAGAA 
GCGAATGTCG 
TTCACGCTTA 
ACCAAAGAGG 
CTGGAGCAGG 
TCTGTGGCCA 
GCCCTCCAAG 
GTCGCTTCTG 
ATCGTGGATG 
GAATCTATGT 
AACGGAGTCG 
TTTAATGCCA 
ACTGGAGATG 
GTTCAGAAGG 
TCTTTGAAAG 
TGGCTCAAGG 
GGGTCTCAGG 
CGTGAACCGG 
GAATGGCTGA 
TATTTTGGAA 
TATAACCCTT 
AGACCTTCTA 
CAGGCCAATG 
CAGGCTGGTG 
TATGATTCTA 
TTTACAAATA 
ATGGGGCATG 
GAAAGTGAYA 
GAACACAGAG 
GACAAATGGA 
AATAATAGAA 
ATTCAAGAAA 
AACTCGGAGA 



TGACGCTATT 
CCGTGAAGGG 
TGGTTGTCGG 
GCGTGCCTGC 
TCGCCATCGC 
TAGTTGTATT 
AAGTGTCCAG 
GTCAGGTAGC 
TGGAGATCGA 
GTATGCAAAC 
CCGTTTTGAA 
TTTTCATTCA 
GTTACGGGAT 
AATTGCTGGA 
TTAAAGATAT 
ATGAGTATGG 
CTTTGTTTAA 
GAACTTCATA 
CAAATTTTAA 
AAGTTGGTGC 
AGTATTATAT 
TTGATGTGAA 
TGACAGAACC 
TAAATGGTTT 
TTGATATTAC 
CTCCTCTTGG 
CGGCTGAATA 
AATATATTGA 
AGTTGATGTT 
TCGCTGAATA 
TGTATATAGA 
GCGCGTGGTT 
GTAATTGGCT 
TTGGTAATTA 



CTTCCTTTGC 
TACAGTTATC 
AAACACCACT 
CAATGCCAAA 
TAA'i'GTGATG 
GGGTTACGGT 
CGAAAAGCTC 
CGGTATGCAG 
TGGTACAGGG 
TTCTTTGGAT 
AGATGCTTCT 
AACGAAGAAA 
TTCTCAAATC 
TTTTCAGGTG 
GATCCTTGCC 
TAAGACATTG 
AACAGCACCC 
TTATGCCTCT 
GCGCTATAGT 
AAATTTGTCT 
GGGGTCAGGT 
TGGGGATTTA 
GTACTTCGCA 
CGCCCAGATT 
TAATACTCGC 
GGAAAGAAGA 
TAAGTTTTCA 
ATATGAAGGG 
ACT GAGCCAG 
TGCCTATTTG 
TTTCTCTGTT 
CTATTCTGTC 
CAGTGATCTT 
CAACCACCAA 



TTGCTGACGA 
TCCTCCGAGG 
ATCGGTGCTG 
ATGTTGAGAG 
AAGATCGTAC 
ACGGGACAGA 
GCGGAAAAGC 
GTTATGACTA 
TCGTTGGGGG 
GTTGTGGCTA 
GCAACATCTA 
GGTAAAAT6A 
CTGAATACTA 
AAGGCAGGTT 
GGAGCTGAAG 
TTCCCAGTGG 
ACCAGTCAAG 
ATAGGCTACT 
GGCCGGCTCA 
GGTGCGATAG 
ACTTTCCGTG 
GCAGATGTCT 
AAAATGAGAC 
ACTCCGATCA 
ACTTCTTCTA 
GAAAGAGCTT 
ATTGATGAAA 
GATGTTATTG 
GGAAAAACCG 
TCT TTCTTT A 
CGTAATGACC 
GGTGGAATGT 
CGACTGAAAA 
GCACTCGTTA 



CCATTGGGTG 
ATAATGAGCC 
CAACCGACTT 
TGTCCTATTC 
TCGATCCGGA 
AACTCAGCAC 
CCGTTGCCAA 
CATCCGGTGA 
CAAGCTCTGC 
CGATGAATCC 
TTTATGGAGC 
GCGAGAGAGG 
AGCCCCTTGA 
TTTGGGGGAA 
ATTTGTATGG 
ATTTTAATCA 
GTGATATTTC 
TCGATCAGGA 
ACTTCGAAAG 
CGAATAGACG 
TGTTAACGAT 
ATTACATGTA 
CGTTCAGTTC 
AAGGCCTTAC 
AGAGAATGCC 
ATCGAGATGT 
AACATGATCT 
GGGCATCTTC 
GAAATAGTTT 
GTCGTTTTAA 
AATCCTCTCG 
TTGACATATA 
TGAGTTATGG 
CTGTGAACAA 



GGCTATGGCC 
CCTGATCGGC 
GGATGGCAAC 
CGGTATGACT 
CTCTAAGGTT 
TGTTTCCGGT 
TATCATGGAT 
CCCTACTGCC 
ACCATTGTAT 
GAATGATTTT 
TCGTGCTGCA 
TCGTATTACC 
TAATATGATG 
CAATCAAACC 
CAATTATGAT 
TGATGCAGAC 
TTTCTCCGGA 
AGGTATGGCT 
TCGTATCAAT 
ATCTGCCGAC 
GCCTCGTTAT 
TGGAGCTACC 
CGAATCACAT 
TTTAAAGGCA 
CAATAATCCG 
TAGCAAGTCT 
TACAGCATTG 
TAAAGGATTT 
GTCTTTGCCT 
TTACGGTTTT 
ATTCGGATCC 
TAATAAATTC 
TACAACGGGT 
TTATACTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1390 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
2040 
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GATGCTATGG 
TCTCAGTTCA 
GATTTCTATG 
GGTTTCTTCT 
CTTAAGGGGA 
TACAATAGAC 
ACCGGTACTA 
ATCGACAAAA 
AATAAAGTTA 
ACTCCTCCTA 
GCTGATTTTG 
AATGCAGGTG 
GATAATAAAG 
TTGGAGAATG 

agtctttttg 

CTGTTAACTG 
AATCAATATC 



GGCTTAGCAT 
ACTTCGGTTT 
TCCGCACTAC 
CACAGTATCA 
CGATCTACCA 
AGGAAATAAC 
TATGGGAAAT 
AAACCGGTAA 
CAACAAGCCA 
TTACAGGTGG 
CCTACATCGT 
GATTGATGCA 
AAACAGATGT 
CTTCTTTCCT 
CTGGGCAGAA 
TTACGAAGTA 
CTAATTCTAA 



TTCTACAGCA 
GGCTGCAGGG 
GAATGATATG 
GAATGTAGGC 
AAATAAGGAC 
AAAGCTTTTC 
TGGGTACCCC 
GCAGTTGTGG 
GTACTCAGCT 
TTTCTCCTTA 
TGGTAAATGG 
ATTAAATAAA 
TCCAAAATTG 
GCGTTTGAAG 
TGTGATTGGT 
TAAAGGCTTT 
GCAGTACGTT 



GGCAATCCCG 
GCTTTCAATA 
TTGATTGATG 
TCTATGAAAA 
TCGAATGTAT 
TTCGGTCTCA 
AATTCGTTCT 
TATGTTCCTG 
GACTTGGAGA 
GGTGCTTCTT 
ATGATCAATA 
GATAAAATGC 
GGACAGTCTC 
AATCTCAAAC 
GGAGCTCGTG 
GACCCTGAAG 
GCGGGTATTC 



ACCTCTCGTG 
ATCGCTTATC 
TCCCGATGCC 
ATACGGGTGT 
ATGCtTCTGC 
ATAAGTACAT 
ATATGGCTGA 
GTCAAGTCGA 
CACGAATTGA 
GGAAAGGACT 
ATGACCGTTA 
TATTGAATGC 
CTCAGTTTGA 
TCACCTATGT 
TCTATTTGAT 
CAGGGGGGAA 
AGTTGTCTTT 



GGAAAAGCAG 
TGCAGAGGTA 
TTATATCAGT 
AGACCTTTCT 
GAATTTCAAC 
GTTGCCTAAT 
ATATGCTGGA 
TGCGGATGGT 
TAAGTCTGTT 
TTCTTTAGAT 
CTTTACAGAG 
CTGGACAGAG 
TACGCATTTG 
ACTCCCCAAT 
GGCGCGCAAT 
TGTGGGAAAA 
C 



(2) INFORMATION FOR SEQ ID HO: 161 

(1) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 3042 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
ID) TOPOLOGY: circular 

(li) MOLECULE TYPE: DHA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

< ix ) FEATURE : 

fA) NAME/ KEY : miscjeature 

(B) LOCATION 1...3042 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 



ATGACGCTAT 
ACCGTGAAGG 
GTGGTTGTCG 
AGCGTGCCTG 
GTCGCCATCG 
GTAGTTGTAT 
AAAGTGTCCA 
GGTCAGGTAG 
GTGGAGATCC 
GGTATGCAAA 
TCCGTTTTGA 
GTTTTCATTC 
AGTTACGGGA 
GAATTGCTGG 
GTTAAAGATA 
GATGAGTATG 
GCTTTGTTTA 
GGAACTTCAT 
GCAAATTTTA 
AAAGTTGGTG 
AAGTATTATA 
TTTGATGTGA 
ATGACAGAAC 
GTAAATGGTT 
GTTGATATTA 
ACTCCTCTTG 
ACGGCTGAAT 
GAATATATTG 
AAGTTGATGT 
GTCGCTGAAT 
ATGTATATAG 
AGCGCGTGGT 
AGTAATTGGC 



TCTT CCTTTG 
GTACAGTTAT 
GAAACACCAC 
CCAATGCCAA 
CTAATGTGAT 
TGGGTTACGG 
GCGAAAAGCT 
CCGGTATGCA 
ATGGTACAGG 
CTTCTTTGGA 
AAGATGCTTC 
AAACGAAGAA 
TTTCTCAAAT 
ATTTTCAGGT 
TGATCCTTGC 
GTAAGACATT 
AAACAGCACC 
ATTATGCCTC 
AGCGCTATAG 
CAAATTTGTC 
TGGGGTCAGG 
ATGGGGATTT 
CGTACTTCGC 
TCGCCCAGAT 
CTAATACTCG 
GGGAAAGAAG 
ATAAGTTTTC 
AATATGAAGG 
TACT GAG CCA 
ATGCCTATTT 
ATTTCTCTGT 
TCTATTCTGT 
TCAGTGATCT 



CTTGCTGACG 
CTCCTCCGAG 
TATCGGTGCT 
AATGTT GAGA 
GAAGATCGTA 
TACGGGACAG 
CGCGGAAAAG 
GGTTATGACT 
GTCGTTGGGG 
TGTTGTGGCT 
TGCAACATCT 
AGGTAAAATG 
CCTGAATACT 
GAAGGCAGGT 
CGGAGCTGAA 
GTTCCCAGTG 
CACCAGTCAA 
TATAGGCTAC 
TGGCCGGCTC 
TGGTGCGATA 
TACTTTCGGT 
AGCAGATGTC 
AAAAATGAGA 
TACTCCGATC 
CACTTCTTCT 
AGAAAGAGCT 
AATTGATGAA 
GGATGTTATT 
GGGAAAAACC 
GTCTTTCTTT 
TCGTAATGAC 
CGGTGGAATG 
TCGACTGAAA 



AGCATTGCGT 
GATAATGAGC 
GCAACCGACT 
GTGTCCTATT 
CTGGATCCGG 
AAACTCAGCA 
CCCGTTGCCA 
ACATCCGGTG 
GCAAGCTCTG 
ACGATGAATC 
ATTTATGGAG 
AGCGAGAGAG 
AAGCCCCTTG 
TTTTGGGGGA 
GATTTGTATG 
G ATT TTAATC 
GGTGATATTT 
TTCGATCAGG 
AACTTCGAAA 
GCGAATAGAC 
GTGTTAACGA 
TATTACATGT 
CCGTTCAGTT 
AAAGGCCTTA 
AAGAGAATGC 
TATCGAGATG 
AAACATGATC 
GGGGCATCTT 
GCAAATAGTT 
AGTCGTTTTA 
CAATCCTCTC 
TTTGACATAT 
ATGAGTTATG 



GGGCTATGGC 
CCCTGATCGG 
TGGATGGCAA 
CCGGTATGAC 
ACTCTAAGGT 
CTGTTTCCGG 
ATATCATGGA 
ACCCTACTGC 
CACCATTGTA 
CGAATGATTT 
CTCGTGCTGC 
GTCGTATTAC 
ATAATATGAT 
ACAATCAAAC 
GCAATTATGA 
ATGATGCAGA 
CTTTCTCCGG 
AAGGTATGGC 
GTCGTATCAA 
GATCTGCCGA 
TGCCTCGTTA 
ATGGAGCTAC 
CCGAATCACA 
CTTTAAAGGC 
CCAATAATCC 
TTAGCAAGTC 
TTACAGCATT 
CTAAAGGATT 
TGTCTTTGCC 
ATTACGGTTT 
GATTCGGATC 
ATAATAAATT 
GTACAACGGG 



CCAGAATAGA 
CGCGAATGTC 
CTTCACGCTT 
TACCAAAGAG 
TCTGGAGCAG 
TTCTGTGGCC 
TGCCCTCCAA 
CGTCGCTTCT 
TATCGTGGAT 
TGAATCTATG 
AAACGGAGTC 
CTTTAATGCC 
GACTGGAGAT 
CGTTCAGAAG 
TTCTTTGAAA 
CTGGCTCAAG 
AGGGTCTCAG 
TCGTGAACCG 
TGAATGGCTG 
CTATTTTGGA 
TTATAACCCT 
CAGACCTTCT 
TCAGGCCAAT 
ACAGGCTGGT 
GTATGATTCT 
TTTTACAAAT 
GATGGGGCAT 
TGAAAGTGAT 
TGAACACAGA 
TGACAAATGG 
CAATAATAGA 
CATTCAAGAA 
TAACTCGGAG 



2100 
2160 
2220 
2280 
234 O 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3051 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
. 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
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ATTGGTAATT 

GGGCTTACCA 

AACTTCGGTT 

GTCCGCACTA 

TCACAGTATC 

ACGATCTACC 

CAGGAAATAA 

ATATGGGAAA 

AAAACCGGTA 

ACAACAAGCC 

ATTACAGGTG 

GCCTACATCG 

GGATTGATGC 

GAAACAGATG 

GCTT CTTTCC 

GCTGGGCAGA 

GTTACGAAGT 

CCTAATTCTA 



ACAACCACCA 
TTTCTACAGC 
TGGCTGCAGG 
CGAATGATAT 
AGAATGTAGG 
AAAATAAGGA 
CAAAGCTTTT 
TTGGGTACCC 
AGCAGTTGTG 
AGTACTCAGC 
GTTTCTCCTT 
TTGGTAAATG 
AATTAAATAA 
TTCCAAAATT 
TGCCTTTGAA 
ATGTGATTGG 
ATAAAGGCTT 
AGCAGTACGT 



AGCACTCGTT 
AGGCAATCCC 
GGCTTTCAAT 
GTTGATTGAT 
CTCTATGAAA 
CTGGAATGTA 
CTl'OGGTCTC 
CAATTCGTTC 
GTATGTTCCT 
TG ACT TGG AG 
AGGTGCTTCT 
GATGATCAAT 
AGATAAAATG 
GGGACAGTCT 
GAATCTCAAA 
TGGAGCTCGT 
TGACCCTGAA 
TGCGGGTATT 



ACTGTGAACA 
GACCTCTCGT 
AATCGCTTAT 
GTCCCGATGC 
AATACGGGTG 
TATGCTTCTG 
AATAAGTACA 
TATATGGCTG 
GGTCAAGTCG 
ACACGAATTG 
TGGAAAGGAC 
AATGACCGTT 
CTATTGAATG 
CCTCAGTTTG 
CTCACCTATG 
GTCTATTTGA 
GCAGGGGGGA 
CAGTTGTCTT 



ATTATACTGA 
GGGAAAAGCA 
CTGCAGAGGT 
CTTATATCAG 
TAGACCTTTC 
03AATTTCAA 
TGTTnCCTAA 
AATATGCTGG 
ATGCGGATGG 
ATAAGTCTGT 
TTTCTT TAG A 
ACTTTACAGA 
CCTGGACAGA 
ATACGCATTT 
TACTCCCCAA 
TG3CGCGCAA 
ATGTGGGAAA 
TC 



AGATGC7TATG 

gtctcagttc 
agatttctat 

TGGTTTCTTC 
TCTTAAGGGG 
CTACAATAGA 
TACCGGTACT 
AATCGACAAA 
TAATAAAGTT 
TACTCCTCCT 
TGCTGATTTT 
CAATGCAGGT 
GGATAATAAA 
GTTGGAGAAT 
TAGTCTTTTT 
TCTGTTAACT 
AAATCAATAT 



(2) INFORMATION FOR SEQ ID NO: 162 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 63 base pairs 
(Bj TYPE: nucleic acid 
(C) STRANDEONESS: doubl ~ 
fD) TOPOLOGY: circular 

(ii) MOLECULE TYPE : DNA (genomic) 

(iiij HYPOTHETICAL: HO 

(iv) AHTI-SE1ISE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 
fix) FEATURE: 

(A) 1IAJ IE/KEY: misc feature 

<B) LOCATION 1 24 63 

0:i> SEQUENCE DESCRIPTION: SEQ ID HO:162 



ATGAAGAAAA 
AGCATGCAGG 
TTCTCATTCA 
GACGAGTTTG 
GCATTCCTGC 
GTCUAGATGT 
GAAGCCTATA 
TCCGTCATTC 
GCCAATACAG 
ACCGGATATG 
CCGAGGATTA 
G ACGGAG CAT 
AATGTCGATT 
CCTTGGCAGC 
GGTGCCAATA 
AGACATGCAG 
AGTATTTCCG 
TGGTATGTAA 
ATTACCCTTG 
AATCTGAACA 
ACAGTGACTT 
GACAAGCGCA 
AAAGCCGGCA 
ATTACGATCA 
CTGCCTGTAC 
TTCCAAGATC 
CCTGCCATGA 
GTTACGCTCA 
ACTGTCAATG 
GGCCAGACGG 
GATGGCGGCA 
GGCCAATACG 



AGAATTTTTT 
CACAACAGGC 
AAGTGCCTGA 
ACCCCGTAAC 
GCTATGGGCT 
TCGACTACCG 
-TCAGCATGAT 
GCAAGTCCGT 
GGAAAAAGCC 
TGGTACTGCT 
AGAACATTCT 
ACAGAGCCGG 
TGAACCGTAA 
CGGAGGCAAC 
TACATGGAGG 
ACGATGAGTG 
CCAGCTACAT 
TTCGCGGAAG 
AAATCAGCAA 
AAGAATCTCT 
CCGCTGCGAA 
ACTCCGATGT 
CTTATACGGT 
AGGACAAAGA 
CCGATTTCAC 
AAACGACAAA 
GTACAGAGCA 
AAGTGTGGAA 
CCGTTATGCC 
TATCTTTCCA 
CTCCCGCTAC 
ATGTCACGCT 



GCTTCTTGGC 
CAAAGATTAT 
TAGGGCCGTT 
CAATGAAGCC 
CAAGCCTACA 
CTCAGGAGAA 
GGAAGAGTTC 
AAAGGATCGT 
TCGCGTGCTC 
CCGACTCATA 
GGATAAAACG 
AAACCACACC 
CTTCAAGGAT 
TGCATTCATG 
AACAGAGGTG 
GTACAAACTG 
GACCTCCGAA 
TCGTCAGGAC 
CACGAAGTTG 
GCTTGCTCTG 
CGGACAGCCT 
TTACTCCGAT 
GAAATACAAA 
AACCGTCATC 
AGCTTCTCCT 
TAACCCCACG 
GAATCCGCTC 
TGCAAGTGGT 
TGTAGCTGAA 
AAACCAATCC 
CAGTGAAGAC 
CAAGGCGATC 



ATTTTCGTTG 
TTCAACTTTG 
CTACAAGAGC 
ATTGCCTATG 
TTCTTGACTC 
AAATACGAAT 
CAAACAAAGT 
AAACTGATGA 
TATACTTCTA 
GACCATCTGC 
GAAGTATGGA 
GTACAAGGAG 
GATGTAGCCG 
GATTTGGAAG 
GTGAACTATC 
ATCAGTCGCA 
ACCAATTCGG 
AATGCAAATT 
GTGCCGGCCT 
ATCGAAGAAT 
CTCAAATGCC 
GCTACCACAG 
GCCGAGGGTT 
ATGGACATTG 
AT G ACCAT CT 
AATTGGGAGT 
GTATCCTATA 
TCCAACACGA 
TTCGTCGGTA 
ACCAATGCCA 
GAAAACCCGA 
AGTGCTTCCC 



CTTTGCTGAC 
ACGAACGGGG 
TGGCTCTGAT 
CCAGCGAAGA 
CTCCATCCAT 
GGAATGCTTA 
ATCCATCACT 
TTTGCAAGCT 
CCATGCACGG 
TGTCGAACTA 
TCTGCCCTTT 
CTACTCGCTA 
GTGATCACCC 
GAAACACCTC 
CATGGGATAA 
ACTACGCAGC 
GAATCATCAA 
ATTTCCATCG 
CTCAACTTCC 
CCTTATACGG 
AGATCTTGAT 
GCTACTACGT 
ATCCTGAGGC 
CATTGGGCAA 
CAGTAGGCGA 
GGACGTTCGA 
GTCATCCCGG 
TTACGAAAGA 
CCCCGACGGA 
CCAACTACGT 
CTGT6CTTTA 
GTCAAACGGT 



TTTCATCGGC 
CGAGGCCTAC 
CATGTCCATC 
GGAGTTCGAG 
GCAGCGCGCT 
CCCCACCTAT 
TTGTACTACT 
GACGTCCTCT 
AGACGAAACG 
CGAATCCGAT 
GACCAATCCG 
CAATGCCAAC 
CGATGGAAAA 
TTTCGTGCTC 
TAAAAAAGAA 
CGCTTGTCAG 
CGGTTCAGAC 
TCTGCGAGAA 
AAAGTATTGG 
CATCCATGGT 
AGAAAACCAT 
ACGTCCTATC 
AACTCGTACC 
CTCGGTTCCT 
AAGCGTCCAA 
AGGCGGACAG 
TCAGTACGAC 
AAAATTCATC 
AATAGAAGAG 
ATGGATATTC 
CAGCAAAGCC 
GAAGACGAAA 



20-IO 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3042 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1970 
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GAAAAATACA 
ACACCTCGAA 
CCGACTTCAT 
CCGGTGGTCA 
GGCGGAAGCA 
GAGGACATAG 
CTCATAGAAG 
GTACTCAAAA 
GGAATCTACA 
GGG 



TCACTGTCAA 
AAGTAAAGAA 
GGCTTTGGGT 
CCTACAATGA 
ATGTGAAGAA 
TGGCACAGAC 
CCAACGCTGC 
CTACTCCGAA 
CCATCAATAT 



GAAAGCT CCG 
AGGCGAGACA 
GTTCGAAGGC 
AACAGGCAAG 
AGCAGAAGAC 
GGGTATCGTC 
TATCAAAGCG 
TCAGCTCCGC 
CAAAACGGAA 



GTCCCTGCTC 
GTTACTTTCA 
GGCTCTCCTG 
TACGATGTCC 
TACATTGAGG 
ATTCGTCCGC 
ATCGTTCTCT 
TCGACCGTAG 
AAATCCGCT C 



CGGTAGCCGA 
AAGACTTGTC 
CCACCAGCAC 
AGCTGACTGC 
TTATCCTCGA 
AAAACGGAAC 
ATGACATCAA 
ATCTTTCCAT 
GCACGGAAAA 



CTTCGAAGGA 
TACGAACAAT 
GGAGCAAAAC 
CACCAACGAG 
TGACAGTGTC 
GAAGCAGATC 
TGGACGGGTC 
CCTGCCCGAA 
GATCCATATC 



1980 
2040 
2100 
2160 
2220 
2280 
234 0 
2400 
2460 
2463 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 163 ' 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 8 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

<iv) AIITI-SEttSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: PORYPHYROMOHAS GINGIVAL! S 

(ix) FEATURE: 

(A> NAME/ KEY : mlsc feature 

(B) LOCATION 1...228 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

ATGCGACTGA TCAAGGCTTT TCTCGTGCAA CTCTTACTGC TCCCCATTTT CTTCTACAAG 
CGGTTTATAT CGCCGCTTAC ACCGCCTTCA TGCCGGTTTA CCCCCTCATG TTCGTCCTAT 
GCCATCGAAG CCTTACGTAA AT AT GGCCCG GGCAAAGGAC TATTGCTGAG CATCAAGCGT 
ATTCTCCGCT GTCACCCGTG GGGTGGAAGT GGCTATGACC CCGTTCCG 

(2) INFORMATION FOR SEQ ID NO: 164 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

tii) MOLECULE TYPE: DNA (genomic) 

Uit) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: FOR Y PHYROMONAS GINGTVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 
(BJ LOCATION 1...2577 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



ATGGCATACG 
ACCCGTCTCG 
GGAGACAATG 
CGGTTGATCG 
ATCTTCACCC 
GAGGACGAGG 
AGCTTAGTAC 
TACTTCGGAC 
GGGTACCAAG 
AGCGGGACAG 
ACCACCACGA 



ACTTTACACA 
GCGTCGTAGC 
GCGCGATCGA 
AGTTGGAAGC 
CTTCGGCTCG 
CCGTCAGCCC 
AAAAGATATT 
AGCGCAACCC 
ACAACGACTT 
GCGGAGGCTC 
CACGGAGTGG 



AACATTCCGC 
CGTGACGCAA 
CATCATGCGG 
CATCGCCGAG 
GGAGGCTATC 
GGTCCATCTG 
TAT GAAACAA 
CTCCGAAGGG 
CGACGACGAA 
CGGCGACGCC 
AGGCGACACG 



AACAGCCTGG 
GATATGCTCG 
CACTATGGGA 
AGTTTGCCTG 
GAT GATGCCA 
TTG CTGAGTA 
GGTATAAAAT 
AAGTCTCCCT 
GAGGACGAAT 
CCCGAACAGA 
CCTGCACTGG 



AGTACAGCTA 
TACTCGGTAT 
TCAACTTGTA 
CTTCGCCTGA 
CAGACATCTG 
TCCTCAACTC 
ACGACACCAT 
CCGAAATGGA 
CCTCTCCGCC 
ATACCGGCGG 
ACACCTTCGG 



TCAGGAAGCA 
CATTCGCGAC 
CGAACTCAAA 
GGGATCGCCC 
TGCCGACATG 
GACACAGGAG 
CCTGTCGGAT 
GATCCTCGAC 
TTCCGGGAAT 
AGGCGATACT 
CACCGACATC 



60 
120 
180 
228 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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ACTGCCATGG 
AGGGTGATAC 
GGTGTAGGCA 
AGCCGTATT C 
ACCAAATATC 
AATCCGCAGA 
GCCGGATCGA 
TGCATCGGAG 
GAACGCCGCT 
CTGCAAAACA 
ATCAAAGCGG 
GCGATAGATG 
AAAGAAATCG 
GTAAAGGCTC 
CAGCAGATAG 
GTGGACGAGA 
CTGAGCACGG 
GTAGGTCAGG 
CTTCGCAATG 
AAGACCTATT 
AGGGTGGATA 
CCGGGATATG 
TATTCCGTGG 
TTACAGGTGA 
AACACCGTGA 
GGTATCGGGT 
ATCCAAAAAG 
ATCCTCTTCG 
GCCGTCTTGG 
GATGTGATAG 
CTCCAGAACG 
GGGCAGACGC 



CGGCAGCAGG 
AGAT ACT CAG 
AGAGTGCCAT 
TTTTCGACAA 
GCGGACAGTT 
TCATCCTCTT 
TGGATACGGC 
CCACTACGCT 
TCCAGAAGGT 
TCAAAGAGAA 
CAGTGGAACT 
CCATGGACGA 
AGATACTGGA 
AGAACTACGA 
CGGAAGAGAA 
ATGTAGTGGC 
GCGAAGGCGA 
ACACAGCCAT 
AAAAGAAACC 
TGGCCAAGAA 
TGAGCGAGTA 
TGGGCTATGA 
TTCTCTTGGA 
TGGACGAAGG 
TCATCATCAC 
TCCGT TCGGA 
CTCTGAACAA 
ACCAACTGGG 
CGCGCATCCA 
CGACGAAGGG 
AAGTGGAGGA 
TTACGCTCTC 



CAAGCTCGAC 
CCGGCGCAAA 
CGTGGAAGGA 
GCGGATCATC 
CGAAGAGCGG 
CATCGACGAG 
CAATATGCTC 
GGATGAGTAT 
GCCGATAGCC 
ATACGAGGAC 
GACCGATCGC 
GGCCGGCGCG 
GGCCGAATTG 
ACTGGCTGCC 
GAAAAAATGG 
GCATGTAGTG 
ACGTCTGCGC 
CGAAAAGATG 
GATCGGTTCT 
GCTCGCCGAA 
TATGGAGAAG 
AGAAGGCGGC 
TGAGATCGAA 
TCAGCTGACC 
CTCCAACGTG 
AAAAGACGAG 
CACGTTCAGC 
CAAGACGGAG 
TCGTGCCGGA 
ATACGACCTC 
TCGCCTCACG 
TGCTCGCGAT 



CCGGTAGTGG 
AAGAACAATC 
CTGGCCGAAC 
AGCCTCGATT 
TTGAAAGCCG 
ATACATACCA 
AAACCCGCTC 
CGTAAGAACA 
CCCTCGACTG 
TATCACGGTG 
TATGTATCCG 
AGCGTCCATA 
GCATCGCTGC 
TCCTTCCGCG 
GAAGAGCAGA 
GCGTTGATGA 
ACGATGGCAG 
GTGCATGCCA 
TTCCTTTTCC 
TACCTGTTCG 
TTCTCCGTTT 
CAACTGACGG 
AAGGCGCATG 
GACAGTCTGG 
GGTACACGCC 
GAAGCGAACA 
CCCGAATTTC 
ATTCGCCGGA 
TACGACCTCG 
CaATACGGAG 
GATCTTATCC 
GGCGAGATCA 



GTCGGGAGCA 
CGGTGCTCAT 
GCATCGTGAA 
TGGCTCAGAT 
TGCTCGATGA 
TCGTGGGAGC 
TTGCCCGTGG 
TAGAAAAGGA 
CAGAAGAAAC 
TACGCTATAC 
ATCGTTTCTT 
TCACCAATGT 
GAGAGAACAA 
ATCAGGAGCG 
TGTCCAAGCA 
CAGGCGTTCC 
ATGATCTCAA 
TCCAGCGCAA 
TCGGCCXTCAC 
AGGA'l'GAGAA 
CGCGTCTCGT 
AGCGCGTAAG 
CCGATGTCTT 
GACGGCGCGT 
AGCTCAAAGA 
AGGAGCATAG 
TCAACCGTTT 
TGGTGGACAT 
TCCTTACCGA 
CACGACCGCT 
TCTCCGGACA 
TCGTACAAGA 



GGAGATCGAA 
CGGCGAACCC 
CAGGAAGGTG 
GGTAGCCGGC 
GCTGAAGAAG 
AGGCTCTGCA 
ACAGGTACAG 
CGGAGCACTC 
GCTGACCATC 
GGACGAAGCG 
CCCAGATAAG 
GGTGGCTCCG 
GCTCTCGGCC 
GCGCACTCAG 
CCGCGAGACG 
GGCTGAGCGG 
GACCAAAGTA 
TCGTCTGGGA 
GGGGGTAGGC 
TGCCATGATC 
GGGTGCCCCT 
ACGCAAACCC 
CAATCTGCTC 
GAATTTCAAG 
CTTCGGGCAG 
CCGTTCCGTG 
GGACGATATC 
AGAGCTTAAA 
TGAAGCCAAG 
CAAGCGCACA 
GATCGAGAAA 
ACAAGCA 



720 
180 
840 
900 
960 
1020 
1080 

mo 

1200 
1260 
132C 
13CO 
14 40 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
24 60 
2520 
2577 



35 (2) IHFORllATIOW FOR SPjQ ID NO : 1 65 

(1) SEQUENCE CHARACTERISTICS: 

(A) LttHGTHi 1251 base pairs 
{&) TYPE: nucleic acid 
40 (CJ STRANDEDNESS: double 

(D) TOPOLOGY : circular 

Ui) MOLECULE TYPE: DHA (genomic) 

45 (lit) HYPOTHETICAL: HO 

Uv) ANTI-SENSE: tiO 

(vl> ORIGINAL SOURCE: 
50 (A) ORG AH ISM : PORY PH YROMONAS GIMGIVALIS 

tlx) FEATURE: 

(Al NAME/ KEY: miacjeature 
(B> LOCATION 1...1251 



55 



Ui) SEQUENCE DESCRIPTION: SEQ ID MO: 165 



ATGAATTACC TGTACATACT GATTACACTT TT ACT CTCCG GCTTTTTTTC CGGTGCTGAG 60 

ATTGCTTTCC TTTCTTCGGA CAAACTGCGT CTTGAGTTGG ACAGGAATAG AGGCGATCTC 120 

60 ACAGGCAGAG CGTTAAACTT GCTGTATCGA CATCCGGACC AACTGGTGAC TACTCl'CCTT 180 

GTGGGTAATA ATATCGTTTT GGTAGTCTAT GGTCTGCTGA TGGCGGGATT GCTGGCCGCA 24 0 

CCTTTGGCGC AATGGATTGA TAACGATGCT ATGATCGTCG TTCTCCAATC TGTCTTATCC 300 

ACT AT CATC A TACTGTTTAC CGGGGAATTT CTACCCAAAG CCATTTTCAA GACCAATGCC 360 

AATATGATGA TGAGGGTATT CGCCCTCCCT ATCGTAGCGA TCTATTATCT GCTTTATCCT 420 

65 CTGTCTAAAC TCTTCACCGG TTTATCTCGC TCTTTTATTC GTCTGGTGGA CAAGAATTAT 400 

GTGCCTACAA CAGTAGGGTT GGGGCGCGTA GATCTCGATC ATTATTTGGC AGAAAATATG 54 0 

TCCGGAGAAA ACGAACAGAA CGACTTGACT ACCGAAGTGA AAATCATCCA GAATGCGCTG 600 

GATTTTTCCG GTATTCAGGT GCGAGACTGC ATGATCCCAC GCAATGAGAT GATAGCATGT 660 

GAGTTGCAAA CGGATATTGA AGTACTCAAA ACGACTTTTA TCGATACCGG TTTGTCCAAG 72C 

70 AT CATTATCT ACAGACAGAA CATAGATGAC GTAGTAGGAT ATATCCATTC GAG CGAAATG 7 80 

TTTCGTGGGC AAGACTGGCA AAAACGTATC AATACTACTG TATTCGTACC CGAAAGCATG 84 0 

TATGCCAATA AACTGATGCG ACTACTCATG CAGCGCAAGA AAAGCATTGC GAT CGT CATC 900 

GATGAACTTG GAGGTACGGC CGGAATGGTC AC ATT AGAGG ATTTGGTAGA AGAGATTTTC 96 C 

GGTGACATTG AGGACGAACA CGACACTCGC AAGATCATAG CCAAACAGCT CGGCCCTCAT 1020 

75 ACCTATCTGG TCAGT GGTCG TATGGAAATA GATGATGTGA ACGAACGTTT TGGGTTGTCC 1080 
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TTGCCTGAGT CTGACGACTA CCTTACCGTG GCCGGATTTA TCCTGAATAG CCATCAAAAT 1140 

ATCCCACAGG CCAATGAGGT CGTGGAGATT GCTCCTTATA CTTTTACCAT TCTCAGATCT 12 00 

TCTTCCACCA AG AT CGAACT GGTGAAAATG TCCATCGACG ACCAATCGAA C 1251 

5 

(2) INFORMATION FOR SEQ ID NO: 166 

<i) SEQUENCE CHARACTER TSTICS: 

(A) LENGTH: 87 9 bass pairs 
10 (B ) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 



15 



20 



25 



30 



45 



55 



70 



(ii) MOLECULE TYPE: DNA (genomic! 

(ill) HYPOTHETICAL: NO 

(ivj ANTI-SENSE: NO 

(vij ORIGINAL SOURCE: 

(A J ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NA> IE/KEY: misc_f eature 
<B) LOCATION 1. . . 879 

Ixi) SEQUENCE DESCRIPTION: SEQ ID HO: 166 



ATGAAACAGA ACTACTTCAA AAGAGTCTGC TCACTGCITT GGCTGGTTTT ACCCATCTT 60 

ATTATGCCAT TGGAAGTAGC AGCTCAAGAG ATTATTCCCA ACGAAGAGGT GTTGGAATCA 1?0 

TTGACTTTCG TTGCACCGGT TGAGGAGACA GACGCAATAG AGGCAGAGGT AGAAGCTCTG 180 

CAGGAGATAG TCGCTACTGA GGAGATTGCG GAGCAGGCTG TTCGTTCTTA TACCTACACG 24 0 

GTCTATCCTG AT GGCGTG AA GATTGCTTCA GGATTGACTG AGCCCACTTT TCTCGATGAA 300 

GATGT TCCTG CCGGCGAACA TACCTACTGC GTAGAAGTAC AGTATCAGGG AGGCGTATCC 360 

GACAAAGTAT GCGTGGACGT AGAGGTGAAG GACTTCAAAC CGGTTACCAA TCTCACCGGA 4 20 

ACTGCTTCCA Al'GACGAAGT TTCTTTGGAC TGGGACGGTG TGGAAGAGAA AGCTGAAGAG 4 00 

CCGGCAAGTG ATAAAGCAGT CAGCTACAAC GTCTACAAGA ATGGAACCTT GATCGGTAAT 54 0 

ACAGCTGAAA CTCATTATGT GGAGACCGGT GTAGCCAATG GTACATACAT CTACGAAGTG 600 

GAAGTAAAGT ATCCTGACGG TGTATCTCCG AAGGTGGCTG TAACCGTGAC CGTGACCAAC 660 

W AGCTCATTGA GCAATGTAGA TGGACAGGCT CCTTACACAT TGCGAGTAGA AGGCAAGAAG 720 

ATTATTGCGG AAGCCCATGG TATGATCACG CTCTACGACA TCAACGGACG TACCGTGGCC 7 B0 

GTAGCCCCGA ATOGATTGGA ATACATGGCG CAAACCGGTT TCTATGCAGT GCGCTTCGAT 84 0 
GTGGGGAATA AACACCATGT ATCGAAAATA CAAGTAAGA 



(2) INFORMATION FOR SEQ ID NO: 167 



U> SEQUENCE CHARACTERISTICS: 

|A) LENGTH; 936 base pairs 
5U tD) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



60 <vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 
„_ (A) NAME/KEY: misc_teature 

" 5 (3) LOCATION 1...936 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 



879 



ATGATTCCTC TAAGCGAGAG TTTTGAATCA GGTATTCCAG CTATATGGAA GACCATTGAC 60 

GCAGATGGCG ATGGCTATAA TTGGATGCAT TTGACCAATT TCACGGGACA GAGTGGTCT^ 120 

TGTGTCTCTT CGGCTTCATA CATAGGCGGC GTCGGAGCTT TGACTCCGGA CAATTATCTG 180 

ATAACACCCG AATTAAAACT ACCCACAGAC GCGTTGCTGG AAATAATCTA TTGGGTATGT 24 0 

ACTCAAGATC TCACTCCTCC ATCGGAGCAC TATGCCGTTT ATTCCTCTTC TACAGGCAAT 300 

AATGCTGCTG ACTTTGTTAA TCTCTTATAT GAAGAGACTT TG ACT GCC AA ACGGATACAA 360 

/3 TCCCCCGAGT TGATCCGCGG AAATCGCACA CAAGGTGTTT GGTATCAAAG AAAGGTGGTA 420 
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^? TAAATA TGTTGCTTTC CGCCATTTTA ATTCCACGGA TAATTTCTGG 480 

StCCGG^G 53?^SSS I? TCCTGTAT ACCCCTCTTC CCCGAAGAGC TCCGTGTCCG 54 0 

S^J^SSJS GTTACACTTA TTCTGTATTC CGTGATGGAC AAAAGATAGC GAGTGGATTG 600 

S JSSSJIS 3 CA TATATCGA TACGGATGTA CCGTATGGGA CTCAAGACTA TTGTGTCCAA 660 ^ J 

TGCAAGGAGA CTCGTATAAA GTCTGCAAAA ATATAGTGGT GGCAAATTCT 7^0 

StgS? TCAAArrnrn TM6CCTTTT GCGTTGACCG TGGTTGGCAA GACCATTGTA 780 

55^2^f^ TCAAAGGAGA GATCACTCTT TATGACATTC GTGGCCGGCT GATAGCTTCC 84 0 

CGCTTAGGTA CAAAGCGGAA AATGGTTTTT ACCTCATTAA AATACAGGTA 900 ' V 

1Q AACGGAACTG TCTATACTGA GAAAATCCAA ATCCAA 935 - ,f 

(2) INFORMATION FOR SEQ ID NO: 168 

1r _ ti) SEQUENCE CHARACTERISTICS: 

1& t A) LENGTH: 2529 base pairs 

<BJ TYPE: nucleic acid 

|C) STRATI DEDNESS: double 

(D) TOPOLOGY: circular 

20 <ii> MOLECULE TYPE: DMA (genomic) 

(xii) HYPOTHETICAL; NO 
<iv) ANTT- SENSE: NO 



fvi) ORIGINAL SOURCE: 

tA) ORGANISM: PORYPHYROHONAS GINGIVALIS 

( ix ) FEATURE : 

(A) NAME /KEY: misc feature 

(B) LOCATION 1...2529 



60 
120 
100 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

^ A ^^ AA GTTTTCTTTT AGCCATAGTA ATGCTCTTTG CCATTGCCAT GCAGGGACAT 
GCGAGCTTTG AGTCTGGCTC GGCTGCCTTT GCGACAGGTA 
JSS^ J^ A ?^ C G C AGT AT CT GACAAGATTT CCATCGATTA CGTTTATCGG 
SJ^™^ CTGAGAGGGG TATCACATCA CAAGAGGAAG GCTCTCCTGC ATATTTTTAT 24 0 

SKEE ^ GGGCTAT GCT ^GTAG CAGCAGATGA cIgZVaIcI loo 

*^T?Il^ ^ ATTCACC CA'^GGCCGT TTCGACATGG ACAGTATGCC GGACAATCTT 360 
^ A nSJ? GC TACAAATTTA CGATCAGGAA ATAGGCCTGA TACTTTCCGG AAAAGCTCAG 420 
CTCAATGAAG AGATATTACG TACCGAGGGC GTACCGGCTG AAGTACATGC TCTGATGGAT 4 80 

2£SZ™ ESS** 08 * TCCCATGCGA TGGAATCAAG S?TACCC^G 540 
SESSSr? rr5S^ G ? CAATCATGCC TATACCGGCT GTGTTGCTAC TGCT GCAGCA 600 
^™™ G CTGGCCGC " rT CAAGGTGAAG GCTCTTTCGA TTATCATGCA 660 
rcc^A^ 7.^^? GTCCGGCACA TTTGGTGAAA TGTACGACTG GATCAATATG 720 
CCCGGAAATC CCGACCTTGA TAATCTGACT CAATCTCAAG 'i'GGATGCCTA CGCCACACTr nan 
A ™Sn™ I GAGTGCATC TGTTTCGATG AGTTTTTATG i££SS££ T^CG^C 840 
Sa74™ ™S AG S CTTGCGAAAC AACTTTCGCT ACAAGCGTTC ACTGCAGCTA 
SnrJ^r^ ^S^SS CT G AC AG GAG TGGCACGATA TGATCCGCGG GGAACTTGCC 
^ GGGGAAGGG CG GT CT ATT A TGCAGGGAAT AACCAGAGCA TAGGACATGC TTTCGTTTGC 
rS^J^lt CTTCGGATGG TACTTTCCAT TTCAACTGGG GTTGGGGAGG TGTTTCCAAC 
GGG 3ZST ACA AACTAACACT CCTCTCGCCG ACTTCGTTGG GTATCGGAGG TGAGGGAATA 

G ATCATCACC GGTATCGAAC CGGCTAAGAC TCCCgS 1200 
ATGCCTTGCC GATCTTGGCA CTGAAAGACA TAGAAGCCGA GTATAAAAGT 1260 
GACCTCGciAT IS™?* 66 GTATTCGA ™ TATAATACAG GTGAAGAGCA ATCAAATCTT llto 
AT^CAATATCT CTTGGTAPGn GGAGAAGTCA TAGAGGTGAA AACTTCATCT 1380 

CAGTTG?^ ££™£2f? A T ACG<5AGAG CATCCCGAGA GTTTCTCATT GGCACCTAAT 1440 
TGaZlrl^r ^^IS™ CACCATCACC Cl'ACTTTATC GTCGCACAGG CACCCAACAG 1500 
GACrPPAATA l^^l GC ACAGGGAGGA TATGTCAATA GCATTAAAGT AAATACGACA 1S60 
A^CA^5g SSSSF"™ GCAAGCTCAG TATCGTCCCC 1620 

AACAGCTTTG TCfc-CAGATCT GAATTCTTAT GAACATAGTA CGATTACAGT ACAGTTCAAT 
SEE™ ^TACACCC CTAGOCTTTG CTCTAtSac SS 

T^SSct^ J^SST* GGGCTGGGTA ATGGCTGAAG TTCCGGGCGG TAGCAGCAAC 

J^^SISf tttggtctaa agacgttctc actctctcgg aaggcgacta TACATTGTGG 

l*l™£ZZl CCATCAACAA CCAAAAGGAT GAATGGAAAA AGATCGGAAG CGTGTCAGTA 
AAAACACCGA CAGAGTATAC GCACCCCTTA TTCGAAGTGG GCCATAATCA AACTTCTA-C 
TATACGCTGG ATATGGCACA CAACAGAGTA TTGCCCGACT TTACACTCAA AAATCTC^A 
SESSEE GGTTGTT GTT TTCCGCCAAA ££S£S£ JSSSSg 
TTATGGG CAG CTCAAGAAAC AGTACATATC AAGCAAGGAG AAACTTTCGT ATATAAACC-1- 
GTTGTCGAAG GCCCTATACC TGATGGATCC TATCGTGCGA CCc¥cCATGC A^Sfc 
ACAG^GGTAG ^ GGGGAAA AGGAACTACA CGGTgS otSSSS? 

GATTATTT^r JJSSS™* 1 * ATCGTCAGAA GAGATCAGAG TATTCCCTAA TCCGGCACGC 
^ ACCTTGCAT ^ CCCCAAGAAA CATCTATCAT TCTTTTCGAT 
CTGTCAGGCA AGATTGTCAT GAAGAATAGT TT AT C AGCGG GGCATGGCAG AATGGATGTC 



900 
960 
1020 
1080 
1140 
1200 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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A?A^gS£ C CTAATGGGGC CTACATCCTT AAGGTGGATG GATATACGAC GAAAATAAAT 2520 



10 



20 



25 



60 



65 



252? 



(21 INFORI1ATICW FOR SEQ ID MO: 169 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 870 bass pair? 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOTjECULE TYPE: DNA ( genomic > 
15 (iii) HYPOTHETICAL: NO 

Hv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIIONAS GINGIVALIS 

(ix) FEATURC: 

(A) NAME/KEY: misc_f e aturs 
(BJ LOCATION 1...870 



(>:i) SEQUENCE UESCRI PTION : SEQ ID NO: 169 

TATTTCTCTC GCTCACGAGT CTTGTAATGG TCTTCGCTGT TCCAAGTTGC 
GATATAATCG ACAAGGATCA AACCCTCTTG CCGGCTCCGA CCAATGTGAC A-CCGATAAT 
30 CCGGATGACA ATCCTTCGGA GATCGACATT ACGCAGACGC ACACAGAAAA ATATGTTTTG 
^^^A T TTACCGGCCA AAAATGTCTC AACTGTCCGA AAGGTCATCG CAAACTGgJg 

£^^52 agcaatacgg taagagattg actgttgtcg gtatacatgc cgcccctgga 
ZfZ^*l ^cctctttt ccgtacagaa gccggagacg catattatag caagttcgc- 

OCT ^^ ACCC CTCTCCCTGC GCTGATGGTT TCGCGCAAAA AGTTCGGCTC TTCCTACGTT 
Zx^l^ GCTACAAAAC GTGGGACGTG CCTATTGCCG AGCAGATGGA CCAAAAGGCG 
JSS£5£« l^-^ CGT GGCCGAATAC ACCGATACCC AAAAGATCAA GGTGACTGTA 
AAGGGTAAAA TACTGGAGGG GAATACACTC CCGAAGTCCA TGGTTCAGGT GTATCTGTTG 
GAGGATAAGC TGATCGCTCC GCAGGTGGAT GGCAATACGA CAGTCGAGAA TTACGAGCAC (ttn 
40 SZSSEZl TGCGTGGAGC CGTTAATGGT ATTTGGGGCG AAGAATTTGT 720 
40 GATTATTTGT ATACTTACGC CGTTGAACCG CTCTCGGGTA TGTCCTTCGT AGCCGAGAAT 780 
XJESI™ TGGC ™ TTG T ATACGATGTG CAGACGTTCG AAGTGTATGA CGTTGTGCAT 
GTAAAGATCA ATCCGCAATC CGATGGCAAA 



45 (2) INFORMATION FOR SEQ ID NO: 17 0 

(il SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 669 base pairs 
Kn (B) TYPE: nucleic acid 

DU IC) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DHA (genomic) 

55 (ill) HYPOTHETICAL: NO 

liv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROf tOJ IAS GIUGIVALIS 

U X ) FEATURE : 

(A) NAME/ KEY : nu.sc featur- 

(B) LOCATION 1...66SJ 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 



840 
870 



(Xi) SEQUENCE DESCRIPTION: SF.Q ID NO: 170 

ATGAAGAAAT CAAGTGTAGT AGCCTCAGTT TTGGCCGTGG CTCTCGTGTT CGrrm-rm 

70 GCC^™ ATATGGCAAA ACGCGGCCTT ATCGGCGCcI « 120 

/O GCCGGACTAG GTAACGTAGC CGGAAATACG GCTGTCGGTG CCATCGTCGG TACTGCAGTC 180 

GGTGGAGCAG CCGGTGCTCT CATCGGAAAG AAGATGGACA AGCAGAAAAA AGAACTGTAG lln 

rTCC^l-r £ GATGCTAC GATTCAGACA GTAAATGaS T^^Sact 3 0 n 

^^ CTT TGCGACGAAC TCCAGCACTC TGAGTCCCAA CTCACGCACT 360 

75 g-SIta^ GC ^ cmg ^ aaaaaccccg ACACGGATAT TCGTATCGTA 4-0 

GGCCATACGu ACAATACCGG CTCCGACAAG ATCAACGATC CTCTCTCTGA GAGACGTGCA 480 
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GCCAGCGTAT ATTCTTTCCT GAATTCTCAG GGTCiTGAGTA TGTCGCGCAT GGCAGCCGAA 54 0 

GGGCGTGGGA GCCATGAACC GGTTGCAGAC AATAGCACAG TTGCCGGACG TTCGGCCAAC 600 

CGCCGTGTGG AGGTTTATAT CTTGCCGAAT GCCAAGATGA TCGAACAAGC ACAGCAAGGT 660 

ACGCTGAAG 669 



10 



15 



20 



25 



12} INFORMATION FOR SEQ ID NO: 171 

li} SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1011 base pairs 
f B) TYPE: nucleic acid 
|CJ STRANDEDNESS : double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic;) 

(ill) HYPOTHETICAL; NO 

(iv) ANTI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYFHYROMOHAS GINGIVALIS 

(ix) FEATURE: 

(A> NAME/ KEY: miscfeature 
ID) LOCATION 1. . .1011 



Ixi } SEQUENCE DESCRIPTION: SEQ ID NO: 171 



30 



35 



40 



45 



ATGTCGAAAA 
GCTGTGACCC 
CGTACAAAGC 
TTCCGCAACA 
GCGATGGAAG 

, TATGTAAACC 
GATTGCTCTT 
CGTCGGCGTT 

• ATACGAGCAG 
TACTACATAG 

" CAATTGGTAG 
ACGGGTCGAA 
AATCCGAGTA 
AAACGAGGGA 
AAAGGGAAGA 
ACTTTGGAAA 
GGCATTGGCA 



AATCGATCCT 
CCGTCAGAAA 
CCTCGGACTC 
AGATTTCTTC 
CACTCAATTA 
CTTTCGTGGG 
CGTTCGTGAT 
TCGGACGGAT 
CCTTTGACGG 
TCTTGCGCCA 
ACGAGAATCA 
GCACCGGTCC 
CCATTATAGA 
GCAATTCTCG 
AAGGCAGACA 
CAATAGCCAA 
AGAGTAAAAT 



TCTGCTTTGC 
TGTGCGCAAT 
TGTACGGTAC 
CGAAAAAGAG 
CCCTGCCATA 
TGCAGGAACC 
GCCCGTCGAA 
GCACTATCCT 
GAAAGTTCGT 
TCCGAACGGA 
GATCGTTCGA 
TCATCTTCAC 
CTTCGATAAC 
CTATGCAAAA 
AGCTTCTTCT 
AAGGCACGGC 
TTTGACTCCG 



TGTTCGCTGT 
AGCCAAGTGA 
ATTAGCAACA 
CTAAGAAAAG 
GATTTATATG 
GATGTCGAAA 
GATAAGCAGG 
ATTGATCTTT 
GTACGCAGCT 
CTGGAGACTG 
GCAGGACAAC 
TTCGAGACCC 
GGAGTCCCGC 
GCCTCTAAGA 
CCTATGACCT 
ACTTCTGTTC 
GGCAAAGCCT 



GCTTCATTTC 
ACACCAAAGC 
TGATTGCAGA 
CCGAATATGA 
GTGAAGATTC 
TTCCGAACTC 
TCACCTCTCA 
CAGTGAATCG 
ATGAAGCGCG 
TGTACGGACA 
CGATCGGATT 
GCTTCATGGG 
TCCGAGACAT 
CTTCTTCTCG 
ATAGAATCAA 
AGAAACTCTG 
TGAGGATCAA 



TGCTACGAAG 
AAAGACCGAA 
TCCCCTGGAG 
AAATCGGCTG 
TTGGAGCGAG 
CTATGACATT 
ATTT GGCTAC 
TGGCGATACG 
TGGCTATGGC 
CATGAGTCGC 
AGGAGGCAGC 
TATTCCCATC 
TTACACATTC 
CTATGCAAAA 
AAAAGGCGAT 
TGCTACCAAT 
A 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
940 
900 
960 
1011 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 172 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LtLNGTH : 453 base pairs 
(Bl TYPE: nucleic acid 
(CI STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA [ genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 

(vi > ORIGINAL SOURCE; 

tA> ORGANISM: PORYPHYROMONAS GINGIVAL! 5 

(Ix) FEATURE: 

(A) NAME/ KEY: miscjeaturs 

(B) LOCATION 1. . M53 

(XX) SEQUENCE DESCRIPTIOtJ: SEQ. ID HO: 172 

ATGGCAAAAA TCAATTTCTA TGCTGAAGGC CTCAGCCTTC CTCGGATCAG AAGACGGATC 60 

GTCGGTAAGT GGATAGCCGA AGTATGCAGC CGATATGGGA AAGCGGTGGG AGAAATCTCC 120 

TATCTTTTCT GTGATGACGA ATATATCCTG AAAGCCAATC AGGAATTTCT CGATCATGAC 180 

TACTACACCG A CAT CATC AC CTTCGATTCC TGCGAACCGG ATACGGT GAA TGGCGACCTG 2 40 
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CTTATCAGTC TCGATACCGT ACGCTCGAAT GCCCGTGCTC TTGATCTTCG ATACGAAGAC 300 

GAACTGCATC GTGTCATTAT CCACGGCATA CTGCATCTTT GCGGATTGAA AGACAAGAGC 360 

AAAAAGGATG AAGCCCAAAT GCGTGCAGCC GAAGAGAAAG CCCTTGT CAT GCTGCGAGAA 4 20 

ACCATCGGAT CGGAGCTTTC CCTATTGCAT ACA 453 

{2> INFORMATION FOR SEQ ID HO: 173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ID MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(ivj Aim -SENSE: IIO 

tvi) ORIGINAL SOURCE: 

(A) organism: poryphyrohonas gingivalis 



|ix) FEATURE: 
25 (A) NAME /KEY: misc_f eature 

(B) LOCATION 1...1173 



30 



50 



55 



60 



65 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 



ATGAAGGTAA AGTACTTAAT GCTCACATTG GTTGGAGCAA TTGCACTGAA CGCAAGTGCA 60 

CAGGAGAATA CTGTACCGGC AACGGGTCAG TTACCCGCTA AGAATGTTGC TTTTGCTCGC 120 

AATAAAGCAG GCAGCAATTG GTTTGTAACA CTGCAAGGCG GTGTTGCAGC GCAGTTCCTC 180 

AATGACAACA ACAACAAAGA CCTCATGGAC CGCTTAGGAG CCATAGGTTC TCTTTCTGTC 24 0 

GGAAAGTATC ACAGCCCTTT CTTTGCAACT CGTTTGCAAA TTAACGGAGG TCAAGCCCAC 300 

<5t> ACTTTCCTCG GAAAAAATGG CGAACAAGAA ATCAACACCA ATTTTGGTGC AGCTCACTTC 360 

GACTTTATGT TTGATGTGGT TAACTACTTT GCACCATATC GCGAAAATCG TTT CTTCCAT 4 20 

TTAATTCCAT GGGTAGGTGT TGGCTACCAA CACAAATTCA TCGGTAGCGA ATGGAGCAAA 4 80 

GACAATGTGG AATCACTGAC GGCGAATGTA GGAGTTATGA TGGCTTTCAG ATT AG G AAA G 54 0 

CGAGTAGACT TTGTGATCGA AGCACAAGCA GCTCACTCCA ATCTCAATCT AAGTCGCGCA 600 

W TACAATGCCA AGAAAACTCC CGTATTCGAA GATCCCGCAG GACCTTATTA CAATGGATTC 660 

CAGGGGATGG CTACAGCAGG TCTTAATTTC CGCCTGGGAG CCGTAGGCTT CAATGCCATT 720 

GAJ ICCAATGG ACTACGCACT TATCAATGAT CTGAATGGTC AGATTAACCG TTTGCGCAGC 780 

GAGGTCGAAG AACTCTCAAA ACGTCCTGTA TCATGCCCCU AATGTCCTGA AGTAACTCCT 84 0 

GTTACTAAGA CAGAAAATAT ACTGACGGAA AAAGCTGTAC TGTTCCGTTT CGACAGCCAC 900 

GTTGTGGACA AAGATCAATT GATCAACCTG TATGACGTAG CTCAGTTTGT AAAAGAAACT 960 

AACGAGCCGA TTACCGTTGT TGGTTATGCT GATCCTACGG GTAATACTCA ATACAACGAG 1020 

AAATTGTCTG AGCGTCGGGC TAAAGCCGTT GTTGATGTT C TGACAGGTAA ATATCCTGTG 1080 

CCTTCCGAAT TAATCTCTGT AGAATGGAAG GGCGACTCTA CGCAACCGTT CAGCAAGAAA 114 0 

GCTTGGAATC GTGTTGTAAT CGTTCGCTCC AAG 1173 



(2) INFORMATION FOR SEQ ID NO: 174 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1155 base pairs 
(BJ TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ID MOLECULE TYPE: DNA (genomic) 

(iii J HYPOTHETICAL: HO 

<iv) ANT I -SENSE: HO 

ivi) ORIGINAL SOURCE: 

(A) ORGAN I SH : PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: mlsc_f eature 

(B) LOCATION 1...HS5 



(:;i) SEQUENCE DESCRIPTION: SEQ ID HO: 174 
75 ATGACATACA GAATTATGAA AGCTAAATCT TTATTATTAG CACTTGCGGG TCTCG CATGC 



60 
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ACATTCAGTG CAACAGCCCA AG AAG CT ACT ACACAGAACA AAGCAGGGAT GCACACCGCA 120 

TTCCAACGTG ATAAGGCCTC CGATCATTGG TTCATTGACA TTGCAGGTGG AGCAGGTATG 180 

GCTCTCTCGG GATGGAATAA TGATGTAGAC TTTGTAGATC GTCTAAGTAT CGTTCCTACT 240 

TTCGGTATCG GTAAATGGCA TGAGCCTTAT TTCGGTACTC GTCTCCAATT CACAGGATTC 300 

5 GACATCTATG GATTCCCGCA AGGGAGCAAG GAGCGTAACC ACAA'CTACTT TGGAAACGCC 360 

CACCTTGACT TCATGTTCGA TCTGACGAAC TATTTCGGTG TATACCGTCC CAATCGTGTC 420 

TTCCATATCA TCCCATGGGC AGGTATAGGA TTTGGTTATA AATTCCATAG CGAAAACGCC 480 

AATGGTGAAA AAGTAGGAAG T AAA GAT GAT ATGACCGGAA CAGTTAATGT CGGTTTGATG 54 0 

CTGAAATTCC GCCTATCAAG AGTCGTAGAC TTCAATATTG AAGGACAAGC TTTTGCCGGA 600 

10 AAGATGAACT TTATCGGGAC AAAGAGAGGA AAAGCAGACT TCCCTGTAAT GGCTACAGCA 660 

GGTCTAACGT TCAACCTTGG CAAGACAGAG TGGACAGAAA TTGTTCCTAT GGACTATGCT 72 0 

TTGGTCAATG ACCTGAACAA CCAAATCAAC TCACTTCGCG GTCAAGTGGA AGAGTTGAGC 780 

CGTCGTCCTG TTTCATGCCC TGAATGCCCT GAGCCTACAC AGCCTACAGT TACTCGTGTA 84 0 

GTCGTTGACA ATGTGGTTTA CTTCCGTATC AATAGTGCAA AGATTGATCG TAATCAAGAA 900 

15 AT CAATGTTT ACAATACAGC TGAATATGCG AAGACCAACA ACGCACCGAT CAAGGTAGTA 960 

GGTTACGCTG acgaaaaaac CGGTACTGCG GCCTATAACA TGAAGCTTTC AGAGCGTCGT 1020 

GCAAAAGCGG TAGCCAAGAT GCTTGAAAAG TATGGTGTTT CTGCGGATCG CATTACAATT 1080 

GAATGGAAGG GCTCATCAGA GCAAATCTAT GAAGAGAACG CTTGGAATCG TATTGTAGTA 114 0 

ATGACTGCAG CGGAA 1155 



20 



35 



55 



70 



(2) INFORIIATION FOR SEQ ID MO: 175 



U) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH; 570 base pairs 

(D) TYPE: nucleic acid 
( C ) STRANDEDNESS : doubl e 
(D> TOPOLOGY: circular 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(IvJ Aim -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 



<ix) FEATURE: 
40 (A) NAME/KEY: misc_f eature 

(B > LOCATION 1. . .570 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:175 

45 ATGGAATTTT TCATGTTATT CATAGCGGCG GTTTTCGTTA ATAACGTCGT GCTGTCGCAG 60 

TTCCTCG*3TA TATGCCCATT CTTAGGCGTA TCGAAGAAGG TAGACACCTC AATCGGTATG 120 

GGTGCAGCCG TGACATTCGT ATTGGCACTG GCTACCTTGG TTACCTTCCT GAT TC AG AAG 180 

TTCGTTTTGG ATCGTTTCGG ATTGGGCTTT ATGCAGACCA TTGCATTTAT TTTGGTCATT 240 

GCCGCCTTGG TGCAGATGGT GGAGATCATA CTCAAGAAAG TATCTCCTCC CCTCTATCAG 300 

50 GCACTGGGTG TATTCTTGCC CTTGATTACG ACGAACTGCT GTGTGCTCGG TGTGGCTATT 360 

TTGGTTATCC AGAAGGATTA TACCCTGCTC CAGAGCTTCG TCTATGCAAT ATCCACGGCT 420 

ATCGGTTTCA CCTTGGCAAT GGTTACTTTC GCAGGTATTC GAGAGCAACT CGATATGACC 480 

AATCTCCCCA AAGCTATGAA GGGAATACCT TCGGCACTCT TGGCTGCCGG TATATTGGCT 54 0 

ATGGCTTTCA TGGGCTTCAG CGGTATCGCC S70 



(2) INFORMATION FOR SEQ ID NO: 176 



It) SEQUE11CE CHARACTERISTICS: 
60 (A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDHESS: double 
{ D) TOPOLOGY: circular 

65 <iil MOLECULE TYPE: UNA (genomic) 

(lii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYFHYROtiOUAS GIHGIVALIS 



(ix> FEATURE: 
75 (A) tIAl IE/KEY: mire feature 
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(xl ) 



(B) LOCATION 1...553 
SEQUENCE DESCRIPTION: SEQ ID NO: 176 



ATGTTATTCA 
TGCCCATTCT 
ACATTCGTAT 
CGTTTCGGAT 
CAGATGGTGG 
TTCTTGCCCT 
AAGGATTATA 
TTGGCAATGG 
GCrATGAAGG 
GGCTTCAGCG 



TAGCGGCGGT 
TAGGCGTATC 
TGGCACTGGC 
TGGGCTTTAT 
AGATCATACT 
TGATTACGAC 
CCCTGCTCCA 
TTACTTTCGC 
GAATACCTTC 
GTATCGCC 



TTTCGTTAAT 
GAAGAAGGTA 
TACCTTGGTT 
GCAGACCATT 
CAAGAAAGTA 
GAACTGCTGT 
GAGCTTCGTC 
AGGTATTCGA 
GGCACTCTTG 



AACGTCGTGC 
GACACCTCAA 
ACCTTCCTGA 
GCATTTATTT 
TCTCCTCCCC 
GTGCTCGGTG 
TATGCAATAT 
GAGCAACTCG 
GCTGCCGGTA 



TGTCGCAGTT 
TCGGTATGGG 
TTCAGAAGTT 
TGGTCATTGC 
TCTATCAGGC 
TGGCTATTTT 
CCACGGCTAT 
ATATGACCAA 
TATTGGCTAT 



CCTCGGTATA 
TGCAGCCGTG 
CGTTTTGGAT 
CGCCTTGGTG 
ACTGGGTGTA 
GGTTATCCAG 
CGGTTTCACC 
TCTCCCCAAA 
GGCTTTCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
S58 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 177 

U) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 24 99 base pairs 
{BJ TYPE: nucleic acid 
(CJ ST HANDEDNESS: double 
(DJ TOPOLOGY: circular 

fii) MOLECULE TYPE: DNA (genomic) 

(lit) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

<vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMON AS GINGIVALIS 

(ixj FEATURE: 

(A) NAME /KEY : misc_f eature 
IB) LOCATION 1...24 99 

(xi J SEQUENCE DESCRIPTION: SEQ ID MO: 177 



ATGAAACAAC 
TCGGCTCAGC 
CCTATCGCAT 
AACGGAAACT 
ATGGGTTACT 
GACCTCTATC 
AACGAGACTT 
CTCAAAACCA 
GAGGACAACT 
TATTCGCAAA 
TTGGAGCAGA 
GCTCTGTTCG 
CGCAATTCGG 
AG CTTCC AAA 
GTCATGGTAT 
ACCGAACTAC 
CTCTACAGCA 
AGACTGGACA 
GGCGGAAGTT 
GCTTCAGCAC 
CTGCTGAACG 
ACGGCTCTTA 
CGT CAT ACCT 
CTCGATGACA 
CAGTACGACC 
6ACTATGTTC 
CCTAGAGCCA 
GAAGGATTCC 
ACTCCTATCA 
GCTTCTT TCG 
TTCTCCACCT 
GGCAAAGAAT 
GGTGTGAATC 
ACATGGCAGA 
GGACAAGCCG 
GTTGCTACGG 



TAAACATTAT 
AATCGGGCGG 
ACG TACAAGT 
ACTCGATCAA 
CCACTTGCGA 
TGACCGAAGA 
TCCGCCGTCA 
ACTCTACCAA 
GTCAGAACTG 
TTCTTATCGA 
TGCCTGCCAA 
GCTCTAATGC 
CCGAGATCAG 
ATACGACCCA 
TCGGCCAACA 
CCAATCTGCG 
AAGCAACCCT 
ATCCTCCTTT 
TCAAATTCGA 
AAGACGTTCA 
GAGCAGTTCA 
CTTCCTACGG 
TCGGAGAAAA 
GAAGCGGCTA 
AGCTCGAATA 
TCCTCAATCA 
ACGTACGATA 
GCGCTCCTCA 
GCCGTGTCCT 
ATTATTACCA 
TTAT CAGCAA 
GGATCATTCG 
TGGAGGGAAG 
GAAGCCGCTA 
AGATCTCTGT 
TACGTCCTAC 



CAGCTTCATC 
ATCCGTTACA 
ATTCGTCAAA 
GGGAATCCCT 
AGAAAAAGTA 
GATTCTCTCT 
AGCACCCTCG 
CCTGAGTCAG 
CGGTTTCAAC 
CAGCCATCCC 
TATGATCGAA 
TGTGGGAGGC 
CCATTCTACG 
GTTCAACGGT 
CAACTACCGT 
CAACCGCTCG 
CGAATATCAC 
CGAAGCCCAG 
TCAGGGCTTC 
GCGTCGTAGC 
GAGTGGAAGC 
GACTACCAAG 
CTGGGACTTT 
CAGACCGAGC 
TAAGACGGAG 
GGATGGCAAA 
CAATCCCAAT 
GTATTTCGAT 
TTCCCCCAAT 
CAGAGCCGAC 
TCAGTTCAAA 
TACCATCTAC 
AATCGCCTAC 
CGGAAGCATC 
GAAAGACTAT 
CGAGCACTTC 



ATTGCTTTCC 

GGTACCGTAG 

GGAACCACTC 

TCGGGTAATC 

CATATAGAAA 

CTCGATGGGG 

TTGGTAACGG 

GGACTTAAGT 

CAAGTT CGT A 

ATCTTCAGTT 

CGTGTAGAAG 

GTTATCAACG 

ATGACCTTOG 

TCTAT GCTGA 

CCCGGACAGG 

CTCGGTTTCC 

AGCATGCAGG 

ATAGCGGAAT 

AGCGGTGGCA 

TACTACGGGG 

ACCGAATCGG 

GGA1TCGATT 

ACCGGCGGAC 

AAAATAGATC 

AAGTTAAGTG 

CGCTATATCG 

AAGAATCT'CA 

GAAGATCTGC 

CTGAAAGAAG 

GAATGGCAAT 

CCATCCGATA 

AACGACAAGG 

AACAAATCGT 

TATACCGCTG 

GTACGCACTC 

GCCATCAATC 



TATTCTTAGG 

TGGACAAAAG 

TCGGAACTTC 

AAACTATCGT 

AGGGTGGTTC 

TAGTGGTATC 

TACTGTCGCC 

TCCAGCCCGG 

TCAATGGACT 

CGCTTGCCGG 

TAATTCGCGG 

TAATTACGAA 

ACCACGCGAA 

CGGAAGACCG 

ATATAGACGG 

G CTCAT ACT A 

AGTACCGTCG 

ATCTCCAGCA 

AGGATTTCTT 

GTGGCGACTA 

ACGAATACAA 

TGCAAGGAGG 

TCGAATATAT 

AGAATACCTC 

CCCTTATCGG 

ATCCGCTCTT 

GCTTCCGACT 

ACGTAGAGTT 

AACGTTCACG 

TCAAT AT CAT 

AGGTCGAAAC 

ATGGAGTATC 

TCGACCTCCA 

TGGAAGCGGA 

CGAATCTGTA 

TCTCCGGTAC 



AACGAGCGCA 
CTCAAAAGAA 
CACGGATGCA 
AGCCCGACTC 
CCGCCACGTA 
TGCCAATAGA 
GGAACTTTTC 
TCTGCGCGTG 
CGAAGGAGCC 
TGTCTATGGC 
TGGAGGTTCG 
AGAACCGCTT 
AGGGTGGGGG 
CAAAGCCGGT 
CGACAACTTT 
TAAGACCGGT 
TGGTGGCGAC 
CTATATCAAT 
CAGTCTGTAT 
TACCGAAAAT 
CGATGCTTTC 
AGGTATGTAC 
CTACGGCCAA 
TACTTTTAGT 
AGCACGTATC 
CATTTTCAGT 
CTCATACAGC 
GGCCGGTGGT 
AAGCATCAGT 
GGGAGAAGCC 
CACGAGCGAT 
GAAGGTATAT 
GCTCGGCGGT 
CAAAACAACG 
CGGCTATTTC 
ATTCACGGCC 



60 
120 
1B0 
240 
300 
360 
420 
480 
S4 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1O80 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



120/490 



AAAATGGATG 
GACGGATCGT 
GCCAAGCTCG 
CACCTTGCTT 
TATCAGAAAG 
CAGCCCAGAA 



TAGTACACGA 
TCGACTTTGA 
TCAAGACTCC 
CCACTATGAC 
ACACGGACAA 
GGATTTTCGT 



AGCCTATGAA 
AATGAATGGT 
GGCCTTCGCC 
CTTGGAATTG 
GGGACCGGGT 
CGGTACAAAG 



GGCGATATTC 
CAGCAATTCA 
GATATAGACC 
AATGCCGGAA 
AGAGCTTCTA 
ATCAATTTC 



CCGCAGAACA CATAGCTCCG 
AAGGTTTGGC CGAAGGTCAT 
TCAAGCTGAG CCACGACTTC 
TACAGAACAT ATT CAACAGC 
CTTACGTATA CGGTCCTATG 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



12) INFORMATION FOR SEQ ID NO:178 



li J SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 2673 base pairs 
<R) TYPE: nucleic acid 
(CJ STRANDEDUESS: doub] - 
ID) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

Uli) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

<A> ORGANISM: PORYPHYROMONAS GINGIVAL!* 
(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .2673 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 



ATGTACAAAA 
GGACAGTCCT 
AAGCCTATTC 
TTTGTCTTGC 
GCCATGTCTG 
ATCGCGGATA 
CGCATCAGTA 
AAAATCGGTC 
ATCGTACAGA 
GAACCGGATC 
AAAACCAAGG 
AGAATGGCGA 
TTCTTGAAAC 
CGATTGATAG 
GTCGTGAAGG 
AAGTATTATA 
GAACGAGTGC 
CTCAATGAAG 
TGGGTCGATC 
GCGGAGGGGA 
CAAGACGTAG 
ATCATTAACT 
CCCCGTCCGA 
CGTAGCAGTG 
GGAGCCATTA 
AAAGCGATCA 
AAGTACTATC 
GATATGTTCA 
AATAGCAATG 
AGTTATTACA 
GATCGTTCGC 
GACAATTGGT 
AGCTACAATA 
CGTCTCTCTC 
GTTTCTGTTG 
AACCTCAGCG 
CGAGTTTTTA 
CGAGTGGAAG 
TTCTATATGG 
GGTTT GCGTG 
TATATGCGGC 
TGGCTCTTAG 
TTTAACCTGA 
CTCGGTATCG 



AGATTATTGC 
CGGATCTGAC 
ATAAGACCAT 
GCAATCTTTC 
CTGCCGTGAA 
AATAT GTCGG 
AGGTTACTTT 
TTCGCGAGGG 
AGTATTTTAC 
TTTCCAAAGA 
TGAATGAAAT 
TGAAGAACAC 
TTT TTAGTAC 
AGAAGTATCA 
CTCCTGACGG 
TTAAGGATGT 
TCGGAATAAA 
ATGAAGATGC 
CCGTGGAAAC 
AGCAGGCCAA 
TACGCCGAGA 
CTATTCGTCT 
TTCCCAATCC 
ACCAATTGGA 
AGTTCACGAA 
TTCCGCAAGG 
AGCAGTATAG 
GCTTCAGTGC 
CCGGCAACTA 
ACGGTATGTC 
TTCAGATGTT 
TCCAGATTTA 
CCTTCCAAAA 
GTACTTCCAT 
CTGCTACTCT 
TAAGCGATCG 
CTCCATTGCT 
GAGCAGTTCT 
GAGGTGATGG 
GTTATAAGAA 
TTACGATGGA 
CTTTTGCCGA 
AGCGATCGGC 
ATTGGGGATA 



CGTAGCAGCT 
CCCTCAGGAT 
AGCATCTATT 
AGGCTTGGCT 
T AGAATT AT G 
CAATAAAGTC 
TAGCGGGGTA 
GATTCAGATG 
TGAGAAAGGT 
TGGCTTTGTC 
TTATTTTTCC 
CAATGCCAAA 
TCATAAGTTT 
GGAATATGGA 
CAAAAGAGTG 
CAACTTTGTG 
ATCCGGAGAT 
TGTGGGGAAC 
AAATGTAGTG 
TATCAATAAG 
GCTTTACACA 
CATCAATCAG 
CGAAACAGGA 
GCTTTCTGTC 
CTTCTCTGTC 
GGATGGGCAA 
TGTCACATTT 
ATTCTATTCC 
CTATAATGCC 
GAACTATACC 
AGGTACTTCG 
TACTTCTCTG 
TTTCCATCAT 
CGATAATCCT 
TCCTTATTCT 
TTACAGATTT 
CAATCCTGCT 
TGGTTCGTAT 
TATGTCCAGC 
CGGATCTATT 
ACTACGTTTC 
AGCAGGCAAT 
CGGTGTAGGA 
TGGCTTTGAC 



CTCTTCTGCG 
ACTATATATA 
GAGATCGAGG 
GTAGGTGATG 
CGTCAGGGCT 
TATCTGAAAA 
AAGAAGTCTG 
ACCAGAAATA 
TATCGOGATG 
AATGTGCTTA 
GGCAACAAGG 
TTCAGTCTTA 
GTGGAAGAGA 
TATCGTGATG 
GAT AT TTATC 
CGCAATTCAC 
GTGTACAATC 
CTGTACTATA 
GGGGATTCTG 
GTGATCATCA 
AAGCCCGGCC 
CTTGGGCATT 
ACAGTGGATA 
GGTTGGAGTC 
GGCAACTTGC 
ACACTATCAC 
ATGGATCCAT 
AAGACTACGG 
TACTATAATA 
GGCGACCTCT 
ATCGGTTACG 
AACTACACCT 
6GCTCGGCTA 
ATTTATACCA 
TTGTGGGACA 
ATCGAGTATC 
ACGCATAAAT 
AATTCCAATA 
TATTATGGTG 
GCCGGTAATA 
CCGATTCTGT 
GCGTGGCGCA 
TTGCGTGTAA 
CGTCCGGACA 



CCAGCATAGG 
GCCCTGAAAT 
GAATGACGTC 
AAGTCCTGAT 
ACT TCTCAAA 
TCATTGTCAC 
AGAGAGAAGA 
ATGAAGACAA 
CCAGCATACG 
TCTCG AT TG A 
CCCTTAGCAA 
GAAAGCATAT 
GCTACCGTGA 
CTCAAATACT 
TCAACATCGA 
AATATCCATC 
AGAGACGATT 
ACAATGGCTA 
TTTCCCTTGA 
AAGGAAATAC 
AGCTCTTTAG 
TCGATGCCGA 
TAGAGTATGA 
AGTCCGGACT 
TCCATCCCTC 
TGAGTGCTCA 
GGTTTGGGGG 
CGATTGACTC 
GCTACTACAA 
ATACTCAGGC 
GTAAGCGTTT 
ACTATAGACT 
ATGATCTCAA 
GAAGCGGATC 
ATCATGACTA 
ACAAGTGGAA 
ATACACCGGT 
AGAAATCTCC 
GCTACATGAA 
ACTACGACTA 
TTGAAAACTC 
GTATCGACAA 
CGTTACCGAT 
ATTCT CT ACA 



G ATCCT G AAA 
ATCCTATGCC 
TTTCGATGAC 
TCCTGGAGAT 
TGTGCGAATC 
TGAACGTCCT 
TCTTGAAATG 
GGTCAGGCAA 
GATAACGCAG 
GAAGAAAAGC 
TCATAAGCTA 
TCGCTCATCT 
AGATTTGGTC 
GACCGACAGT 
AGAGGGGCAG 
GGAGTATTTG 
GGCTAAGCGT 
TATTTTTGCG 
TATTCGTATA 
TGTCGTGTAC 
TCGCGAGGAT 
AAAATCTATT 
TTTGGTGCCG 
TCTGTTCCGA 
GATGTATAAG 
GACCAATGGA 
CAAGCGGCCG 
CAAGTTCTAC 
CAACTATAAT 
CAGCGATCCG 
GACTTGGCCG 
GCGAAATTGG 
CTTGGAGCTG 
GGATTTCATG 
TGCCAGCCAG 
GTTTAGAGGA 
GCTCATGAGT 
TTTCGGTACT 
TGAGACTATA 
TGCATATGCT 
ATTCAATGCG 
TTATAATCCC 
GGTCGGAATG 
GCGAGGAGGA 



2220 
2280 
2340 
2400 
2460 
2499 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



sL.fJ '^j; , ,ii„ fin!" C3i a* iL^'iL*^ Jlw dr. iL.« £ J 5 ; - * , 



WO 99/29870 PCT/AU98/01023 

121/490 

AGCAATGTCC ACTTTGTGCT CGGACAGGAG TTC 2673 

(2) 1 1 i FORMAT I OH FOR SEQ ID NO: 17 9 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDHESS: double 
10 (U) TOPOLOGY: circular - 

til) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

15 

Civ) ANTI-SENSE: NO 

(v1 ) ORIGINAL SOURCE: 
2Q tA > ORGANISM: PORYPHYROHOHAS GINGIVAMS 

<ix) FEATURE: 

(A) NAI IE/KEY: misc f eature 

(B) LOCATION 1...S22 

25 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO; 17 9 

ATGAATGGCG ATATGAAACG GTTTTTGATT TTGATCGGCT TTGCACTGGC GGTAGCTTTC 60 

TCCGGTTTTT CCCAAAAGTT CGCTTTGGTA GATATGGAAT AT AT CCTCAG GAATATTCCT 120 

GACTATGAGA TGATGAACGA ACAGCTGGAA CAGGTGTCCA AGAAATGGCA AAATGAAATC 180 

GAAGCTCTCG AAAATGAAGC CCAATCTATG TATAAGAAGT ATCAGAGCGA TCTCGTATTC 24 0 

TTGTCTGCTG CACAGAAGAA AACCCAAGAA GAGGCTATCG TAAAGAAAGA GCAGCAAGCA 300 

TCCGAGCTCA AGCGGAAGTA TTTCGGCCCG GAGGGGGAGC TGTATAAGAA ACGCTCCGAT 360 

CTGATGAAGC CTATTCACGA TGAGATTTGG AATGCTATCA AAGAGATTGC CAAGCGTAAC 420 

AACTATCAGA TGGTGCTTGA TAGAGGTACG TCCGGAATTA TCTTTGCCAG TCCGTCTATT 4 80 

dD GACATTAGCG ACCTTGTACT GAGCAAGATG GCCTTTAGCA AG $22 

(2) INFORMATION FOR SEQ ID WO: 180 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 
<C> STRAW DEDNESS: double 
(0) TOPOLOGY: circular 

45 

(ii> MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

50 liv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVAL! S 

(Ix) FEATURE : 

(A) NAME /KEY: raise feature 

(B) LOCATION 1...510 

<xi) SEQUENCE DESCRIPTION: SEQ ID HO: IB 0 - 



55 



60 



ATGAAACGGT TTTTGATTTT GATCGGCTTT GCACTGGCGG TAGCTTTCTC CGGTTTTTCC 60 

CAAAAGTTCG CTTTGGTAGA TAT GGAATAT ATCCTCAGGA AT ATT CUT GA CTATGAGATG 120 

^IS^r 0 ^ 0 AGCTGGAACA GGTGTCCAAG AAATGGCAAA ATGAAATCGA AGCTCTCGAA 180 

ft* ^«™ CCC AATCTATGTA TAAGAAGTAT CAGAG CGATC TCGTATTCTT GTCTGCTGCA 240 

S?™^ 7 ^ ^CPJKGPJKGA GGCTATCGTA AAGAAAGAGC AGCAAGCATC CGAGCTCAAG ^00 

CGGAAGTATT TCGGCCCGGA GGGGGAGCTG TATAAGAAAC GCTCCGATCT GATGAAGCCT 360 

*Z T< ^ MG AGATTTGGAA TGCTATCAAA GAGATTGCCA AGCGTAACAA CTATCAGATG 420 

GTGCTTGATA GAGGTACGTC CGGAATTATC TTTGCCAGTC CGTCTATTGA CATTAGCGAC 4 80 

7Q CTTGTACTGA GCAAGATGGG CTTTAGCAAG JJJ 



75 



12) INFORMATION FOR SEQ ID NO: 181 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 ba&e pairs 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



,„p liJ /,J' !.!!& iLI'ii ,„iL III!, ! iCIi iOi 



LI ;cp i£~K 



% „ii ii r - 



s .4 



WO 99/29870 



PCT/AU98/O1023 



122/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



<BJ TYPE: nucleic acid 
(CJ STRAMDEDNESS: double 
(D) TOPOLOGY: circular 

lii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

liv) ANT I -SENSE: HO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISH: PORYPHYROfiONAS GINGIVALIS 

(ix> FEATURE : 

(A) HAIIE/KEY: misc feature 

(B) LOCATION 1...489 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 181 

ATGAAGAAAT TTTTTCTCAT GCTTCTGATG GCTCTTCCTT TGAGCCTCTT 
GTSGCAGTGG TAAACACTGA G G AGAT CATT TCCAAAATGC CGGAACAAGT 
AAACAGCTCA ACGAATTGGC CGAAAAGTAT CGCCTTGATC TCAAGAGTAT 
TTTGCCAAAA AGACAGAAGA ATTTGTAAAG GAAAAAGACT CTCTACTGGA 
AATCGTCGTC AGCAGGAACT TCAGGATATT CAAACTCGTT ATCAGCAGTC 
ATGCAGGAGG ATTTGCAAAA GCGCCAACAA CAGCTTTTTG CTCCTATCCA 
GCTGATGCCA TCAAGAAAGT GGGTGACGAA GAAAACTGTG CCTACATCAT 
ATGATGCTTT ACACCGGAGC TACTGCTATT GACTTGACCG CAAAGGTAAA 
GGAATCAAG 



(2) INFORMATION FOR SEQ ID NO: 162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 

(B) TYPE: nucleic acid 
<C) STRA1IDEDNESS : double 
(DJ TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (qetiomic) 

(Hi) HYPOTHETICAL; NO 

liv) ANTI -SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPJIYROMONAS GINGIVALIS 

dx) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION 1...24 01 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 



GGCACAAAAG 
AGCTGCTACC 
GGACGATGAG 
GAACATCCGC 
ATACCAAACG 
ACAAAAGGTG 
GGAGGCCGGT 
AGCGAAACTC 



ATGAAGGAAG 
TCATTCATTC 
GTATCGGGCT 
CGTGGTACGG 
GCCACGACGG 
AGCTTTCCGT 
GASCTTTCGA 
AATACCCGAG 
ACCTACGCAC 
AGCTACGATG 
CGCTCTGCAC 
TTCTCCGCCG 
CGCTACAAGC 
GCCTACTATG 
GCCAAATCGC 
GGACAGACAT 
ATTTCGCAAA 
AGCGATGCCA 
TACTTCGGTG 
CTTTCGGCCT 
AACGATGTGC 
AACGGCTTGG 



CTATTCCCCG 
TGCTATGCTG 
ATGTGCTCGA 
GGACAGGCGC 
ACAGCATCAC 
CTCTTGACCAA 
GCGTGACGGT 
ACCTTCGTGT 
GAGTAACGCA 
AGAAT AT GGT 
AGCAGGAAGG 
GAGGGTTCAC 
AACCGCAGGA 
GCAGCAGTGC 
TCTTGGGCAC 
TCATGACGTA 
CTCGCTACAA 
AAAAGTTCAA 
CCTTCAGCAT 
TCAACAGTAA 
AGCTGGGGGC 
GCATCGGGCG 



AAAGAACAAG 
CCTGCTATGC 
CCGTGGGGAA 
AACGACGAAT 
GATCGAGTTC 
GGACACTCGG 
ACAGGCCACA 
CAATGCAGGG 
GAACAATGAA 
CTATGTAAAC 
TCTGAGCTTC 
GGCCGACTAT 
GAAGGAAGGA 
CGGAGCCTTC 
TACGGACACG 
CCGTTTCAGC 
GTTTGTCCCT 
GATCTTTTTC 
GAACTTCGTG 
CGAACGGGAG 
GGACGGAACT 
CAATCACGAG 



TATATAAAGC 
TCTCAGGCAG 
AAGCCGATCC 
CTCAAAGGAT 
AGCTCCATGG 
CTGAATGTTC 
AAACGCAGAC 
CCTACGGGAG 
CTAAGCTCGC 
GGAGTGGAGG 
GTCAATCCGG 
GGCGACAAGA 
GCGGTACTCC 
AGCCAAATCA 
AAAGCCGAAT 
CCCAAGCTGT 
CAGACCCGTG 
GACGGTCAGG 
CCGGACGACA 
ACCTACGATA 
GCTTCGATGG 
CATGCGCGCA 



TCAACGGTAT 
CTATGGCACA 
CGTTCGCCGG 
ACT ACGAGT T 
CGTACCAAGG 
GTTTGGCAGA 
TCAACACGAT 
GGGTGGAATC 
AATACTCGGT 
TTTATCGCCC 
ATCTGACACA 
TGTCCTCCGT 
TCGGGATGCT 
CGCGTCTACG 
ACGATCCGAT 
CGGTTAGTTT 
AGACGAGCTT 
AACAAGAT03 
AACAGCGGCA 
TTCAGGGAGA 
CTTCGGGCTC 
ACAGG CTGAG 



ATACAGATTG 
AGGCGTCAGG 
AGTCAAAGTG 
TCGGATGAAG 
GGTAAGTCGC 
GGCCGAGATG 
GGAGCGCGTC 
GCTCATCAGT 
TCGCGGAGGA 
GCTGCTGGTT 
ATCCGTACAG 
ACT GG AT ATT 
ACAATCGA'JT 
CTACAAGAGT 
CTATGCGGAC 
CCTCGGCAAT 
CGGTACACTG 
TTTCCTGACC 
TACGGTTACG 
ATACTTTCTG 
AGAGAACTCC 
CTACCGCGTG 



60 
120 
180 
240 
300 
360 
420 
430 
489 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOfiO 
1140 
1200 
1260 
1320 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



St-Ji ^.-^ 



\\ iCii jt^, tUJs lUi-li-Ji 



WO 99/29870 



PCT/AU98/01023 



123/490 



10 



15 



20 



CTGAACATGG 
GTATCGGCAC 
GTAGGATACA 
GATACGCAGA 
ATGGGAGGAG 
AAGGAGTTGC 
TTGGTACTGC 
CAGACGCATA 
CAGGGAGCTT 
TACAAGTTTA 
GAGAACGTGA 
GATCTCAAGC 
ATAAAAGCCC 
TACAATTTCT 
CTGCGGGCTG 
CCGGCCTTTA 
CACCCGGATG 
ATAGGGGCTG 
TCGGATGCCT 
CTGCGTCTCC 



GTTACAGAGG 
AGATGGAGAA 
ACCTACCTCA 
TGAGGGGAAC 
GTACATTTTC 
TCGTCAGCCC 
GTGCAGCCGC 
AGGATGCCGA 
TTCACATTCT 
CGGCAGAGGC 
AGATCCGCTA 
TCTTCGGCGA 
GTCAGAAACT 
CCTTCTTCCT 
CACTAAGCGG 
CCGCACCGGC 
ACTCCTTTGC 
ACCTCTTCAA 
ACCAACAGCA 
TCGTCGAATT 



GGAGATGAAG 
AATAGCCGAC 
CTCGGAGACC 
GCGCTTGTCG 
TCTCATTCCG 
ACGTATCAGC 
CGGACTTTAT 
AGGCAATAAC 
CGCAGGAGCA 
TTACTACAAG 
TCTGGGCGAA 
ACTGGTACCC 
GGATGGCTAC 
TCAGGAGTAC 
AGGATTGCCC 
CTATAAGCGT 
CGGCCGAAGC 
TCTGTTCCAC 
ATACGCCGTA 
C 



CTGAACGAGA 
CAT AT CAGCG 
GTATTGCTGA 
GCATTCGTAC 
GGTATCAGAG 
GTGGGTTATT 
TATCAGGCAC 
GTGGTTGTCC 
GACTATACCT 
AGCCTGTTCA 
AACAT CGGTT 
GGAGTGGATT 
GGTTCTTTAC 
GTGCCGGGCA 
CAGCTCAATC 
GTCGATCTGG 
AAATGGCTAA 
ATGACCAACG 
CCGAACTACC 



AGCATCGCCT 
AATGGGAACG 
TGl'ACAATAA 
AGGATCGATT 
CTTCGTGGTG 
CTCCCGAAAG 
CGTTTTACAA 
TCAACGAGAA 
TCGAAATGGG 
ACATCAACCC 
CGGGTTATGC 
CGTGGCTGAC 
CACTGATGAA 
ATAAACGCAT 
CGAGCAAAGG 
GGGTAATGTA 
TGGGAGTAAA 
TCAATTCTTA 
TGACACGCCG 



GGAAGCCGGC 
GAGGGATTCG 
CCTATATGCC 
CAACTTCAGC 
GAGCTTCAAC 
CAACCCGGCT 
AGAGCTAAGG 
GATCCGGTCT 
GGGGCGAAAA 
GTATATAATA 
TGCGGGTATC 
GGCTTCCATT 
cgcacccact 
cacagccacc 
gcttagctcg 
caaatggctc 

AGGGGCCTAC 
CTACTGGGTG 
CCAATTCAAC 



1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2200 
23-10 
2400 
24 60 
2481 



25 



30 



35 



(2» T N FORMAT lOW FOR SEQ ID MO: 183 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2016 base pairs 
CD) TYPE: nucleic acid 
[Ci STRAW DEDHESS: double 
(D) TOPOLOGY: circular 

(lij MOLECULE TYPE: DNA (genomic J 

(ill J HYPOTHETICAL: NO 

<iv) AUTI-SEHSE: NO 



40 



lvi> ORIGINAL SOURCE: 

(A) ORGANTSIt: PORYPHYROMONAS GIIIGTVALIS 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION 1...2016 



45 



50 



55 



60 



65 



70 



75 



(xi> SEQUENCE DESCRIPTION: SF.Q ID HO: 163 



ATGTACAGCG 
TTTGCTGCCT 
TATACCAAGG 
GAGATGAAAG 
GCCAAAGCTT 
CTGCTCCGTT 
TTCCGTGGAT 
TGTCTCTTTG 
GAGTGGAATT 
TATTTCACAT 
AAACCCAACG 
AGCGTGTCCG 
GGTAGTACCA 
ATCTATTCCG 
ATGCGCGATT 
TATTTCGTCA 
TCGGATCGTT 
GAAATGTTTC 
GGTCTGGGAG 
GTCAATATGG 
AAAGGCAAAA 
AACCGAGGCG 
ACCGAGATTC 
AGGATCGTAG 
TCCTATAAGA 
TATTTGAATC 
GTGGACTTTT 
TTCGATAAAG 
CTCACCGACA 
GATGCTTACA 



GACATCATAA 
GCAAGAGCGT 
CTGCCGATAT 
CTTATACGGC 
TGCGTGGCTA 
TGGCACAGAC 
ATCTGGAAGC 
CCCGCCAGCA 
CGGCACGGGG 
CGAGCAGAAG 
ACATTTATAT 
GAGGAATCAA 
TATATTATAC 
CCGCT CGGAG 
CGCTCCGTAT 
GCAATATAGG 
CTTATGGTTC 
CCTTCATAGA 
GACTGGATAT 
GACAACCGGT 
ACAAAGAAGA 
ATGCACGCGG 
AAGGTTATGT 
GCGAACGCGG 
TGAGCGTGCA 
AGTACGTAGA 
TCCTTGCATC 
CTACTCTTCG 
ATCCGGATAT 
ACCTCGGACT 



AATCCATTAT 
GAAGTTGAAA 
GTACAATACA 
TTTCCGATCC 
TCTGAATGCC 
TTATCAGCAA 
TTATCCGAAA 
AAAGGAATAT 
CGACTTCGGC 
CAAAGACGAC 
CATCAAACGA 
CACTCCATGG 
GTTGGCGCAG 
CGGAGAAGGC 
GGCTGCTCAT 
CGGTAGCTAT 
ACCGGAGAAT 
TGGGGATAGT 
TTTCAAAGCC 
CAATTCCTCT 
AGCTTTGCCG 
ATGGCCGCAC 
GATGGACAGA 
CCCCGTAGGA 
GGGCGATACT 
ACTCAAGACC 
GCGTGAGAAA 
CCCCGAAAGC 
TCGGATCGAA 
ATCTGACCGC 



CCTTTTCTTA 
GAT GCG GAGA 
TTATACAGGC 
GGTGAAAACT 
AGACGCTACG 
GGAGGTAACT 
AGTTATTTTG 
CCTACACGTT 
CCGGCCTATG 
GGTTTGGATA 
GATGCACAAG 
GATGAAGGCG 
CAAGGAGCCG 
GGTTGGAGCA 
CCCTCTATGT 
GGCGGCAAGG 
TTGGGGCCTG 
ACCCTTTTCT 
ACGCTGGACT 
GCCGATGATT 
GACAACGGAG 
CTCTTCCATT 
GAAGAAAATC 
CAGGGATTCG 
CGCTATGTAA 
GATACCGCCA 
GCCGAGGGCT 
ATGAAGAGCT 
TTGGGTTCGC 
AGAGCCAAAT 



TCCTGTTGGT 
AGGCACATGA 
GTACCCGACG 
ATCGTGCCGC 
GGTATCCGGA 
ATAAGGAAGC 
CAGCTATCGG 
ACCGGATACG 
CACCCGATGC 
ATAGCAGCAT 
GACGATGGGG 
TGCCAACGAT 
ATTACGACCG 
ACGGTTCGCT 
CGGCATCCGG 
ATATTTATCG 
ATATCAATAC 
TCGCTTCGGA 
CTACCGGCCA 
TCGGCTTGGC 
TCAAAGGTGT 
TCGAACTGCC 
CCATAGCCGG 
TGACTACTCG 
TGCTTGCCGG 
AGCAGAGTGA 
TGCAAAATAT 
TGGACGAACT 
ATGCCGACAG 
CCGTGGTGGA 



ATGCCTTGCT 
TCGCCAAGAG 
AAAGCAGGTG 
CGGCAGACAA 
TTCTGTGGTA 
CGAGGTACTC 
TTTGGAGGGG 
GCGAGCTGCC 
TTCGGCTCTC 
AACGGGACTG 
ACGTCCCGAT 
CACGCCCGAT 
TACGGTACAG 
CGTGGACATT 
CGATTACCTG 
TGTCAAGGTG 
GCCGGGGGAC 
CGGACACGCC 
ATGGCATGTA 
TGTGGAGCCT 
ATTTTGTTCC 
GGCTATCTAC 
AGCCACTGTC 
TGACGATGGC 
AGCATCGGGT 
GACCTACTAT 
TTTCTATGAT 
GATTCGTATC 
GAAAGGCCCC 
TTACCTCACG 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1600 
1740 
1800 



SUBSTITUTE SHEET (Rale 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



124/ 490 



AGTCGTGGCA TAGCGGCCGA CAGGCTTACG TGGAAACGCT ACGGTAAGTC TGTCCCCAAG 
ACGGTGACAG CCAAAATTGC CGAACGGCAC GAT TTCCTGA AGGAAGGCGA TGTGCTCACC 
GAGGAATTCG TAGCACCTTT GACCGAGGAG CAGCAGTCAG TCTGCGACCA ACTGAACCGT 
CGTACCGAGT TCCGTGTGAT CGAAGAAGAG TTGCGT 



I860 
1920 
I960 
2016 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 184 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2124 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
ID> TOPOLOGY: circular 

Hi) MOLECULE TYPE: DMA (g~nonuc) 

(iii) HYPOTHETICAL: NO 

liv) A1ITI-SEHSC: NO 

Ivi) ORIGINAL SOURCE: 

{AJ ORGANISM; POR Y P HYROI -IONAS GINGIVALIS 

(ix) FEATURE: 

<AJ NAME/ KEY: mJ.-j C _feature 
(B) LOCATIOIf 1. . .2124 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 4 



ATGAAAAAGT 
ACGGACAACG 
TCTACACGCA 
CGCAACATCA 
CTCGATGTCA 
TCCGGCAAGT 
ACGCTCAACA 
TACGGCACCA 
ATCCAT GGC A 
TTGGGTGGCC 
AACAAGGACT 
ATAGTAGATG 
ACGGGACGTC 
AACGTATTCC 
TCCAAAAAAA 
CACACGCTTC 
GATCCCACCG 
GACAAGATTT 
ATGGAGTCAG 
GCCACGAACA 
TCGATATCTG 
CTCAACAATG 
TATGAGTTTG 
GATGCTTTCC 
GACCTGAAAC 
TGCGGGATCC 
TTGTCCAGCC 
AT GAGCGGT A 
TCTCTCCGCG 
ACCGATGCCC 
GAATATCGTG 
ATCGAGCAAA 
GCAGCAGAAG 
TTCAATGCCT 
AACATCTTGG 
ATGGGTAAGG 



TTTTCTTCGC 

TCCCGACAGA 

CCGCCGTACC 

AGCAGTCCGG 

TACAATACCC 

ATGTAACCGT 

CGAGCAACAT 

ATGCCATGGG 

ACGTTTCTCT 

GCTTCGAGGA 

ATAAGACCGG 

TAAATGCTAC 

TGCGTTTCGG 

TCGGAGATGC 

ATCTGAATCG 

AATTCTCCCC 

GTTTCATCAA 

CCTTTGGAGG 

AAAGATTCGA 

ATATCGGTTT 

CCGGTGCACG 

AAGCCAAACA 

TGAAAGGCCT 

AAAAAGCAGG 

CCGAAAAGTC 

AAGCCGACGT 

CTGACTATGC 

TCGAGGCCCT 

CATTTGCGAA 

CTTGGAGCGA 

GCAAAGAAGG 

ACTGGTATGC 

ACCCTGAATT 

CTGCTTACTA 

ATGAGCTTTA 

TTATGGTCAA 



GCTACTATCG 

TTCGCTACGA 

TCTGAAAAAG 

CTTTAACAAC 

GGGCTTTAGT 

ATTGGTAAAC 

CGAACAAATC 

CGGTGTGGTG 

CTTCGGCGGT 

TATTTTCTCA 

ATCAAACAAT 

CAAAAACAAG 

TATCGACTTC 

GATCCCCGTA 

TTCTTCGACC 

CTACTTCAAC 

CTACAAAAGC 

ACAAAATATC 

GCAGGCAGGA 

GTTCGGACAG 

TGCCGACTTC 

GGAAACTCAT 

TACAGCTCAT 

CCAATACGTA 

CATGACCTGG 

AACCTTAACC 

TAATAATATC 

TTTGTCTTAT 

TGCCACGATC 

AATGTACTAC 

ACTTGAAGTG 

TTACTACCCC 

GGCTGCTCAG 

CCACATGAAC 

TACGGAGAAA 

CTTC 



ATTGGT ATT T 
GTACACAATC 
AT ACCGG CCA 
ATGACCGACA 
TCGAACATCG 
GGCATCCCTG 
GAGATCCTCA 
AACATCATCA 
AGCTACCAGA 
TTCGATCTTA 
TTCCTATCCC 
AAAATGAAGG 
ACGCCCGAAT 
GGAGGATCTA 
TCTTTCGAGC 
ATAGAGAAAT 
GACTACTACA 
GTACTCGGTG 
GTGAATACAA 
GCCAATTTCT 
ATGTTCTTTG 
AACGTAATCA 
GGTACATTCG 
GGCCCGTTCG 
GACTTCGGTA 
TATTTCCACA 
ACCACATACA 
GACTTCGGCA 
ATGCTCAATT 
GTTCGCAAGC 
ATGCTCAACG 
GAAGTTCGCC 
GGACTGCTCC 
AAGTATCTCA 
GACGGCTACC 



CAGCGCAGGC 
TTCAGACCGT 
AGATGGAACT 
TCCTCAAGAC 
GTATCCGCGG 
CGGGAACGGA 
AAGGCCCGTT 
CCCACAAATC 
CCATGGCCGG 
GTCTGGGCTT 
TGAGCAAACT 
GGAGCGACTA 
GGTCCCTGAA 
TATGGGGCGT 
TGCTCGGCAA 
CGGAGAACTA 
CCTATGGTCC 
TCGACAGCCG 
AGCCATACAA 
ACCTGCTGAA 
ACCTGAAAGC 
ATCCGAATGT 
GTAGTGCATT 
GCACG AC CAT 
TCGGATACAG 
CCGACCACAA 
TCAATGCCGA 
GCCTCTTTGC 
CCGAGATGAA 
AGAACATCAC 
GTCGCTTCAT 
CCGAACTCCA 
GTCATCCGCA 
CCT TCGGTGT 
ACATGCCCGG 



TTTTGCCAAG 
CACGGTCTAT 
CATCTCATCG 
GCAAAGTTCG 
TTTCAAGCCC 
CAATATCTC? 
CTCTTCCATC 
CAAGGACAAG 
ATCATTCAAC 
GGACAAGCAG 
GGAAGAAGCT 
TACT GT AGCA 
TCTGTATCAA 
TTACGGAGAA 
ACATGGCTGC 
TAACAATGCC 
CCTACTCCAG 
AAACATGACG 
CCCCGGATAT 
CGATGCTd'A 
GAACGAGTAT 
CGGAATCAAA 
CAGTGCTCCC 
AGGCAATCCT 
CAATGCACGC 
AGATCTGATC 
CAAGGCTCGT 
CAACAAGTTC 
GAAAAGCCAG 
CTTCGGTATC 
GGCACGCAGG 
GCAACTGCTT 
AGCAATGGTG 
GAACrTGAAC 
ACGTAACATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
€60 
720 
7R0 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2124 



70 



75 



(2) INFORMATION POR SEQ ID NO: 165 

<i> SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 1386 bases pairs 
IB] TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



^u, cim ^ ii/ji ci: £Z ^.'i^ij vi* 1 •**;*, 



WO 99/29870 PCT/AU98/01023 

125/490 

Ui) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

5 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

10 (ix) FEATURE: 

(A) NAME/KEY: misc feature 
(E) LOCATION 1-..1386 



15 



20 



30 



35 



40 



45 



50 



55 



60 



65 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 185 



(2) INFORMATION FOR SEQ ID NO: 186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 
<B> TYPE: nuclei c acid 
(CJ STRANDEDNESS: double 
{D| TOPOLOGY: circular 

(ii) HOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( ix J FEATURE : 

I A) NAME/KEY: mlsc_f satuie 

(B) LOCATION 1...1476 



ATGAACAGGT TTTCAAATCA TTGGCCCTGC ATCCTCGTGG GGTTTGTACT CTGGTTTGTA 60 

TCGGCGAGTC GGACTGTGGC ACAAAACGCC TCCGAAACGA CGGTATCGTA CGATACGGAT 120 

ACCGCCGTAC TCTCCGAAGC CGATGTGCTT CGGATCGCTC TTAGTGAGAA TGCCACAGTG 180 

AAAGTGGCCG ATATGGATGT GCGCAAACAG CAATATGCAC GTAGGGCAGC ACGTGCCGAT 24 0 

CTCTTCCCGA AAGTAGACCT CAATGGCGTT TACAGCCATA CGCTAAAGAA GCAGGTCTTA 3«0 

TATATAGATA TGCCCGGTTT CAGCAGTAGC GAAGGTATCG AAATGGGGCG TACACACAAT 360 

ACGCAAGGAG GGGTGAACGT CTCCATGCCA TTGGTGTCGG CACAGCTTTG GAAAAGCATT 420 

GCCATGACCG GAGAACAGCT CGATCTGGCT CTGGAGAAAG CTCGCAGCTC CCGAATCGAT 4 80 

TTGGTGGCAG AGGTGAAGAA GGCTTACCTC AGTGTATTGT TGGCCGAGGA CTCTTATGGC 540 

^ GTATTCAAGC GCAGCTATGA CAATGCTCTG GCCAATTATA AGAACATATC CGACAAGTTC 600 

GATCGTGGAC TTGTGGCCGA GTATGATAAG ATTCGAGCCA ATGTACAGGT ACGCAACATC 660 

GAGCCTAACC TCTTGCAAGC GCAGAACTCC GTAGCCCTTG CTCTCTGGCA GCTCAAGGTC 720 

CTGATGAGCA TGGAAGTGGA AACTCCGATC AGACTCTCCG GTTCATTGTC CGACTATAAA 7 80 

GAACAAGTCT ATACCGGCTA TTTTGCCGCC GATACGCTTA TTTCCAACAA CTCCTCCCTG 84 0 

CGTCAGCTCG ATATACAGCG TCGTCTGGCT GTCAGTGCAG ACAAGCTGAA CAAGTACAGC 900 

TTCCTGCCTA CACTCAATCT GGGAGGGCAG TACACCTATT CGCTCAACAG CAACGACATC 960 

AAATTCTG3G GCGAGGGACA ACGCTGGACG CCTTTCTCCA CCATATCGCT CAGCCTGTAC 1020 

ATTCCTATAT TCAATGGAGG CAAACGTCTG TACAACGTGA AGCAAAGTGC TTTATCGATC 1080 

CGTCAGATCG ATCTGCAACG ACGCCACATA GAGCAATCCA TCCGAATGGG AATCAAGAAC 114 0 

CAAAATGACC GTCTGCGTAC CTGTATGCAG AGATTTGTGG CCTCGGAAGA GGCTGTCCGA 1200 

AGTGCAGAAA AGGGCTATCA G AT AG C AG AG AAACGCTATC AGACAGGCGA AGGCACTCTC 1260 

GTCGAGCTCA ACGATGCCGA TGTGGCTCTT TTGCAGGCTC GACTCAATTA TAATCAGGCC 1320 

AT ATT CG ACT TTATGACCGC AAAGGCCGAA TTGGACAAGA TGAACGGCAT GGGGATTCCC 1380 
GAACAA 



1386 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1*8 6 

^™^ GGG ACAGCGATGG AGTGGCGCCG AACCAAGTGC GCCGAACGCT GGTGAAGGTA 60 

^FTJ^IS AATCCCTTCC TCCGGGTGCA AAACAGATTC GT AT CGGATT CTCTCTTCCG 1 20 

f^^^S AGGAAAAAGT CACCGCCCTA TATCTCCTTG TGACTGATTC TTTAGCGGTG 180 

^^^T TGC CGGACTACAA AGGGCGAGTC TCTTACGATA GCTTCCCGAT CTCAAAGGAA 2*0 

70 SSS^S CAGCCCTTTC TGCGGATTCG GTAGCCGGAC GCCGCTTCTT TTATTTGGCT 300 

/U ^G^TATAG GGCCTGTTGC TTCTTTTTCC CGATCCGATA CGCTGACTGC CCGTGTGGAA 360 

™SSTS? TCGATGGCCG CCCTTTGCCG TTGAAAGAGC TGTCGCCTGC CTCCCGTCGT 420 

?I™I AGGG GGTA TGAGGC CCTCTTTGTA CCCGGTGATG GCGGATCGCG GAACTATCGT 4 80 

ilSSS™™ TTTTGAAAAC GGCTAATGGA ACACTCATAG CGATGGCCGA CAGACGAAAA 54 0 

75 r^lln^ ^GGATCTGCC GGAGGATATA GATATAGTCA TGCGGCGCAG TACGGACGGA 600 

75 GGGAAATCGT GGAGCGATCC CAGGATTATC CTACAGGGAG AGGGGCGCAA TCATGGCTTT 660 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



lUIi "7^ n'Pii „£}„ W,", 



#31 



WO 99/29870 



PCT/AU98/01023 



126/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



60 



65 



70 



75 



GGCGATGTAC 
GTAGGCCTGT 
GACGAAGGAC 
TGTOCCGATC 
GTGCTGCCAT 
GTCCTGAACA 
TGTGCATACC 
ATGAGCATAC 
GACGATGGCC 
AATGGAGCTA 
CTCGGCCCGG 
TGGTCCGCTC 
GCGGATGGAA 
CGGTTCGTCC 



CCCTGGTGCA 
GGCAGTCTAC 
TGACTTGGTC 
CGGGACGCAG 
CCGGTCGTAT 
ACTATGTCCT 
GCCGTGGCGA 
GCAATCAGGG 
TTACTTGGGA 
TGCTTCAAGT 
ATGGGCGTCG 
CCGTTGTTGT 
CGATCGGTTA 
TTGACGATCT 



AACCCAAGCA 
CCCCGATCGT 
GCCTCCTCGG 
TCGCTGGTTG 
CACGTTTGTG 
CTATAGCGAC 
TGAGGCAAAG 
ACGGCAGGAG 
GAGAGCCAAG 
GAAAAGGAAC 
CGATGGAGCT 
CAATTCAGGA 
TTTCGTCGAA 

CTTCGATGTC 



GGAAAGCTCC 
CCTCAGCGCA 
GATATAACCC 
GCCTCCTTTT 
GCTGCCATCC 
GATGAGGGCG 
CTTTCATTGA, 
AGCCGACAGC 
CAGTTCGAGG 
GGAAGGGATC 
GTCTATCTCT 
TCGAGTCCCT 
GAGGGCGATG 
CGGCAA 



TGATGATCTT 
CTTATATATC 
ATTTCATCTT 
GTGCTTCGGG 
GCGAATCAGG 
ATACATGGCA 
TGCCCGATGG 
GTTTCTTCGC 
GCATCCATGA 
AAGTGCTGCA 
TCGATCATGT 
ACTCGGATAT 
AGATCTCATT 



TGTCGGTGGA 
GGAAAGTCGG 
CGGCAAGGAT 
ACAAGGGCTT 
GCAGGAGTAC 
GCTTTCCGAC 
CAGGGTACTG 
TCTCTCCTCC 
CCCCGGCTGT 
CTCCCTGCCT 
CTCCGGCCGC 
GACTCTGCTG 
GGTT1TCATT 



(2) INFORMATION FOR SEQ ID NO: 187 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LtNGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

UU MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY P HY ROMON AS GINGIVALIS 

(ix) FF.ATURE : 

(A) NAME/KEY: miac feature 

(B) LOCATION 1...73S 



(xl) SEQUEHCE DESCRIPTION: SEQ ID NO: 187 



ATGAAAAAAG 
TATGCTCTTG 
GGTATGGCCG 
CAGATGATGG 
GACTATTTGA 
TCTAAAGAAT 
GGGTATCTGC 
ATATCTCGCG 
GCCTATTACT 
GACGCTTTCA 
CTATCTTCTT 
TGGGGAGGTT 
TCTTTCGCTT 



AAAAACTTTG 
GCCGCAGTGT 
AGCGTAATTT 
ATCTCGAATC 
AAAACAAGCA 
ACAACTACTA 
TCAGCCAGAC 
ATATAACGGA 
ACACCAAGCT 
ATCGTGCTTC 
CGATGGGCGT 
CGTTCAATAC 
TGAAG 



GATTGCGATC 
CGCTCAGCTA 
CAAATCCGAT 
GGCCTACAAG 
GCTGCCCGAT 
TTACGATCCT 
AGTTACGGTG 
GCTGATCAAT 
CAATGAT CTG 
GGTCATTGCG 
GTTCCAGATA 
GTCT TCCAAG 



GTCGCCGGTT 
CGCCGCTCTC 
CTGATCGTTT 
GCTTTGAAGG 
TCGTCTTATA 
CGGCAGGAAC 
ACCTCACAGG 
CAGGGGGTAG 
AAGGTGGAGA 
GAGGGGAGCG 
GTGGGGCTCA 
ATGAAGACGG 



TGGCTTTCGT 
AGCCTTCGGT 
GGACTGCTTC 
AAAAACAGAT 
TCTTCTCAAG 
AAAACGTCAG 
ACATCGAACA 
AGATTACCTC 
TGCTGCGCAA 
GTTCCTCCGT 
ACTCGAACGA 
CAAGCATAAC 



ATTGGGCCTT 
GACTGTGACC 
GTACCAGCTC 
ATTGGTAGCA 
CGTAGCCATC 
GACCTTTGCC 
TGTGGAGAAA 
CGACCGTCCG 
TGCCTCCGAA 
GGGTAAGATG 
AGATTATAGC 
GGTTAAGGCT 



55 (2) TN FORMATION FOR SEQ ID HO: 188 



li> SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 828 ba^e pairs 
CD) TYPE: nucleic acid 
(C) STRANDEDUESS: double 
ID) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ili) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(lx) FEATURE: 

(A) NAME/KEY: misc_featur- 

(B) LOCATION 1 . . .028 



720 
7S0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1476 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
735 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



-ii'c:;)! « j» o o ,«i *o i^' ^:M";L.^'i];:^^^"? t ^' , ^' 



WO 99/29870 PCT/AU98/01023 

127/490 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 

ATGAAAAAAA CAATTGCAAT TATCGCCTCA GCCCTCTTGG CTTTAGGAGC CGTCGGCTGT 60 

AAGAAAAATG CTGACACTAC CGCTGTCAGT GAAAAGGATA GCATAGCCTT GTCCATGGGT 120 

, ATTTTGTACG G ACAGGAT TT TGCCAATCAG TTCGAAATGT CCCGCTTGCA AGGCCAGCCG 180 ' C ^V^'""' 

ATTGATTCGG TAGCTTTCTT GGACGGTTTC AAATATGGTA TCGATACGAC GCGCTTCTCG 24 0 ..„--; 

TACAATCTGG GAGCCATCTA TGCTTCCAAT ATAGCT CGTC AGCTGGCTCA TGATTCCATC 30O ' h " 

GATATCGACA AGTTCTATGC AGCCATGCGT GCGGCTCTTC TTAAAGACAC CGTATCTATC 360 - , . C^-^Z'<- r- „ 

GCCATGAAGC CTGCAGATGC AC AGGCTT TC ATGCAACGAA TCCAAGCCAA AAAGCAGCGA 420 

GAAAACAATA TGAAGCAGTT TGGCCAGAAC AT CGAAAAGG GTAATGAATA CATCGATACC 4 80 - . 

TTTAAAAAAG AAGATGGTGT AACTGTTACG ACAACTGGTC TGGCATACAA GACTCTTCAG 5*10 

GAAGGTACGG GAGCTACTCC CTCTTTGGCC GATACTGTAC GTGTCAAGTA TGTGGGTACT 600 * , s 

CTGGTCGATG GTAAAGAGTT CGACAAAAAC GAAGAAGGAA TCGAATTT GC CGTTACCGGT 660 

GTGATTAAAG GCTGGACGGA GATGCTCCAA CTCATGAAGG TCGGTCAGAA AGTTCGCGTG 720 

IO GTAATCCCAC AGGAGCTGGC TTATGGGGAG ACCGGCAACT ATACCATCGA ACCGTTCTCT 78^ 

ACCCTGACGT TCGAGATGGA ACTT ATCGGG ATCAAGCCCG GGAAAAAG 82 8 



10 
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12) INFORMATION FOR SEQ ID NO: 18 9 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2325 base pairs 
{ B) TYPE: nucleic acid 
tC> STRANDEDHESS; double 
(D) TOPOLOGY: circular 

(11 } MOLECULE TYPE: DMA (genomic) 

till) HYPOTHETICAL: HO 

llv) AHTI — SEWSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORT PHYROMOMAS GINGIVAL1S 

( ix ) FEATURE : 

(A) NAME/ KEY: miacjeature 
(8) LOCATION 1...2325 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 189 



ATGAAAGTAT 
TCCACCACAA 
ATACTCCGGC 
CTGAAAGTTA 
CCATTCGGGC 
ATATTCGACA 
GCTAAGGTGG 
AGTGTGACCA 
GCCTCTCCTT 
ATTCTGGCTT 
AAGCTGCACG 
TTCCGCCCAC 
CTGCGAGCCA 
AAACGGACAG 
GAGGATATGA 
CGCTTTCGTT 
TCCTTGGCTC 
ATTTCCGGCC 
TTAGAGACCT 
CTTGCTCGGC 
TATGAGTGGG 
TCGTACAACA 
CTGGATAAAT 
AACAGGGCAC 
CCCTCCAAGG 
CATCAGACAG 
CAGAGTCAGT 
AAAAGTCCTC 
CTGATCTATC 
TTCTCTCAGT 
CAGTCACTGG 
GCACCCTATA 
CGTAGCATCG 
CAGGTGGGCG 



TACGGCAAGT 
AGAATCTGCC 
AG GACAAG AG 
CACCCAATGG 
TATGGCTATA 
AGTTTGCAGC 
CGACGAACAT 
CTCTGAAAAA 
ATCATTACGA 
ACAGGCAGAC 
AAGAGCGTCA 
AG GAT ATT AT 
AGCTCTCGGA 
CAGTCCTGCT 
AAGTCCTTTA 
TCTTCTCCGG 
GTTTGGGAGC 
TTTTGGATGT 
TGTTCACGAG 
GCAATGTATT 
AGACCGGCAA 
TGAATACGGC 
ACTATTACTA 
ACTACTTTAG 
AACACCGGCA 
AAACTTTCCA 
TCCTTGCTCA 
AT CATCTTTG 
TGGCAGCCGG 
TCATCAAAGC 
CAACCCGTTT 
GCGAGCAGTT 
GCCCCGGACG 
AATTCAAACT 



ATTCCTCCCC 
GGAAGGCGAA 
CCACGCCGGC 
AGCTATTTTC 
CAACAGCTTC 
CAAGCCGGTT 
CCTCCGCGAA 
GGACTCGCTC 
CAGCATCATT 
TCCGTCTTTG 
GACCATCAGT 
CTACGAAGCC 
AGATACTCCA 
CGGAATGAAC 
CTATCGTAAA 
CAATCTGTAT 
CTTCTCCGTT 
GCGACTGCTA 
CAAAAGCAAT 
CGGCGGAGGA 
TCGTCCCGAA 
CGTGAACCTC 
CCCCACGACT 
CATGTACTCT 
TGCTATTTTC 
GGCCATTACG 
AATGGGGTAT 
GATGCAATTC 
CAAGAAGTAC 
CACGGGAGAA 
CGGGACAGGC 
CTATGTAGGC 
GTTCAATCCG 
CGAAGCCAAC 



ATCCTTTTTG 
CAGCTGTATA 
CAACAGGCTC 
GGCAGTGCAA 
GTGGGGGATT 
TTCATCAGTC 
CACGGGTACT 
AAAGCCAAAA 
CCCTTACCGA 
ATCAGGAAAG 
GCCCTGCTGA 
GATACCCTCC 
CCCCAAGCCA 
GGAGAAAGCC 
ATGCCGGTTC 
CGGCAGAAAG 
ATCGATCTCA 
ACCACCCTCG 
GACTTCATCG 
GAAAATCTTT 
AATAGCAGCA 
TCGTTTCCCT 
ACGACTTTTC 
TTCGGCTTTT 
CCGCTCAAGC 
GCGAACAATC 
ATCTATACGT 
GGACTATCCG 
AGCGACACCA 
CTGCGCTATT 
GTGATATATA 
GGTGCCAATA 
■3ATTCCGACA 
GTGGAATATA 



TCCTACTGAC 
TCGGAATGGG 
TGACCGAAGT 
GTGCCTCCTT 
CCACTGTCAT 
AGGTCAAATC 
TCGATGCTAA 
TCTCCTATAC 
TCAGCACTTT 
GAGACCAGTT 
GAGACAATGG 
TCGTAAGAGG 
TGCGCCCGTG 
CGACAGACTC 
GCCCCAAGAT 
ACGATGAGAC 
ATTTTTTCCA 
ACAAACCTTG 
GTCCCGGACT 
CTTGGAATAT 
ATCGGCTGAT 
CGAT TGTATT 
AGGCTT CTGC 
CGACCACCTA 
TCAACTACAA 
CGCCCCTGCT 
TCAACAAATC 
AGGCAGGCAA 
AGAATTTCGT 
CCTATACCAT 
GCTATGGCAA 
GTATCAGAGC 
AT CAGTATTC 
GAGGCAAGCT 



AGGTGCCTGC 
CAAGACACAG 
GGAGAGTACA 
ACCCAAGATA 
TTCGAAATGG 
CGATAGCCGG 
AGTAAAAAGC 
GGTGGATATG 
CCCCGACAGC 
CAATTTGGCA 
TTACTACTAC 
TGCCGTATGC 
GAGGATAGGG 
GCTCGAAGTG 
TTTGGCCAAA 
GACACGCAAA 
ACGCGATTCC 
GGATGCATCA 
GAATTTTGCT 
CGGTGGATCG 
CGATATAAAT 
TCCCGGTCTG 
CACCGCGCTG 
CGAATTTCAG 
CCTCCTGGGG 
GCTCAGCCTT 
CGTTTCAGAG 
TCTCCTGAAT 
CGGCG'fCCCC 
AGACCGCAAT 
TATGCGAGTG 
TTT CACCGTC 
CTATTTGGAT 
TTTCGGGGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1360 
1920 
1980 
2040 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 PCT/AU9 8/01 023 

128/490 



CTCCACGCAG CCGTTTT CCT CG AT GCGGGC AACGTTTGGC TCTTGAGGGA GGATTCTTCC 2100 

CGTCCGGGCG GTGCTCTGTC CGAAGTGGGA TCGGTGAGCA ATTTCCTGAA TAGCATCGCT 2160 

CTCGGCACCG GTGTCGGCCT TCGCTACGAT CTGGCATTTC TCGTGGTTCG TGTCGATGTC 2220 

GGCTTCGGTC TCCACCTTCC TTACAATACG GGTAAGAAAG GTTACTACAA TATCCCACGC 2280 

5 TTTAAGGATG CCATCGGTTT CCATTTGGCT GTCGGCTATC CCTTC 23 2 S 



(2) INFORMATION FOR SEQ ID HO: 190 

10 (ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2322 base pairs 

(B) TYPE: nuclsic acid 

(C) STRA11DEDHES5 : double 

(D) TOPOLOGY: circular 



15 



{ii) MOLECULE TYPE: DMA (genomic) 



(iii) HYPOTHETICAL: NO 
20 (iv) ANTI -SENSE: NO 

{vi ) ORIGINAL SOURCR: 

(A) ORGAIIISH: PORYPHYROMONAS GINGIVAL1S 

25 (ix) FEATURE: 

tA) NA1 IE/KEY: mice feature 

(B) LOCATION 1 . . .2322 



(xi) SEQUENCE DESCRIPTION: Si 

30 

ATGTCCTCGC ATTCCGTTCG GTATCTAATC 
GCTTCCTCCT GCTCGGTCAC CCGTTATGTG 
ACGATCGCAA GCGAAACGGG CAGTATCGCT 
CAGCAACCCA ATTACAGACT GTTCGGGATG 

35 TCGAATCCGA ACAGCAACAG CTGGTGGAAC 
GTCCTCATCG ATTCTGTCCT CACCGATCGT 
' GGCGATGGCT TTCTCGATGC TACTGCTCGT 
GCTCGCATTA CTTATCTGAT TCAGCCCGGA 
GATGTGAAGA ATCCACTCCT TCCTCCCGTT 

40 AAGGTCGGGA TCAGCGAGGG TTCTCCCTTG 
GCGATAGCTC GTCATATGCG CAACAACGGC 
TATGAAGCAG ATACTACCGT T TCAGGAGG A 
GTGGTCAATG GCATCGGGCG TTATCCATAT 
TATGATCCTC TCGAATCGGA CTTCAGAGTT 

45 CGTGGCGATT ACACTGTTTA CTATGGGAGT 
ACGCGGTCGG TGTCCGTTAC ACCGGGAGCT 
TATATCAAGC TGAATGCGCT CCCTATCGTT 
AATGGTAAGG AT GAG ATT GC TCTGGCGGAT 
ACCGTTCCGG CCAAGAGCAA ATCGTTCGAA 

50 GACTTCGGGG CGGCTTTGTC TCTCGGTTTC 
ATGTTCAATA TCAAACTCAA GGGTGCTTAC 
ATGGAATATG GGGTGGAAAG CTCGCTCCGT 
GACGAAACGC GCCGGCGGCT ACGGGCATCC 
ACACGTCCGG AGTTTGATCG GGTGATTCTC 

55 TACCTGCACA ATCGTCTGCG TCATACGATC 
CCGTACATCG ATCCCGACTT CGCCCAATCC 
ACGCAGCAGT TTATCCTCGG CTCGGCATAT 
GAGCGTACCG TATCCAATCC TTTTACGGCA 
CTGCAAGCCA TTTCTTATCT GACCGATTCT 

60 TTCGGTCTGC ACTATGCTCA GTTCGTCAAG 
CTCGAAAAGG ACAATACTTT GGCACTGCAT 
AATGCTCGCC ATATACCCTT TGAGTTACGT 
GGCTGGAGTG TCCGTACCCT CGGCCCGGGG 
TTCGATCAGA TGGGTGATAT TCGT CTG GAT 

65 TGGAAGTTTC GCGCAGCAGC TTTTGTCGAT 
GAGAATCAGG AGGACGGTCT CTTTCGTTTC 
TACGGTCTGG GGCTTCGTCT CGACTTCGAT 
AAAGCCTACG ATCCTCAGCA GACAGGGCGT 
TCTTCCAATT TCGCTTGGCA CATTGCAGTA 

70 



10 ID HO: 190 

GGCATTGCCG GCTGCTTGCT CCTCATGCTT 60 

CCGGACGGTA GCAGACTATT AGACAGGGTA 120 

CTGCCGGAAG ATATTCGGGA CTATACCCTC 180 

ACTCGCTGGC TACTGCGCGT CTATAGCAGC 24 0 

CGTTCGCTCC GGAAAATGGG CGAACCGCCT 300 

ACTGCCAACC GTCTGGCAAA GGCGATGGCC 360 

GCCGTGGTAG ACACCGGCTT GTACAAGAAA 420 

AGCCGTTATT ATATACGCAA TATGGCTTTG 400 

GCGCTTGGCA ATTCGCTTCC TTCGGCATAC 54 0 

TCGCCCATCG TACTCGATGA AGAGAGAAAG 600 

TTCTGGAAGT TCTCCGCCGA GGATGTTTAT 660 

TCGGGTACGA AATCTGCCGA TCTGAAATTA 120 

CGGATCGGCA GGGTATTCTT TCATGCCGAT 780 

CAGGAGCTGC CACGTATCGA TTCGATTTCG 840 

AGGGGACGTT ATATCCGGGC ATCGGCTCrC 900 

TTTTTCTGCG AGGATGATGT GGAACGCTCT 960 

CGGAACGTGA ATATCCGATT TGTGGAGCAC 1020 

AGCTCTCGCC TTGTGGACTG CTATATTCTT 1080 

GCCGAAGTCC TCGGCACCAA TTCCGCTGGA 114 0 

ACCGATCGCA ATTTGTTTCG TGGGGCGGAG 1200 

GAAGCCATTC GCAAGGGTTC GCACAGCTTC 1260 

TTCCCTCGTC TCCTCTTCCC ATTCATTTCT 1320 

ACGGAATGGA AGATCGGGTA TAATTACCAG 1380 

TCCGCTCAAC TCAATTATTC ATGGCAGACC 14 40 

CGCCTGCTGG ATGTCGATTA TCTCCATCTC 1500 

CTTCCGCCTA CGACTGCACT GTATAACTAC 1560 

ATACTGAACT ATACCACGGC TTCGTCCATG 1620 

CGGTTCAGTA TCCAGACAGC CGGCAACCTG 1680 

CCGAAAGACG AACACGGGTT GTATAAAATG 17 4 0 

CTCGATCTCG AT CTGGCTAA AACCGTTCTT 1800 

CTGGGTTTCG GACTGGCTTT CCCTTATGGC I860 

TACTTTGCCG GACGATCGAA CAGCGTTCGC 1920 

AGTATGAAGA TGACTCCGGA CAAGACCTTC 1900 

CTGAATGTCG AATACAGGAC AAAGCTGTTC 2040 

GCCGGCAATG TCTGGACGAT AAAGGAGTAT 2100 

GATCGCTTCT ACAAGGAAAT AGCTTTGGCC 2160 

TATTTCCTTG TGCGGCTGGA TGCCGGACTG 2220 

TACAAATGGG CTATCACACG CCCAAACCTT 2280 

GGCTATCCGT TC 2322 



(2) IN FORMAT I OH FOR SEQ ID HO: 191 

(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 2601 base pairs 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



129/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(U> TYPE; nucleic acid 
(C) STRANDEDNESS: double 
(0) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<1v) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( lx ) FEATURE : 

(A) NAJ IE/KEY: mlsc_f saturp 

(B) LOCATION I 2 601 

SEQUENCE DESCRIPTION: SEQ ID NO: 191 



ATGAGAAAAA 
CTCGCCATAG 
CTCATCGGTG 
ATGGATGGTG 
GGTTATGCTA 
CTCCTCGATC 
AAAGACCGTA 
TCGTTGAATG 
GGAAGCGGAG 
GGTGTACTCA 
AATTGGAGTG 
TCCAAGCTCG 
GCCAACACCG 
TTCTCCATTT 
ACGGGTCTGG 
TCGAAGAAGT 
CACAACCAAC 
AATCAATCCT 
TACCACAAGC 
TATACcgCAN 
AGTAAGTGGG 
CCCGATGGAC 
TCGGAAGCAA 
TTCAAGAAGA 
CGTGGCGACC 
CCCAAGACAA 
AATCGGGACT 
TCCGAATGGA 
AACTATGGCG 
CTTCCGTGGA 
GCCAATGGCG 
ATCATTCCCA 
TTCACGAATC 
CGCGTGACCT 
CACTGTGGGG 
ATGTT CTCTC 
GCCGGCAATG 
GCACAGATGA 
GGTAAGTACA 
CAGCAGTACG 
CTGTTCGATC 
TTCAACATGG 
GGTAAGAAAC 
ACCGGTATTC 



GAATTCTACA 
CjCAAACAGT 
TATCCGTAAG 
GCTTCCGATT 
CCGTAACTCG 
CCCAGGCCAT 
TGACGCCGGT 
TCGAATTTCC 
GTTTCGGTGA 
TCAACGGAGT 
GTCTGATGAA 
GTATCAGCTC 
GAGGTTCGGC 
CTACGGGTAT 
GTTATGTGAA 
TCAATGAACG 
GTTCTTCCAA 
TCGGCTATCT 
CCCAGTTCTC 
CCTACGCATC 
TATTGATCAA 
TTATCGACTA 
TTTTTGCCCT 
AACTTAATAG 
ACTATGACAA 
AGCTCGCATA 
ACACAGGCGA 
TCGATGCATT 
GTAGCAAGTC 
GTGGCAAGGC 
GTTTCTT CAC 
ATGACAAAGC 
ACAAAAACTT 
CGAAGAGAAT 
TAGAGGCTGA 
TCGGTGACTG 
AGACAGGGCA 
CGGCTGCTGT 
ACTTCCTTGG 
AAGCGGATGG 
TGTCTGCATC 
ACAACGTAGC 
ACGATGAGGC 
GTGTAAACTT 



ACTTTTCCTG 
GGTGACCGGT 
CACCGGTCAG 
CGAAGTACCG 
CTCTATAGGC 
CGGCTTGGAT 
ACCCGTTTCC 
CGAACTGGTT 
TGGTCGTACC 
TCCTGTCAAT 
TCAAGCCAGT 
GGTAGGTGGT 
TTATGTCGGT 
GAACGACGGT 
GGGGCTGAAG 
TCATACCCTC 
ATATTCTGTA 
GCGAGGCGAA 
GCTGAACCAC 
TTTGGCTACC 
CTACAACACC 
CGATGCCGTA 
TGGCTCCAAC 
TTCGCTGACT 
GAT CACCGAT 
CCATGCGGAA 
AATCATGTGG 
CGTAT CAGGA 
CACCGGCTAC 
AGGTCTGAGC 
ACGTGCACCA 
CAATATGGAA 
CGAGTTCAAT 
CGGAAACGAG 
GGTCAGCTAT 
GACTTGGCAA 
GGATATAACC 
ATCGGCAGAC 
CAAGAACTAT 
CAAAGAAATC 
C1ACAATTTC 
CGACAAGCGA 
TTCGGCTCTC 
C 



ACCGCATTGC 
AAGGT GATCG 
GGAGCATCCC 
GCCAAATCTG 
AGAGGTTCTC 
GAGATTCAGG 
AATATCCGTG 
AAATCCACTC 
AATGTGCGTG 
GGTATGGAAG 
ACCATTCAGA 
ACGATGAACA 
ATGGGTAATG 
TGGGCTATCA 
GGACG TGCAT 
TCTCTTACCG 
GCCGACTATG 
CTGACTCCTA 
TTCTGGAAGA 
GGTGGAGGTC 
GGACAACCCT 
CTGGCTGCCA 
TCTCACAAGT 
TTGACAGCCG 
CTGCTCGGCG 
GGTCAGCAAC 
CACGGCCTCT 
TCTATCAACT 
CTGCCCGGCG 
TACAAGTTCG 
CTCTTTGGCA 
AAGGTGCTTA 
ATCAACGGAT 
TATGTTTATC 
CGTCCTATTC 
AACAATGTAA 
TATATCAAGG 
ATAGAGCTGT 
GCAGGATTCA 
GTGGAATCAT 
AAGCTTGGTT 
TATGT GAGCG 
GTATGGTACG 



TGCTGGCATT 
ATTCAGAAAC 
TCCGCGGTGT 
TCTTGACTTT 
AAGAAGACCT 
TWTAGCCTC 
TGGCTGATAT 
CCTCTACCTA 
GATTCGACAC 
ACGGGAAAGT 
TTCAGCGCGG 
TTATCACGAA 
ATGGATTGCA 
CCATTGCAGG 
TCTCTTACTT 
GATTCGGTGC 
ACAAATACGG 
CGGCTTATGC 
TGGATGAAAA 
GTCGCGCTTA 
ATGAACAAAC 
ATGCTGCGGC 
GGTTCGGTCT 
GATACGATGG 
GTAGCTACTA 
TGAAAGTGGG 
TCGCACAGAT 
ACGAACTATA 
TATCGCCGTG 
CACAGGGACA 
ATATCTATGC 
CAGGAGAGGT 
ACTATACGAA 
TCAATGGCGT 
GTCAGATCGA 
GTTACACTTC 
GTCTTCACGT 
TCAAGGGTTT 
ACCCCGCAAC 
GGAAGTTGCC 
CACTCAGCAC 
ATGCCGACGA 
GTTTCGGCCG 



AGGCTCCTCT 
GTCCGAACCG 
AACCACCGAT 
CCGTT GCGT A 
CGGTACGATT 
TGTGGTGCCC 
TCAGGCAGCA 
TACGACAAAA 
TTACAACTTC 
ATATTGGAGC 
ACTCGGAGCC 
GACTACGGAC 
CAAAGAATCG 
CTCCCATATG 
CTTCAACGTT 
ACCACAATGG 
CATCCGTCAC 
TTACAATACG 
TACCTCTCTT 
TGGAAAGAAC 
AAAGGTGACT 
GAGCAATGGC 
ACTCTCTTCA 
GCGTTACTAC 
CATAGAGGAT 
TGACAT TGTA 
GGAGCATTCG 
CCGCAATCA'J 
GAAAAGCTTC 
CAATGTATTC 
TGCGGGGGCT 
CG G CT AT GO A 
GTGGATGGAT 
TGATGCTGTT 
CCTTCGCGGT 
TTACGACGAA 
CGGAGATGCA 
CCATGTCATA 
GCGTAATGCA 
CGATGTACGT 
CACATTCTAT 
CAATATCATC 
CACTTGGTCT 



60 
120 
180 
240 
300 
36C 
420 
400 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
Z4C0 
2460 
2520 
2580 
2601 



(2) INFORJ1ATION FOR SF.Q ID NO: 192 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

fil) tlOLECULE TYPE: DNA (genomic) 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



"H 'i!"? 1 =iiL iD liUji it; H»-$ ^Hi: iw,'i£-«~iLj iLJ* V^jM^-S?^^ 



WO 99/29870 



PCT/AU98/0I023 



130/490 
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75 



till) HYPOTHETICAL: MO 
(IV) ANTI-SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIIAS GINGIVALIS 
( ix | FEATURE : 

(A) NAME/ KEY : misc featur- 

(B ) LOCATION 1...1293 

txi) SEQUENCE DESCRIPTION: SEQ ID NO:192 



ATGAAGTTTT 
CCTGCTCTCG 
CTCAAAGCCA 
AAACAAAAGC 
CTCTTGGACA 
CATCACCTCT 
ATGCAAAAGC 
GACGAAGGCA 
TCTGTCCGGC 
GCCAAAAAGG 
GGACAGCAAG 
GAAGCGCAGC 
CAGATTGCCA 
GCACGCGAAG 
ACCAAAGGCG 
AACAAAGGTC 
GTGCATCAGC 
GTAGCAACAG 
ATACCCGGTT 
AATCTGAGCA 
CGTGCCTATA 
CGCAGCAAAC 



CAATCCGCCT 
GACAAAAATC 
TCGAAAAAAC 
ATCTCAACCT 
ATGAGGTCAA 
CTGTAGAAGA 
GGAAACGCTC 
TGCGACGGAT 
TGCGCGATAC 
AGAAAGGACA 
CCGAGCAACG 
TGCGAAAGCA 
AGGAAATAGA 
CCAAAGCCAA 
GCTATGCTAT 
GCCTGCCCGG 
ACAGTGAGCT 
G ATCCGAT GC 
ATAATTCGGC 
AAGT GTATGT 
CGGATCCTTC 
AAAACCCAAG 



TTTCCTCTGC 
CAAGCAGGTA 
CGATCGCGAA 
CCTGAACAAG 
AGAGTTGCAA 
GAAAGCCCGA 
GTTGGATCGC 
GCGTTTCTTG 
ACGTAGCAAG 
TCTCTTAGTC 
TCGGCAGGTG 
GAAAAAGCAA 
ACCTGCCGAA 
GGGTAAGCCG 
GGATGCCTCT 
CCCCGTTCGC 
GAAAAAAGTA 
TACCAGCGTA 
CGTAATGGTT 
AAATTCCGGC 
CAACAACCAG 
ACT AT GGTTA 



ATCATCTTTC 
CAGCGACTTG 
CTACGAAATA 
CAGGTTGCTC 
TCCGACATTG 
TCCGATGAAT 
ATCCTTTTCA 
GAACAATACG 
TTGGAGACTG 
ATCAGAGAAG 
CAGGCTTTGG 
GCCGAAGCTC 
CGTCGTGCTC 
GTTCCTGCCG 
GAGCGTGCTC 
G GCAGAT AC C 
CAAGTTAATA 
TTCGATGGTG 
CGTCACGGTA. 
ACTCGTGTTA 
ACCATTATTC 
CGA 



TCCTCTCTGC 
AGAAGCAACG 
CCAACAAAGA 
AACGCAAGCA 
ATTCCATGAC 
ATGCCCAAGC 
TTTCATCGGC 
CTTCTGCATA 
AACGTGCGAC 
AGGAAAAAAA 
GAGCCAAACA 
TGAACAGAAA 
GAGAAGAACG 
AACCGGAACG 
TCTCGGGCAG 
GAATCGTAAG 
ATGGAGGTAT 
TAGTGTCCAG 
ACTATATCAC 
AAACGGGTCA 
ACTTCGAAAT 



ATTTATCCTG 

TAAGGAGGCC 

CAAGCAAGAC 

GATGGTACAA 

GGGTGTATGT 

TCTACAGTCT 

CAAGAGCTTT 

CAAGCTGGCA 

TGTAGAAGAC 

GAAACTCGAA 

AAAAGACTTG 

GATCGAGAAA 

TGAACGGTTG 

GAAGGCGGAG 

CTTTGCACAG 

CGACTTTGGC 

CGACATCGCT 

TGTATTCGTG 

GGTTTATGCG 

GGCTCTTGGT 

CTGGAAAGAA 



(2) INFORMATION FOR SEQ ID NO: 193 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (yeriomic) 

(iii) HYPOTHETICAL: HO 

(iv) ANTI- SENSE; WO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: POR Y PH YROMOHAS GINGIVALIS 

(ix> FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1 . . . 99 9 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 

^^™ T ATTTGTTATA TGCCTCGTTG CTAACGAGTG TTTTGCTCTT 
S S ™£55???? GGT GGAAGAC AGATCCATCG 
«A 1 TT CACCA AAACOGGTGA GGCAGTACGC TATGAAA&SA ATr-JlArr-imr- 
AGGCTCATTA CCAATCTTTA CCTCTTGTTG TTCGATCAG? "IS^r™. 

CCGGTAACAC TTTCAcSg'a GGGACCTGGC ^ 
G^H^O-* I2 A = ACAATC CGAGGCCGGA GAGCGCAAAG TATATGTCGT 
AAGAGGACCA S^S^ TCTTGAT6CT GTCGCTAACG AAAGCGATTT 
M^S^AC S?SS2S SE 08100 ** GATATAGCCT CTCCTTTCCT 
ATTr?™?^ acgacttctt GGCCAATCGT cttttggaca ATGTGCCCCT 
ATTGCCAAoG TGGAGCTGAA TATCTCGCTG AGT GAGAAAT TTCAGAT^rT 
GTCAATCGTA GTTTGAGTGA GTTCAAGTTC AGATA^GTAA ACTralSI 
^AGTGAAGC CAACGACCAA GCCGGACAAT CTCATTAGTT ^TGCTAATGG 
CAGATTACAG ATTGGACTGT ATGGGGTGCT TCCTTAAATA CTTCT^r 
GGCAWGGTT ATACATTGGA TGCAAATGGC AAGGTAACGG 
CTGAATGAGC GCGATAGCAA AGGGGCTACG GTAGAGGTCG CATTGCCTCG 



TTCCTGTTCA 
AAGGGTAGAT 
TGCTGCCGAA 
TCCGGCGAAA 
TATGAAGGTG 
AGCCAATGTT 
GCAGACTGTA 
GAT GTCCGGA 
TGTGCGTGCC 
GCCGATAATT 
GGAGACCTAC 
TGTTTGGCCT 
TCCGGATGCG 
TGTTACCTAT 
TGTGGATGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
€00 
660 
720 
780 
840 
900 
960 
102U 
1080 
1140 
1200 
1260 
12 93 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
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GGCACCCTTC CTCCTCCGGA ATTCGGTCCG GAGCTTTATC GTTTGCCTTT GCCGGACAAG 960 

ATCCTGCGCA AT CAT TGGTA CAAGTATGAA GTCGAGATT 999 

5 (2> INFORMATION FOR SEQ ID Noels'! 

<i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 945 base pairs 
<B) TYPE: nucleic acid 
10 (C) STRANDEDNES3: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA {genomic) 

15 (ill) HYPOTHETICAL: NO 

(W) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISI I: PORYPHYROMONAS GIHGIVALIS 

( ix ) FEATURE : 

IA> NAME/ KEY: mlsc_f eaturs 
(D) LOCATION 1 . . .945 



25 



45 



55 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 19<J 



ATGATCAGAA CGATACTTTC ACGATATGTA TCCTCGAACT TTTGGAGTCG GGGAGCTACC 60 

TTTTTTTTCA CGATTTTCCC GGCCTTCATC CTCGCCGCTA CTGCTTTGCC GGCTTGTGGA 1^0 

dU GGGGGTACTG CTTCAGGCTC CGATCGTACG CTGGCTGTGA CCATCGAGCC ACAGAAATAC 160 

TTCATCGAGT CCATT GCGG A TAAGTCGGTG CAGGTGGTGG CATTGGTACC GGCCGGCAGC 24 0 

AATCCGGAGG AATACGACCC TTCGCCTACC GTGATGAAGC GTTTGTCCGA AGCAGATGCC 300 

TACTTCTATA TAGGAGGACT GGGGTTCGAG CAAAGAAATC TCGCTGCCAT TCGGGACAAT 360 

AACCCTAAGC TCCCTCTTTT CGAAATGGGC AAAGCCTTGG CGGATGCCCG AAGTGCAGAT 420 

35 CTCCACGGCT CCTGCACAGA TCATT CTCAT ACAGACCTGC ATGCCCATGA TCCGCACTAT 4 80 

TGGAGCAGTG TGGTAGGGGC AAAGGCACTC AGTCGTGCTG CATACGACGC GCTTGTGGAG 540 

CTTTATCCGA ACGAGAAAGA CAAATGGGAC AAAGGGCACG ACCGTCTCAA CGGACGTATC 600 

GACAGCGTGA AGAGACTCGT CGATACCATG TTTGCCAATG GCAAAGCAGA CAAAGCCTTC 660 

G TC AT AT AT C ACCCATCGCT CAGCTTTTTC GCCCAAGAGT TCGGCCTGCG GCAGATCGTC 720 

4U ATAGAGGAAG ATGGGAAAGA GCCTACGGCT GCCCACCTTC GT CGTGTG AT CGATCAGGCA 7 60 

CGTGCCGATG GTGT CAGAAT CGTATTTATC CAACCCGAAT TTGAAACGCG TCAGGCGGAG 84 0 

GACATCGCAC GCGAGATCGG TGCTCGTCCG GTAAGGATCA ATCCTCTGCG CAGCTCGTGG 900 

GAGGAGGAAA TTTTACATAT TGCTCGCGCT TTGGCTCATG AACGG 945 



(2) I N FORMAT I ON FOR SEQ ID NO: 195 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2544 base pairs 
50 (B> TYPE: nucleic acid 

(C) STRA1IDEDHESS: double 
( D I TOPOLOGY : cl rcul a r 



til) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
tiv) ANTI-SENSE: NO 



60 (vi) ORIGINAL SOURCE: 

(A) ORGANISI I : PORYPHYROMONAS GINGIVALIS 

( 1 x ) FEATURE : 

(A) NAME/ KEY: misc_f eature 
65 (B) LOCATION 1...254 4 

(Xi> SEQUENCE DESCRIPTION: SEQ TD NO: 195 

ATGATCGGAA AAAAAATCTT TTTTATCCTG CTGGCGCTCA TTGCGTTCAG TGGGCTGAAC €0 

7U GCAGCGACAG ACACTGAGTT CAAGTACCCG ACCGATGCCA ATATCATCGG TCACGTCAAA 120 

GACAGCAAGA CGGGTGAACA CCTTGTCGGT AT CACTATTG CTATCAAAGG CACTACCTTT 180 

GGTACATCTA CAGATGCAAC CGGGCACTAC TATCTTCGTA ACTTGCGTCC GGGTGAGATC 240 

ACTTTGATTA TGCGTGGCAT GGGCTATAAG AGCCAGGAGC GCGTAGTCCG CGTAGAAAAG 300 

GACAAGACTA TCGAGGTGAA TTTCGAAGCA GAAGAGGATG CCATCAATCT GGACGAAGTC 360 

/i> GTGATTTCGG CCAACCGCGA ACTGACGCTT CGCCGTCTTG CTCCTACTCT GGTAAATGTA 420 
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10 



15 



20 



25 



30 



35 



TTGAACGAAA 
CAGCCGGGAC 
AATGGACTGG 
CTTGCCGGTG 
GTACGTGGTG 
ATCACCAAGG 
TTCAGCAAGC 
GCCGGTGCCA 
GGTTATTCCG 
TTGAGCGACT 
GGCGATCGTA 
GTATTTAGCG 
GCTTATACTT 
GACGTCAATG 
AATAATTATG 
TTGGACAAAT 
GAACTCAATG 
ACCATTCCCC 
AACGAATGGA 
GAAGTCAAGG 
ATCAACCTGC 
TTGCACGTAG 
CCTGAAATTT 
CAGACCAACT 
GAGCAGCCTC 
AAAGTATTCG 
GGTCTTACCC 
AAAGACACCA 
AACGAATCCA 
TTGGCCTACA 
ATGTATGTAC 
AACAATCCTG 
ACACCTGCAT 
ACTGAGGTTC 
GATCGTGGAG 
TACATGGGCT 



AAGTCTTCTC 
TTCGTGTAGA 
ATGGTCGTTA 
TTTACGGTCT 
GAGGATCGGC 
AACCTTCTCA 
TGGATAACAA 
TGGTATTCGG 
AATTGGGTAA 
ACAGCAAATT 
TCGATTTGCC 
GAAACTTGAA 
CCGGACAGAT 
GCCACCCCGG 
GCGTGACCAA 
TCCTCCTCAT 
ACGTGATGCC 
TCTATCCCGA 
AAAATGACAG 
ATATGATTCT 
GCGCTACATA 
GGGTTGTAGG 
CTCATGCATT 
TCCTTGTGGA 
ATCAGCACGA 
GTCTCAATCT 
TGGCCAGCAA 
ACGGAGCTTT 
TGACGGATAC 
ATCCTGCTCA 
CCCACGCTAT 
AGATTACCGA 
TCTTCGATTT 
AACTCTATGT 
CTGCACGTGA 
TGGTAGTGAA 



GCAAGTCAAT 
G AACAACT GT 
TGCACAGATC 
GGAGCAGATC 
CTTGTACGGT 
CAATTCTTTC 
CACGAACTTC 
GCAGGCTCGT 
AATAGATGCC 
GACGGGAGAG 
TCCTCACGTA 
ATACGATCTC 
CGTAAATCGC 
TGGTACGGAA 
AGGCAAGACA 
GCCTTCGCAA 
CATCCTTTCA 
ATTGGATCAG 
ATGGAGCATC 
GAGTCCTCGT 
TGCAAAAGGG 
CGGTGAGGCA 
CAGTTTGAGT 
AGGCTTCTAT 
TGGCATCAAA 
CGAAGGTAAG 
CAAATACGAC 
TGTTACCGAG 
GCAGATCACC 
CTCATGGAAC 
CGAATATGGT 
CGAAACCGGA 
GGGCTTGAAA 
AGGTATGAAC 
CAGCGGATAT 
GTTC 



GCTTCTAACC 
CAGAACTGTG 
CTCATCGACA 
CCTGCCAATA 
TCTTCTGCTA 
ACATTCAATG 
AATGCCTCCA 
TACCGCAACC 
CGCTCGCTGG 
TTTCACACGA 
GTGGGTGTAG 
TTCTCTTCCA 
AAGAGCTATT 
GGCTACCCTA 
TATATGGGCG 
CTTTTGTTCG 
TGGCAGACCG 
AATATCAACA 
CTTGTTGGCG 
ACCACACTGC 
TTCCGCGCAC 
CAGAAAGTAT 
GCCGATATGT 
ACTCGTTTGC 
CGCTACACGC 
GTCGCATACA 
CAAGCACAGG 
GCCAATGCAA 
CGTACCCCCA 
ATAGCCCTTA 
GTGAAGTCTG 
AAGGCTCCCC 
GTGGGTTATG 
AATATCTTCA 
ATCTATGGTC 



TGGCTCAAGG 
GTTTCAATCA 
GCCGTCCCAT 
TGATCGAACG 
TTGCCGGAGT 
AATCTCTGAG 
TCGTCAGCGA 
ATTGGGATGC 
GAGCGCATTC 
TCAGTGAATT 
CTUAACAAAC 
ACTATAAACA 
ACGGAGGTAT 
TCCCTCAAGA 
GTATCCAGTA 
GAGCCGAATA 
GCGAGGATGC 
ACTACAGCCT 
CTCGCTTGGA 
GTTT CAACGT 
CGCAGGTATT 
TCAACGATCC 
ATCATCGTTT 
TGGATGTATT 
GTATCAACGG 
AGTCCTTCCA 
AGTGGGGTCT 
ATGGACAACA 
GCGTATACGG 
CGGGAGCATA 
CCGAACTGGA 
GTATTGATGA 
ACTTCCACGT 
ACTCTTTCCA 
CTACGCAGCC 



CTTGTCATTC 
AGTTCGTATC 
CATGAGTGCC 
TGTGGAGGTA 
GGTGAATATC 
CTTTACCGGT 
TGACAACCGT 
TAACAATGAC 
TTATTTGCGC 
CCGCCGTGGT 
TGACCATAGC 
CCACTTCCAG 
CG G A GAG ATT 
TCAATACGGC 
CAGCTACGAC 
TACGCGTGAT 
CAATGGGAAT 
ATTCGGTCAG 
CAAGCATAGC 
GAATCCGGAC 
CGATGAAGAC 
GAACCTCAAG 
CGGTAACGTC 
CACCAACGAG 
TAGCGGAGCC 
GOTCCAAGCC 
GAATACGGTG 
GGAATACAAG 
TTATTTTACT 
TACCGGTCAG 
TATTATGCAG 
GCTGAAGAAG 
ATTCCAGGCT 
GAAGGACTTC 
GCGTACAGGC 



4S0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
254 4 



40 



45 



50 



55 



60 



65 



70 



(2) INFORMATION FOR SEQ ID WO: 196 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

liiil HYPOTHETICAL: HO 

(iv> ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE; 

(A) ORGANISE: PORY PHYROMONAS GINGIVALIS 

tlx) FEATURE: 

(A) NAME/ KEY : misc_feature 
(BJ LOCATION 1...606 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 



ATGACAGTAA 
TCACCTTCTC 
AACAAGATAC 
TCGGCATTTC 
TATCGCTATC 
TACTCCAAAT 
CCTATCGAGC 
CGCAGCAACC 
ATGATGCACA 
TTCTCCCGTC 
AAAAGT 



AGCGCGCAGT 
TTGTTCGGGC 
TCTATGGAGA 
CTTTTATAGA 
GCGGT CCTTC 
TCGACATCAA 
GCGAGGATGT 
GTATCGGGCA 
GCCGCAATTC 
GCTTGGTGAG 



GCGAATAGCA 
GCAAAGTCTT 
GTCGGCGGCG 
CAAACTCATC 
CCCATCGCCG 
ACTCCCACGT 
TCGTCCGGGC 
TGTAGCTTTG 
GCGAGGGATC 
CTATGGCAGG 



CTTCTCACGC 
TTCAGCACCG 
GATACCGTCG 
AATCTCGGCC 
ATGGACTGCT 
GGTGCGGCAG 
GACCTCCTTT 
GTCGTATCTG 
GTGATCGAAA 
GTACCCGGAC 



TGATAGGCAT 
AACATGTCTT 
CAGAGAAAAC 
GC ACT TTCCT 
CGGGCTATGT 
CACAGAGCCA 
TTTTCAAAGG 
TCGATGAAGA 
AACTCAATCG 
CCAAGAGAGT 



TCTTTTTTCC 
GCAACTATAC 
GGCAGGTGAG 
CGGCAAACCA 
GTCTTACCTC 
ATATACGAAT 
CCGCAATGCA 
TGATATTACC 
CAGTGCATAC 
GATCCCACGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
606 



75 (2) IMFOR11ATION FOR SEQ ID NO: 197 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



Cji „JL ICL Ci^ hi sL.ij ni.li It", iL.fi'i] 
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ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

tli) MOLECULE TYPE: DNA (genomic) 

10 tiiiJ HYPOTHETICAL: NO 

(iv) AHTI —SENSE : NO 

(vi) ORIGINAL SOURCE: 
15 (AJ ORGANISM: PORY PHYROHONAS GINGIVALIS 

(Ik) FEATURE: 

(AJ NAME/ KEY: misc_feature 
(B) LOCATION 1...1365 



20 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 



ATGAAACGGA CAATCCTCCT GACGGCACTG ACCGTCCTAT CTTCGCTCTC CTTGCTTCGT 60 

GCACAAAATG AATCCGAAGC ATCAACCAAT CCGATGTCAG GCCTCTCCCT C GAAGACTGT 120 

25 ATCCGGATAG CCAAGGAGCG CAACCTGAAT CTGCGCAGAC AGGAGATCGA ACAAGAAAAC 180 

CGAATCATTA GTCTCGATGC AGCAOGACAC AGTTTCCTGC CCTCGGTCAA T GC AGGCATC 24 0 

GGACACAACT ATAGCTTCGG ACGTTCGAAA GACAAAACGG GAGTAACCGT AGATCGCTCC 300 

TCGATGAATA CCAATCTCAG CATCGGAGCT TCGGTGGAAG TATTCAGCGG CACACGTCGT 360 

CTGCACGACC TCAAGCAGCA AAAGTACAAC GTGGAGGATG GTATAGCCCG ACTTCAAAAA 420 

OU GCGCGTGAAG ACCTCAGCCT GCAAATCGCG GCTCTCTATA TCAATTTGCT CTTCCGTCAG 4 60 

GAAATGACTC GTACGGCAGA AACACAGTTG GCACTGATTC GCGAGCAACG CAATCGCACG 54 0 

GCCGAAATGG TTCGCGTAGG TAAAT GGGCA GAGGGTAACC TCCTCGACAT AAATGCCCAG 600 

ATGGCCAAGG ACGAACAACT TCTCGTACAA TATCGTTCGG AGGAGGAGCT GGCTCGTCTG 660 

GACTTGGGGC AAGCCCTCGA ACTGGAGCAC CCCGAAAGCA TTGCAGTCAA GGCTCCCGAC 720 

JD ACAGACGTTC TCGTAGCAGA AAGGTTGGGA TCTCTCCTTG CTCCCGAAGA GATCTATCGC 780 

ACGGCTCTCG GCTTGAAACC GGCACTGCAT TCGAGCGAGC TGCAAATAGC TTCGGCACGC 840 

GAAGGTCTGG CCTCGGCTCG TGCGGCATAC TTCCCGACGC TCAGCCTCTC TG C CG GAT AC 900 

AGCAACGGTT ACTTCCG CG A CCTCGGCAAG GAGTATGCCG CCATCAACCC CTCCTTCTCC 960 

GAACAGTGGA AGAACAACGG CAGCTACAGT ATCGGACTCT CTTTGAATAT CCCCATCTTC 1020 

40 TCTGCCATGC AAACGCAAGA TCGCGTTCGG AGCAGTCGCC TGCAAATACG CTCAAGCGAG 1080 

CTTCGACTCC TCGAAGAGAA AAAAGCCCTC TATAAAGAGA TCAGGCAAGC ATACAGCAAT 114 0 

GCCGTGGCAG CCGATAAGGC CATCGCAGCA GCCGAAAACA GCAAGGCCGC TACGCTCAAG 1200 

GCATACGAAT ACGCTCGCGA CAGCTTCGAG GCAGGGCGCT TGTCTGCCTA CGAATATGCC 12 60 

GAGGCAAAAA CAAAATACGC CCTCAGCCAA GTGGAAGAAC TTCGTGCCAA GTATGACTTC 1320 

45 ATATACAAAG CCAAAGTTTT GGATTTCTAT CAGGGCAAAG ACTTC 136b 

(?) INFORMATION FOR SEQ ID NO: 198 

50 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1332 base pairs 

(B) TYPE : nucleic acid 
{C> STRAhJDEDNESS: double 
(D) TOPOLOGY: circular 

DO 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

60 |iv) AHTI -SENSE: NO 

(vl> ORIGINAL SOURCE: 

(A) ORGANISM: POKYPHYROHONAS GINGIVALIS 

65 ( ix ) E*EATURE : 

(A) NAME/ KEY: mlsc_featurc 

(B) LOCATION 1...1332 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:198 



ATGCGTTTCC AAC ATT AT CT CAT CT GTACG GCTGCCGTAG CGGCTTTGGC TGCGAATCCC 60 

CTTACGGGCC AATCGAATAT GACCCTCGAA GAGTGCATAG ACTATGCACG CCGGCACAGT 120 

TCGGCCGTGG CGCTGTCCGC TGCGGAACTG GAGCAGTCCA AGGCCGATTA CCTTCAGGCC 180 

GTCGGCAATT TTCTGCCCCG TGTATCGCCC GGAACCGGTG CTTCGTGGAA TTTCGGACGC 240 

/O GGATTGGATG CCGAGACGAA TACCTACACC GACATCAACA GCTTCAACAA TTCGTACAGC 300 
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3 GO 
420 
480 
540 
600 
660 
720 
780 



ATACATGCCA CG AT GACCCT TTTCGACGGT TTGCAGAGTG TCTATCGACT GCGGATGG03 
£2££^ SSSST^ GCSCC **«= GTTCGCGAGC aSaGGAgS „ 
CA^A^n^ SS^ST*™ CGAOTCGTC TATGCGCGCC AAATGCAAGA GCTGGCCATG 
Si^SI — AGGAGAGCAG CCGCCTCCAC CGGCAGACGG CTCGAATGGA AGAGCTGGGG 
ATGAAGAGTC GTCCCGATGT CCTCGAGATG CAGTCGCGAA TGGCCGGTGA CCGTTTGGCC 
£££££ ^r™S GT GCATCATC CCTCTGATCC GGCTCAAAGA AAlJSKS 

I^S^SIfS ATGACGAACT ^tcgtagac gatatgccgg ctgacagtct ctccgccgac 

ATGGCCGAAT CGGACAGCTC GGCCGGCGTC TTCGCCCGTG CTGCCCATCA TCATCCCGTC 780 

^™ ££££££ ^2?™ G <^GACC GTTTGCGAGC S£££SS «!o 

>^™2^ CGAGTGTGTC GGTATCCGGA GGATGGAACA CGGGATTCTC ACGCTTTTTG 900 

A^S^ ™iS5S CTTCAGCGAG CAGTTTCGGA ACCGTCGGGG ._ I™ 

Gtt^Scr CATCTTTTCG GGATTCAGCC TTGTGAGCCA TCTGCGTCAG 1O20 

SSSSSSS AACGCAGGGC GGCAATCGTC CGACGGGGCG AAGCGGAGCG CAGGCTCTAC 1080 

AGCGAGATCG CCCAAGCCAT GGCCGACCGG GATGCCGCTC TGGCTTCCTA CCGCCAGGCG ll!o 

£cSta GCAAACCGCT TACGAAGCCG TCTTGCAGCG T ™ J^O 

S^S?™ v CCTGACCACT CAGGCCAATC GGCTCCTGGA TGCCCGTGTG 1260 

S^SSSS a2 ^ GACCTACCGG AACTCATAGC CTATTACGGC 1320 



1332 



(2) INFORltATION FOR SEQ ID NO: 199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2820 base pairs 

(B) TfPE: nucleic acid 

(C) ST HANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic} 
Uii> HYPOTHETICAL: MO 



(iv) ANTI- SENSE: NO 



(vil ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHOMAS GINGIVALIS 

( ix J FEATURE : 

(A) NAME/KEY: misc feature 
<B> LOCATION 1-..2820 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 199 



ATGAACAAAT 
ACTGCACTGA 
AGATCCGCCG 
AATCCGGAAG 
CGGATAGGAC 
TCCATCGGAG 
ACGCTTTATT 
GACCATGAAA 
GCCACAGAGC 
ACTTTGCCTG 
CCCGTAACGG 
AACATCAATT 
ATCATGGTAA 
GGAGACTTTA 
GGTGTAACGC 
TGCAGCAATG 
GCTTTCCTCC 
CCTCTGCGCT 
GGTGCCGGTA 
CCGGCTCTGA 
TTCAAATACG 
AATAAGCACG 
TACGGCAGAC 
GACATCTATC 
GACGGTTATA 
GTCGAATTGA 
TACCACATAT 
GCCATCGACG 
ATTTATCCCT 
GACCTTGCCA 
GGAGGAGTAT 
GAAAGCCCCA 



TTTACAAATC 
CCGCTTCTGC 
GAACGCATTC 
ACCTGATCGC 
AAGTAATACC 
ACGTAGATGT 
ACGATGCATT 
TTGTGTTGGG 
CGGTACCGGG 
ACATCAAGAT 
ATAACCATTA 
GTCCTGAAGG 
AAGGACAGTA 
CTCCGCTGAT 
AATCCGAGTT 
GTACATTGGC 
CGATCAAAGG 
ATCGTGTCTA 
TTCATCATCC 
ATACATGGAT 
ATCAAGGTGG 
TGGTCGGCAC 
TGAACAGTCA 
TGGATCCCCA 
AGCCTTTGCC 
AT TGGACGGC 
TCCGAAATGG 
AAAGTATTAT 
CGCCGTTGGA 
TAGGAGACAT 
CATTAAGCTG 
ATCCTGTGTT 



ACTTTTGCAG 
ACACATTTCG 
ATTCGACGAT 
ACAGAGCCGT 
GGTGGATGTG 
ATATCGCCTG 
CAATATTCCG 
AGCATATACG 
GAGTGAGCTT 
CTCCGGTGCG 
CGGGATCGGT 
TGCAGACTGG 
TATCTCAATG 
CATTTCTGCC 
GGATAAGTGG 
CATCTTCCGT 
TAAATCCGAT 
TTACAATGGA 
GGCCGGAGAT 
CTCCTCCAGT 
TACGGAAGGA 
ACTGACCGGA 
TTGGAACGAG 
AAACAATGGC 
CTCTGTGCCC 
TGTTCCTGCC 
AAAGGAAATA 
CGGTAGCGGT 
TGGAGTGGAA 
TCAGACCAAG 
GAAAGTTCCT 
CAAAACCTTT 



TCAGGACTGG 
TTCGGAGGGG 
GCAATGACTA 
TGGCAATCGC 
GACTTTGCAT 
CAATTCAAGT 
GAGGGC-3GAC 
AACGCCACTC 
ATTATGGATT 
GGTTATATAT 
GAGGACGATT 
CAGGCAGAGA 
TGCTCAGGCA 
GGACACTGTG 
ATCTTCACTT 
CGCAACAGTA 
GGTCTCTTGC 
TGGGACAGTA 
GCCATGAAGA 
GGTTCCGGAG 
GGATCGTCCG 
GGTGCCGGCA 
TAT GCATCCG 
CAGACGACCA 
CGGCTATTGT 
GATCAATATC 
GCTACGACAA 
ATCAT TCG AT 
TCTTATAAGG 
CTGAAGCCGG 
TTCTTAAGCC 
GAAGTGCCCT 



CTGCCTTCGT 
AACCCTTGAG 
TCCGCCTTAC 
AAAGAGATGG 
CCAAGGCTTC 
TGGAAGGAGC 
GCCTCTATAT 
ATCGCCGCAA 
ATGAAGTGTC 
TCGACAAAGT 
CCGATTCGGA 
AGAACGGTGT 
ACCTGCTCAA 
CTTCCATAAC 
TCCACTATGA 
TCATCGGAGC 
TGCAACTCAA 
CGCCCGATAT 
TTTCCATCCT 
GGACTGACGA 
GTTCTTCTCT 
ATTGTGGCGG 
ATGGCAATAC 
TCCTCAACGG 
TGCAGTCTAC 
CATCATCTTA 
AGGAGTTGTC 
ACGAAGTAAG 
ATACGGACAA 
ACGTAACACC 
AGTTGGTTTC 
ATGTTTCTGC 



GTCGATGGCA 
TTTCTCTTCA 
TCCGGATTTC 
CCGGCCCGTC 
GCACATCTCT 
CAAAGCCATT 
CTATACCCCC 
CGGAGCTTTT 
TCGCGGAGGG 
CGG CGGACGC 
TTGCGAGATC 
GGTGCAAATG 
TAATACGAAA 
AACCAATTTC 
AAAAAGACGA 
TTCCATGAAG 
CGATGAAGTC 
TCCCTCGAGC 
AAAGAAGACT 
TCACTTCTAT 
CTTCAATCAG 
GACGGAGTTC 
GAGCCGCATG 
AACGTATCGT 
AGGCGATCAG 
TCAGGTCGAA 
CTATTCGGAT 
CGCACGCTTC 
GACTTCTGCC 
TCTCCCCGGA 
CCGATTCGGA 
CGCAGCCGCA 



60 
120 
100 
240 
300 
360 
42C 
480 
540 
600 
660 
720 
780 
840 
500 
960 
1020 
10B0 
1140 
1200 
1260 
132C 
1380 
1440 
ISOO 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
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CAAACCCCCA ATCCTCCCGT TGGCGTAGTC 
CCCGAAAAGG CTGCTATCGC T GCCGTTT AT 
CACCTCTTCC TCAAGAGCAA CACAAACAGA 
TGGCAGGCCG GAACATGGTT GAGGATCAAT 
5 CAT AT GCT TT TTGCCGGTAT CAGAATGCCT 
TATGTAAGAA ATCCGGATAA CCTTTTCTCC 
GGAGTCTCTT TCGAAGGCTA CGGAATACCC 
CTGCTGGTAA AT AC C GAT GC TCCGAAGATC 
AAGGGAACGA ATGTGGCTCC ATTCCCCGAA 

10 ACATTTATCG GCACACAGGA TCCATCCGTC 
AGCGATGAAT ACGAAATAAA ACTGGTATAT 
CAGATTGAGA ATAACAATGC TGTCGTTGCA 
ATTAAGAACG CTCATATGGT TCACGCTGCC 
CGTTCTTGGA ACAACCTCCG CAATGGCGTG 

15 ACTTATATGC TCGTTATGCA GACGGCAAAC 



ATTGCAGACA AGTTTATGGC CGGTACATAT 19BO 

GTAATGCCAT CCGCTCCGGA CTCTACTTTC 204 0 

AGATTGCAGA AGGTGACAAC TCCCTCCGAT 2100 

TTGGATAAGC CGTTCCCGGT GAATAATGAC 2160 

AATAAGTACA AGCTCAATCG TGCTATCCGT 2220 

ATTACCGGTA AGAAGATTTC ATATAACAAC 2280 

TCGCTCTTGG GCTATATGGC TAT CAAAT AT 234 0 

GATATGTCGC TTGTACAGGA GCCTTATCCT 24 00 

TTGGTCCGCA TAT AT GTCT A TAAGAACGGA 24 60 

AC AACT T ATT CGGTTTCAGA CGGAACAGAG 2520 

AAGGGATCGG GCATTTCGAA TGGCGTTGCT 2580 

TATCCGTCTG TTGTAACAGA TCGTTTCAGC 264 0 

GCCCTCTACT CATTGGATGG GAAGCAGGTT 2700 

ACATTCAGTG TTCAAGGACT TACGGCCGGT 2760 

GGCCCTGTGA GCCAAAAGAT CGTGAAGCAG 282 0 



(2} INFORMATION FOR SEQ ID NO: 2 00 

20 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2010 base pairs 
IB) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY; circular 

25 

<il) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
30 (iv) ANTr-SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

35 (i x > FEATURE: 

{A» NAME/ KEY: mlscjeature 

(B) LOCATION 1 2010 



40 



60 



65 



70 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 200 



ATGAAATATC TTATCAGACT CTTCTTATCA TTGATGTTAC TCTCTCTCTG GACGGGCTGT 60 

ACACACGAGG AGCTCTCTAT TTGCGATGGC GAGAATACGC TTGTTTTACG CGTAGAGACC 120 

GGTAAAGCCC CAAATGCTCG TGCCACAGAA CCCGGTCAGG GCATATACAA TGAGAATAAA 180 

GTACGCTCCA TTTCTGTGCT CTTCTATTTA GAGGGACAAC TTCGTTGGCA GGTGAAGTCT 24 0 

ACAGACTATC AAATCCATGA AGGGGCCTAT ATCATTCCGG TCAAAGAGCA AATGCGACCA 300 

CTATTCAATG GCAACAACAA CTTCAGCATC TATGTAGTGG CCAATCT CGA TTTCAATGCT 360 

CCGGCCACAG AAGCTGCGCT TTCTCAATTT GTGGTAGAGA AATCTATTGA AGTCTCTTCT 420 

ACGACAGCCC CTGCCGATTT CGTAATGCTT GCTCATGGCA ATAAGCAGAT CAATATGGCT 480 

ACGACAGAAG GGAAACTGTT GGGGGATTAT AAACTCAAAC GAGTGGCAGC AAAGATTCGC 54 0 

OU ATGATAAAAC CCACCATCAA TGTGCAAGGA TATGAAGTGG TCGGAAATAT ACAGGCAAAG 600 

TTTCGCAATT CGGTAACCAA GGGGTTCCTT ACCACAGAAC CTCAAGAGAT CCCAGCTGCT 660 

GCATCCTATA AGACATCGGA ATATCTTGAT ATTGCAGAGT CGGCACCTGC CAATTCTATC 720 

CATTTCTATT CTTACTATAA CAAATGGACA CTCTCCACAC CGGAGAAGCG ACCGGAATTC 780 

TT CAT CAT GG TCAAATTCAA AAAGACAGGA CAGCCGGACA ACACAGCCAA ACCGTACTAC 84 O 

TACAGAGTGC CCCTCGAATC TCAGGACAAT CAGGTCAAGA GCAATGTCCT CTATAATCTG 900 

AATGTGAAAA TCGAAATCTT GGGTTCTTTA CAAGAGCCGG AAGCTGTTTC TGTAAACGGC 960 

ACACTCGCAA TAGAAGAATG GATTCTCCAT CAGGATGCAT TCAATCTGCC TGCCACCAAT 1020 

TACT TGATAG TGGAACAGCA CGAAATCTTC ATGAATAACG TGAACACATA CTCGGTGAAA 1080 

TATCAAACTT CGCAGAAACC AATCAGCATT AGCATACAGT CAGTTACCTT TAGCTACGTC 1140 

TCTTCTGATG GCACTCAGCA CAATGATCTT GTAGCAAGTA GTAGCGACCA GTATCCf ACG 1200 

ATTACAAGCG ATAATACAAG CATCATAATC ACTTCCAAGA TACCGGTTAA TAACGTACCA 1260 

AAGAAGATCG TTTTTGAGGT AACTAATGGG GTAGCCGGTT TGAAAGAGAC TGTCACAGTA 1320 

CTCCAATATC CTGCACAATT TATTGTCAAT ACACTTGGCA CAGCATCGGC ATGGAGACCA 1 3B0 

GACGGATCTT TGGCTCCGGG GCTTAACAAT AAAGCGATTT ACCATGT CGT AGTACTGGTT 14 4 0 

CCACCCGAGA ATT TATTTGA AGATGGGACA CAGACAATCA TCGGTTATCC CCCCACTGAA 1500 

ACAATTTCTT TTCATAAGAA AGAGAACAAT ACCTATCCGA TAGTATGGTC TGACACAAAT 156C 

ACGACAAAAC AGGACCTTGA GACATCAAGA ATGATTTCAC CTTCCTTTGA GTTAGCCTCC 1620 

CAACTTGGGG CTACTCTCCC GATGCCCTAT CTCGAGTATT GGCCAGGGAC ATCATATCTC 1680 

CTTGACTATT CGGGAAACTA TAATAATAAG AGATACGCCT TGTTTAATTG CGCTTTTTAC 17 4 0 

TGGGAGAAAA GAAAAGTTAA TAACGAAGAA ATTAAATTCG ATGACTGGCG TTT GCCGACA 1800 

GAAGCTGAGA TCAAATTGAT AGATAAGCTG CAACATAATG AGCAGAGTGC TGTCCAAGCT I860 

ATCATGACAG GGAATTATTA TTGGGATAGT TACTCTGCAA ATGGGTCTTA TAAAATGCAA 1920 

GGAGGAGGGG GCCAAGGAAA TTCCTCCAAA GCCTATGTTC GTTGCGTGCG GGATGTGAAA 1980 

AAGCCGATTC GTGACAACAA GTCAGGTAAG 203 0 
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[2> INFORMATION FOR SEQ ID HO: 201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3846 base pairs 
(P) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D> TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

Uii> HYPOTHETICAL i NO 

Uv) AUTI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : PORYPHYROHONAS G1NGIVALIS 

Ux) FEATURE: 

(A) NAME/KEY: misc feature 
1B» LOCATION 1. . .384 6 

lJCi> SEQUENCE DESCRIPTION: SEQ ID NO: 2 01 



ATGCGAAAAA 
ACTTGGCATG 
AGTTGTACCC 
GAAGGCAACC 
GGCGAAGCTG 
AACGTTGCTG 
CCTACCTATG 
ATAAACAATG 
CAAGACGGGT 
AACCCTGTCC 
AATCCTCAGC 
TTTATTAATT 
GGTACAACAA 
TTGGTTATTT 
CTGTGCGAAC 
GTATTGAATA 
TTCATTCGCT 
TACGTGCTAC 
GTCCCAACCT 
ATATGCGCCT 
TTTATCGGTC 
ATCAACAAAG 
AAAGGCTGCG 
GTCAACTCTA 
AATAATGGTT 
GGGCAGGGAC 
TTATGTACGA 
GAAGTCCTGA 
GCCACCCAAT 
TTACCTTATT 
ATTATCAATA 
CCTGCACTAA 
AACACCATTA 
AAAGGAGTTT 
GAAATAGGCA 
GTTTACGGCG 
AACTT GTTTT 
AGTGCCCCGC 
CGAAGTGACA 
TCGGGGATAA 
GCAAAGATTA 
ACGCCAGGCT 
TACATTTCCG 
AACAGAACGA 
AGCACGCTTT 
GGAGTACGTC 
GAAGATAAGC 
TTCCCTCTCA 
AACGACACGT 
TTGTTTGATC 
GATGGATTGC 



TTTTGAGCTT 
GAGATCCGGA 
GAAT CACGT T 
AAGTCTTTCA 
AATTGCCTGT 
TAAAAATCAA 
TCGTAGAGGA 
ACTATTATAG 
ATT TTCGCT C 
GACAAGAAAT 
CACCTTTACA 
ATGAAGCTGA 
CGTACATAAG 
ACGAT GAT AT 
ATAGAGCCTT 
GCTTCCCATC 
CAGTTTACAA 
TGAT CGG AAA 
CTTTTATTCA 
CCGACTATTT 
GATTTAGCGT 
AAATCTCATA 
ATGCTCCAAT 
TCTTAAAATC 
CCCATCATTT 
TCGACGTCAA 
GTCTATCATG 
CCACATACGC 
ATGCCG TTT A 
CTTTATATCA 
CTAATTCTGT 
ACATTATGGC 
TAAGCAGTCC 
TGCATTTTAC 
ATCAGGCTAA 
ATGGTCTTGC 
CTACGCATTC 
TGTATACAAC 
TTATTTCAAA 
CGGTATTCAA 
CCGACAATAC 
TAACAGCAAC 
GTAATAAAAT 
ACAGACTCCA 
ATAATTCCTA 
TCCTCAACAA 
AGACGTTTAT 
ACTTCCATTA 
ATACGAATCG 
CGAATCAGGT 
CAAATGGGAG 



TTTGATGATG 
CTCAGTGGCA 
CGAGGTTGTT 
GCUCATTTCC 
TTTGAAAAAG 
AGAGACGGAG 
GTTGCCTGAG 
CCAAAATGTA 
ACAAAGATTT 
TCTATTTGCA 
AAAAAACACC 
TGGCAAATCG 
CGGAAATG'l-r 
GTTCAACGTA 
CTACAACGGC 
AAATGCCACC 
CCAAAGCAAT 
ACCATTGAGC 
TAATGTCTCC 
TTTTAGTTGT 
CACCAATGCT 
TAATCCTATT 
CTTACGTTTA 
TAATCAGGTC 
TTATTTTAAC 
TACTCTAACA 
TAGT TCGGCT 
ACCTAACAAG 
TTTAGAAGGC 
CAATCTCTCG 
TGATACGTAT 
TCATGGCATG 
GATAACAATA 
TAATAATGGC 
AATATCCGGA 
GATTAACAAG 
GGTCATGCCC 
GAACTGTATT 
GAATTGTGAC 
GCCTATGGCT 
TT1TTTTGCG 
CTCCAATGCT 
AGTCAATTGC 
CAATATCACA 
TGGTATTTAC 
CAGTTGTTTT 
TTCTAATAGG 
CAACAGCTTG 
CTATATTGAC 
TTTCAATACG 
ATCGGGCAAT 



TGCTCTCTGC 
GCCCTACCTT 
TTCCCCGGAT 
ATGCCGGGTT 
ATGATAGCCG 
ACATTCGACA 
GGGGGGACTT 
AGCCTCCCTT 
ATCGAAGTTA 
AAAAAAATCG 
GGCATATTTA 
GCGATAGAAA 
GCCAGCAACC 
AATCAACAAC 
TTTGATGTAG 
TCATACATCA 
GCGAAGAGGA 
AAATATTTGG 
TTAATTCCAA 
GTTTCGCCCC 
CATGAATTGC 
GCACACAAAA 
TTCTTAAAAG 
TCTGCAATAG 
ACTCATGGAA 
GCCCGATTGA 
GTAGCAGATT 
GGATTCTCGG 
CCCTGTCCTC 
ACTGTTGTTG 
TCGAAATTCA 
GAGGTTAGTA 
AAAAATGGTG 
TCCATACAAG 
GAG ACCGGT G 
CAGGTAGAGA 
AAATTTCATT 
GTGGAGATAA 
CTAAGCGTTG 
ACAAGCTCCA 
ACAGGAAACT 
GCCATCAAGT 
GATGAGGCTC 
CGGAATGTGA 
AACCGAAATA 
TATTTCGATA 
ACTTGGCAGC 
CAGGGGGGAG 
GTTTCAAATA 
CCAGACTTGT 
AGCTCTGCTG 



ATTTAGGTCT 
CTATCGGTAT 
TTTATAGTGT 
GTGGCTCGTT 
TTCCGGAATT 
ATTATAATAT 
ATCTGGTAGA 
CTACTCACTA 
CCCTGTATCC 
AGGT TACAAT 
ACAAAGTAGC 
ATGATATGGT 
TCCCTCAGAA 
CACACGACGA 
CTGCTGTAAG 
ACGAAACTAA 
CTTTAGATGG 
CTGACACTGA 
GTCATCCAAC 
TTGATACTGT 
ACAATCTGAT 
ATATTCTTTA 
AAATCGCCTC 
ACTCGATATT 
TGCCGACTGT 
ACAATACATC 
CAACTATTAG 
CTTTCTTAGG 
CGTCAGAATT 
GCGAAATGTT 
ACTTCAATTT 
ATTGTATTAC 
GCTGCCTAAA 
TCATGTCCCG 
CTAACCCCAC 
TAGACAATAT 
TTGACAGTGT 
GCAATTGCGA 
AAAACAGTAT 
TCACCGGATT 
TCGCCTACCA 
TAGACAATAT 
TTGTACTAAA 
TAAAAAACTG 
AGATCAGTAA 
ATGCTCCTGT 
TCTATTCATC 
ATACAGATAC 
AT CACTGGGG 
TCATTTGGAT 
AAGCAGTAGA 



ACAATCTCAG 
TCAAGAGTCA 
GGAAAAACGA 
TGGGAATCTG 
TTCAACAGCT 
CTATCCTAAT 
GGCTTTCGCG 
TGTCTATTCT 
TTTTCGATAC 
AACTTTCGAT 
CTCCTCTGCA 
GTTCAGTCGT 
CTGTGACTAC 
AATCAAACGG 
TATAAAGGAC 
ACTGAAAAAT 
CAAACTGGGA 
TAATACAAAA 
TTTTGGTTCC 
CGGCGATTTG 
TGAAAAGACT 
CGCAGAAGGG 
TGGTTACACA 
TGACTGCTTG 
TTGGGGGATA 
TTCGCAGGGA 
ATCGCTTGGA 
AGGAAGCAGA 
TTATGAATAT 
GCTATCATCC 
GCTTGGCGAC 
ACTACCAAAC 
AATACCGGAA 
AGGAACTCTG 
CTTTATTACC 
AGACCGACTT 
GAAATTCAAC 
ATTTACCAAT 
GTTTAGCAGT 
ATCTACAAAA 
TATCACAAAC 
TCCTGAGTAT 
TAATAGTGGC 
TAGGATTGGG 
CAATCATATA 
AATCAATGAA 
AAACGGTACA 
ATGGATTTAC 
CAACAATGAT 
ACCTTTTTGG 
ATTCCAAACA 



60 
120 
180 
240 
300 
360 
420 
4 BO 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2200 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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GCATTGGACT 
GTTGAAACCT 
GAGAAAATGT 
ATCATCTCTT 
GTGTGTGAAA 
TCCTATCAGG 
TTAGACTCAC 
TCGCTCGAAA 
GGTACTCTCC 
TCTATTTCGC 
TCCTGTTCTG 
CCCAAACATC 
GGTTTCTACC 
ATCAAA 



GTATTGGCAA 
ACCCGGAATC 
CAGGCAACGA 
CCCAGAACTT 
ACTATCAGTC 
ACAGTGTTTT 
TCAGAGGGAC 
GCCATCAAAA 
TGCCTCCATT 
CCAATCCGGC 
TAATAAAAAT 
TATCCGAAGG 
TGGTAACGCT 



TAG CG ATT AT 
CGACTTTGCA 
TTACGAAGGC 
GTTCCCGACA 
TGCCATCGAT 
TGCAGTCATT 
TGGTATAGAT 
TGTAAAAAAT 
AGAATGCAAC 
GAAAGCTGTT 
ATATGGAATA 
TTATTACAGC 
AAATGTTGAT 



CTTTCGGCAA 
ATAGCTGCTT 
TTGAAAGATT 
GCTGATTTCC 
TGGTACGAAA 
GACCTTGGTG 
TTGAACATAC 
TATTTGTTGT 
AAATCAAGCC 
GTAACAATAA 
AATGGAGCCT 
ATACAGTTCA 
CAGAAAATTA 



AAGTGGCTCT 
TGAAGGAATT 
ATTTCAGATC 
TGTCTGCGCG 
ATCGCTTGAA 
ACATTTATTG 
TTTCCTGTGA 
CAACTCTTCC 
TTGATAAATC 
TCTACTATAC 
CGGCTGATAT 
ATACATCCAA 
TAGATACGGA 



CAAGATGATG 

gttcaggata 
caatccaacc 
atgcgatatt 
tagtgaaatc 
gaatatgcag 
acaaaggaaa 
cgaatcaaca 
caagataatc 
cgataaccct 
aaccgggttg 

CTTTGATCCC 
AAAATTACGA 



3120 
3180 
3240 
330O 
3360 
3420 
3480 
354 0 
3600 
3660 
3720 
3780 
3840 
3846 



20 



25 



30 



(2) INFORMATION FOR SEQ ID HO: 202 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3822 base pairs 
(8) TYPE: nucleic acjd 

(C) STRAW DEDNESS: double 

(D) TOPOLOGY: circular 

<ii> MOLECULE TYPE: DltA (genomic) 

<iii» HYPOTHETICAL: HO 

Uv) AHTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(!>:) FEATURE: 

(A) WANE/KEY: miscjeature 
{B> LOCATION 1...3022 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 202 



ATGATGTGCT 
GTGGCAGCCC 
GTTGTTTTCC 
ATTTCCATGC 
AAAAAGATGA 
ACGGAGACAT 
CCTGAGGGGG 
AATGTAAGCC 
AGATTTATCG 
TTTGCAAAAA 
AACACCGGCA 
AAATCGGCGA 
AATGTTGCCA 
AACGTAAATC 
AACGGCTTTG 
GCCACCTCAT 
AGCAATGCGA 
TTGAGCAAAT 
GTCTCCTTAA 
AGTTGTGTTT 
AATGCTCATG 
CCTATTGCAC 
CGTTTATTCT 
CAGGTCTCTG 
TTTAACATC 
CTAACAGCCC 
TCGGCTGTAG 
AACAAGGGAT 
GAAGGCCCCT 
CTCTCGACTG 
ACGTATTCGA 
GGCATGGAGG 
ACAATAAAAA 
AATGGCTCCA 
TCCGGAGAGA 
AACAAGCAGG 



CTCTGCATTT 
TACCTTCTAT 
CCGGATTTTA 
CGGGTTGTGG 
TAGCCGTTCC 
TCGACAATTA 
GGACTTATCT 
TCCCTTCTAC 
AAGTTACCCT 
AAATCGAGGT 
TATTTAACAA 
TAGAAAATGA 
GCAACCTCCC 
AACAACCACA 
ATGTAGCTGC 
ACATCAACGA 
AGAGGACTTT 
ATTTGGCTGA 
TTCCAAGTCA 
CGCCCCTTGA 
AATTGCACAA 
ACAAAAATAT 
TAAAAGAAAT 
CAATAGACTC 
ATGGAATGCC 
GATTGAACAA 
CAGATTCAAC 
TCTCGGCTTT 
GTCCTCCGTC 
TTGTTGGCGA 
AATTCAACTT 
TTAGTAATTG 
ATGGTGGCTG 
TACAAGTCAT 
CCGGTGCTAA 
TAGAGATAGA 



AGGTCTACAA 
CGGTATTCAA 
TAGTGTGGAA 
CTCGTTTGGG 
GGAATTTTCA 
TAATATCTAT 
GGTAGAGGCT 
TCACTATGTC 
GTATCCTTTT 
TACAATAACT 
AGTAGCCTCC 
TATGGTGTTC 
TCAGAACTGT 
CGACGAAATC 
TGTAAGTATA 
AACTAAACTG 
AGATGGCAAA 
CACTGATAAT 
TCCAACTTTT 
TACTGTCGGC 
TCTGATTGAA 
TCTTTACGCA 
CGCCTCTGGT 
GATATTTGAC 
GACTGTTTGG 
TACATCTTCG 
TATTAGATCG 
CTTAGGAGGA 
AGAATTTTAT 
AATGTTGCTA 
CAATTTGCTT 
TATTACACTA 
CCTAAAAATA 
GTCCGGAGGA 
CCCCACCTTT 
CAATATAGAC 



TCTCAGACTT 
GAGTCAAGTT 
AAACGAGAAG 
AATCTGGGCG 
ACAGCTAACG 
CCTAATCCTA 
TTCGCGATAA 
TATTCTCAAG 
CGATACAACC 
TTCGATAATC 
TCTGCATTTA 
AGTCGTGGTA 
GACTACTTGG 
AAACGGCTGT 
AAGGACGTAT 
AAAAATTTCA 
CTGGGATACG 
ACAAAAGTCC 
GGTTCCATAT 
GATTTGTTTA 
AAGACTATCA 
GAAGGGAAAG 
TACACAGTCA 
TGCTTGAATA 
GGGATAGGGC 
CAGGGATTAT 
CTTGGAGAAG 
AGCAGAGCCA 
GAATATTTAC 
TCATCCATTA 
GGCGACCCTG 
CCAAACAACA 
CCGGAAAAAG 
ACTCTGGAAA 
ATTACCGTTT 
CGACTTAACT 



GGCATGGAGA 
GTACCCGAAT 
GCAACCAAGT 
AAGCTGAATT 
TTGCTGTAAA 
CCTATGTCGT 
ACAATGACTA 
ACGGGTATTT 
CTGTCCGACA 
CTCAGCCACC 
TTAATTATGA 
CAACAACGTA 
TTATTTACGA 
GCGAACATAG 
TGAATAGCTT 
TTCGCTCAGT 
TGCTACTGAT 
CAACCTCTTT 
GCGCCTCCGA 
TCGGTCGATT 
ACAAAGAAAT 
GCTGCGATGC 
ACT CP AT CT T 
ATGGTTCCCA 
AGGGACTCGA 
GTACGAGTCT 
TCCTGACCAC 
CCCAAT AT GC 
CTTATTCTTT 
TCAAT ACT AA 
CACTAAACAT 
CCATTATAAG 
GAGTTTTGCA 
TAGGCAATCA 
ACGGCGATGG 
TGTTTTCTAC 



TCCGGACTCA 
CACGTTCGAG 
CTTTCAGCGC 
GCCTGTTTTG 
AATCAAAGAG 
AGAGGAGTTG 
TTATAGCCAA 
TCGCTCACAA 
AGAAATTCTA 
TTTACAAAAA 
AGCTGATGGC 
CATAAGCGGA 
TGATATGTTC 
AGCCTTCTAC 
CCCATCAAAT 
TTACAACCAA 
CGGAAAACCA 
TATTCATAAT 
CTATTTTTTT 
TAGCGTCACC 
CTCATATAAT 
TCCAATCTTA 
AAAATCTAAT 
TCATTTTTAT 
CGTCAATACT 
ATCATGTAGT 
ATACGCACCT 
CGTTT ATT TA 
ATATCACAAT 
TTCTGTTGAT 
TATGGCTCAT 
CAGTCCGATA 
TTTTACTAAT 
GGCTAAAATA 
TCTTGCGATT 
GCATTCGGTC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1O20 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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ATGCCCAAAT 
TGTATrGTGG 
TGTGACCTAA 
ATGGCTACAA 
TTTGCGACAG 
AATGCTGCCA 
AATTGCGATG 
ATCACACGGA 
ATTTACAACC 
TGTTTTTATT 
AATAGGACTT 
AGCTTGCAGG 
ATTGACGTTT 
AATACGCCAG 
GGCAATAGCT 
GATTATCTTT 
TTTGCAATAG 
GAAGGCTTGA 
CCGACAGCTG 
ATCGATTGGT 
GTCATTGACC 
ATAGATTTGA 
AAAAATTATT 
TGCAACAAAT 
GCTGTTGTAA 
GGAATAAATG 
TACAGCATAC 
GTTGATCAGA 



TTCATTTTGA 
AGATAAGCAA 
GCGTTGAAAA 
GCTCCATCAC 
GAAACTTCGC 
TCAAGTTAGA 
AGGCTCTTGT 
ATGTGATAAA 
GAAATAAGAT 
TCGATAATGC 
GGCAGCTCTA 
GGGGAGATAC 
CAAATAATCA 
ACTTGTTCAT 
CTGCTGAAGC 
CGGCAAAAGT 
CTGCTTTGAA 
AAGATTATTT 
ATTTCCTGTC 
ACGAAAATCG 
TTGGTGACAT 
ACATACTTTC 
TGTTGTCAAC 
CAAGCCTTGA 
CAATAATCTA 
GAGCCTCGGC 
AGTTCAATAC 
AAATTATAGA 



CAGTGTGAAA 
TTGCGAATTT 
CAGTATGTTT 
CGGATTATCT 
CTACCATATC 
CAATATTCCT 
ACTAAATAAT 
AAACTGTAGG 
CAGTAACAAT 
TCCTGTAATC 
TTCATCAAAC 
AGATACATGG 
CTGGGGCAAC 
TTGGATACCT 
AGTAGAATTC 
GGCTCTCAAG 
GGAATTGTTC 
CAGATCCAAT 
TGCGCGATGC 
CTTGAATAGT 
TTATTGGAAT 
CTGTGAACAA 
TCTTCCCGAA 
TAAATCCAAG 
CTATACCGAT 
TGATATAACC 
ATCCAACTTT 
TACGGAAAAA 



TTCAACAGTG 
ACCAATCGAA 
AGCAGTTCGG 
ACAAAAGCAA 
ACAAACACGC 
GAGTATTACA 
AGTGGCAACA 
ATTGGGAGCA 
CATATAGGAG 
AATGAAGAAG 
GGTACATTCC 
ATTTACAACG 
AATGATTTGT 
TTTTGGGATG 
CAAACAGCAT 
ATGATGGTTG 
AGGATAGAGA 
CCAACCATCA 
GATATTGTGT 
GAAATCTCCT 
ATGCAGTTAG 
AGGAAATCGC 
TCAACAGGTA 
ATAATCTCTA 
AACCCTTCCT 
GGGTTGCCCA 
GATCCCGGTT 
TTACGAATCA 



CCCCGCTGTA 
GTGACATTAT 
GGATAACGGT 
AGATTACCGA 
CAGGCTTAAC 
TTTCCGGTAA 
GAACGAACAG 
CGCTTTATAA 
TACGTCTCCT 
ATAAGCAGAC 
CTCTCAACTT 
ACACGTATAC 
TTGATCCGAA 
GATTGCCAAA 
TGGACTGTAT 
AAACCTACCC 
AAATGTCAGG 
TCTCTTCCCA 
GTGAAAACTA 
ATCAGGACAG 
ACTCACTCAG 
TCGAAAGCCA 
CTCTCCTGCC 
TTTCGCCCAA 
GTTCTGTAAT 
AACATCTATC 
TCTACCTGGT 
AA 



TACAACGAAC 
TTCAAAGAAT 
ATTCAAGCCT 
CAATACTTTT 
AGCAACCTCC 
TAAAATAGTC 
ACTCCACAAT 
TTCCTATGGT 
CAACAACAGT 
GTTTATTTCT 
CCATTACAAC 
GAATCGOTAT 
TCAGGTTTTC 
TGGGAGATCG 
TGGCAATAGC 
GGAATCCGAC 
CAACGATTAC 
GAACTTGTTC 
TCAGTCTGCC 
TGTTTTTGCA 
AGGGACTGGT 
TCAAAATGTA 
TCCATTAGAA 
TCCGGCGAAA 
AAAAATATAT 
CGAASGTTAT 
AACGCTAAAT 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
354 0 
3600 
3660 
3720 
37CO 
3822 



(2) IH FORMATION FOR SEQ ID MO: 203 



35 



(1) SEQUENCE CHARACTERISTICS : 

tA) LENGTH : 2775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDfclDNESS: double 

(D) TOPOLOGY: circular 



40 



(ii) MOLECULE TYPE: DMA (genomic) 
(iil) HYPOTHETICAL: NO 



45 



<iv) ANTI-SENSE: HO 



(vl) ORIGINAL SOURCE: 

(A* ORGANISM: PORY PHYR.OMONAS GIUGIVALIS 



50 



lix) FEATURE : 

(AJ NA11E/KEY: miscjcature 
(B) LOCATION 1...2775 



(y.x) SEQUENCE DESCRIPTION: SEQ ID NO: 203 



55 



60 



65 



70 



75 



ATGGCTATCA 
GCAGCGATCA 
GCTCCGGCTC 
GGCTGGCTTG 
TTCTCTGTAC 
TCAACAGCGG 
TGGGTATGCA 
GGGACTGCCA 
CCTCT7GTAT 
CGACATTACA 
TATAGGTCGA 
AATGTGGGAC 
GAAGAGTTGC 
AAAGATGTCT 
CAAGTGGAGT 
GGCACGCTGC 
CCCAATGGTT 
AATGCATACG 
GTTCCGGGTA 
GCCAAACGTG 
GCGGTGATGG 
ACCATGACAG 



TGATGAAAAG 
CGAATCCGAC 
AACCGGATAC 
AGATAGATGC 
CTTACGGACA 
GCAACTATCT 
ATCAGTATAG 
TTGAAGACTT 
GGCGTAGACG 
AAGTCACCGA 
TCGAAGGGCC 
GATTGACTTG 
AGCTTAGCGG 
CCATACTGGA 
ACTGCGTTAC 
ATTACGCCAC 
GGCTTGTGAT 
ACGCTTTTCC 
TAGGCCCGGT 
TCAAGTACTG 
CTTCGACAAC 
CGAAGCCGAC 



TATTGTTTTT 
TGCTCAAGAG 
TATCTTATAT 
TGATGCTGAT 
CAATGGCCTT 
GATTACACCC 
TACCAATCCG 
TGTTTTGTTG 
AATCGTGGAC 
CTCACACACA 
CGAACCTGCT 
GAACTATCCG 
CTACAACATC 
GTATGTGGAC 
AGCCGTTTAC 
GGATGCCATC 
AGACGCTGAT 
CGGCCATAAT 
GACTCCCGAC 
GGTAAGCACG 
GGGGACTGCT 
CGGCGCATGG 



AGAGCATTTC 
ATCTCAGGCA 
GAAAGTTTTG 
GGTGCCACTT 
TGCACCTACT 
AATATAGAAG 
GAACATTACG 
TTTGATGATT 
TTACCGGAAG 
GAATTCTTGA 
ACCGRCTTCA 
GAGGATTATC 
TATGCGAACG 
AGCACTTACT 
GATGAAAGCA 
CTTTATGAAA 
GGAGATGGAT 
GGAGGCCATT 
AACTATCTGA 
CAGGATGCCA 
GTCGGAGATT 
TATGAAAGAA 



TAACGATTTT 
TGAATGCATC 
AGAATGGACC 
GGGGAAGCCC 
CCCATATACG 
GAGCCAAACG 
CAGTAATGGT 
CCATAACAGG 
GGACCAAATA 
AATTGGATGA 
CAGTAATCAA 
AACCGGAAGG 
GTACACTACT 
CTTTGCGAGA 
TAGAATCTTC 
ATTTTGAGAA 
TTAGCTGGGG 
GCTCCTTGTC 
TTACCCCCAA 
ATTGGGCAGC 
TCGTCATATT 
CCATCAACTT 



GCTCTCGTGG 
CTGTCTGGCT 
TGTTCCCAAT 
ATCAGGCTCT 
TTCCGGTATC 
GGTCAAGTAC 
ATCGACAACC 
GAAACCGACT 
TATTGCATGG 
TGTCACTGTG 
TATTGGTCAG 
AAAGGGGAAT 
GGCACAAATA 
CAATCCCTTG 
GACCGTATGT 
TGGACCTGTT 
ACACTATTTG 
GGCTTCTTAT 
GGTTGAAGGA 
GGAACATTAC 
GTTCGAAGAA 
ACCTGAAGGG 



60 
X20 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
10P0 
1080 
1140 
1200 
1260 
1320 
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10 



15 



20 



25 



ACTAAATACA 

GATATCACTG 

TCGCTTATTG 

GATAAGACTG 

TCGCTCCTTG 

TCACGAGACG 

GAGTCCCAAT 

GAAGGCTTTG 

AATGTTAATT 

GCATCCCCTT 

CCCAGACTCG 

TCGGCTGAGC 

CTCTTGTTCG 

ACATTGCCTG 

TTCTTGCTCT 

ACTGATTTCG 

AACGGCTACG 

ATCTATGCAA 

GATGAGACTT 

TATAACGACA 

TTGGATAATA 

GTAAGGATAG 

ATTTGCATAT 

AATGACGGAG 

GAGATAAAGA 



TCGCATGGCG 
TATTCGGGAC 
AAAACAACAA 
ATGATAAAGA 
TTCACATACA 
ATCAGGTGGA 
CGGTTTGCGA 
AGGCCGGAAG 
GGGACTATTA 
CGTACTTACC 
AAGGAGCCAA 
ATTATGCTGT 
AAGAGACAAT 
CAGGAACAAA 
TGGATGACAT 
TTGTCTCGCT 
AACCCGATAA 
ATGGCTCGCT 
ATTCTTCACG 
ATATCGAGTC 
TTCAATCTGA 
AGGGATTGAG 
TAAGGGAAGA 
TCTACTTGAT 
GGCCG 



GCATTACAAC 
TCCTGCATCA 
GGGACGATTA 
CCCATTGCAG 
AGACCCGACT 
AGT GGAAT AT 
TAAGCTGATT 
TATTCCTGAA 
TCCTTGGACT 
GATGATTGGC 
GCTTGTCAAG 
GATGGTTTCT 
GACCGCTAAG 
ATATATTGCC 
TACGGTTTAT 
TATTGAGAAT 
GACTGATGAT 
CCTTGTTCAC 
AGACGGTCAG 
CCAATCGGTT 
TACAAGCTTG 
TCGGAGCAAG 
GACTCATTCA 
TAAAGTAGTC 



TGTACCGATA 
GAGCCCGAAC 
AAGTGGAATT 
CTTGCCGGCT 
GTTTTGGAGT 
TGTGTCACTG 
TATGATTCTC 
GGCTGGTTGT 
ATGTATGGAC 
GTTTTAACTC 
TATTGGGTAA 
ACTACGGGAA 
GCTAACGGTG 
TGGCGGCATT 
CGTTCTACTG 
AACAAGGGTC 
AAAAAACCAT 
ATACAAGACC 
GTGGAAATGG 
TGCGATAAGC 
AAAATATATC 
TCGACAATCG 
GAGAAAACGG 
GGTGGAAATA 



TATATTTCTT 
CTGTTACCGA 
ATCCTAACGG 
ACAATATCTA 
ATATCGATGA 
CCCTTTATAA 
AATCGGACAT 
TGATTGATGC 
ATGACAGTGA 
CGGATAACTA 
GTGCGCAAGA 
CTGCTGTTGA 
CATGGTATGA 
ATGATTGCAC 
AGACTGTTCC 
GCCTGAAATG 
TGCAGCTTAC 
CGACTGTTTT 
AATATTGTGT 
TGAACTATAC 
CTAATCCGGC 
AGTTGTATAA 
AAATCGATGT 
AAACAACAAC 



GAAGTTGGAC 


1380 


TTTCGTTGTC 


1440 


CTACGAACCC 


1500 


TGCAAACGGC 


1560 


GACTTATTCT 


- 1620 


CGACAATATC 


" 1680 


TATCTTATAT 


* 1740 


TGATGGCGAC 


1000 


GAAGTG TATT 


I860 


TTTGGTTACA 


1920 


TGCTGTTTAT 


1980 


AGATTTTGTC 


204 0 


GCGAACTATT 


2100 


CGATATGTTT 


2160 


CGAGCCTGTT 


2220 


GAATTATCCT 


2200 


CGGCTACAAC 


234 0 


GGAGTATATC 


2400 


CACTGCCGTT 


24 60 


TATCACATCC 


2520 


ATCGTATGTG 


2S0O 


TGCGCTGGGA 


2640 


TTCACGTCTC 


270O 


CGAAAAGGTA 


2760 




277b 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



{?.) INFORMATION FOR SEQ ID HO: 204 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANUEDNESS : double 

(D) TOPOLOGY: circular 

(II) MOLECULE TYPE: DNA (genomic J 
<iii] HYPOTHETICAL: HO 
Uv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM! PORYPHYROMONAS GIHCtVALIS 

(lx) FEATURE: 

(Al NAME/ KEY: misc feature 

(B) LOCATION 1...2766 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 



ATGATGAAAA 

ACGAATCCGA 

CAACCGGATA 

GAGATAGATG 

CCTTACGGAC 

GGCAACTATC 

AATCAGTATA 

ATTGAAGACT 

TGGCGTAGAC 

AAAGTCACCG 

ATCGAAGGGC 

CGATTGACTT 

CAGCTTAGCG 

TCCATACTGG 

TACTGCGTTA 

CATTACGCCA 

TGGCTTGTGA 

GACGCTTTTC 

ATAGGCCCGG 

GTCAAGTACT 

GCTTCGACAA 

GCGAAGCCGA 

ATCGCATGGC 

GTATTCGGGA 

GAAAACAACA 



GTATTGTTTT 
CTGCTCAAGA 
CTATCTTATA 
CTGATGCTGA 
ACAATGGCCT 
TGATTACACC 
GTACCAATCC 
TTGTTTTGTT 
GAATCGTGGA 
ACTCACACAC 
CCGAACCTGC 
GGAACTATCC 
GCTACAACAT 
AGTATGTGGA 
CAGCCGTTTA 
CGGATGCCAT 
TAGACGCTGA 
CCGGCCATAA 
TGACTCCCGA 
GGGTAAGCAC 
CGGGGACTGC 
CCGGCGCATG 
GGCATTACAA 
CTCCTGCATC 
AGGGACGATT 



TAGAGCATTT 
GATCTCAGGC 
TGAAAGTTTT 
TGGTGCCACT 
TTGCACCTAC 
CAATATAGAA 
GGAACATTAC 
GTTTGATGAT 
CTTACCGGAA 
AGAATTCTTG 
TACCGACTTC 
GGAGGATTAT 
CTATGCGAAC 
CAGCACTTAC 
CGATGAAAGC 
CCTTTATGAA 
TGGAGATGGA 
TGGAGGCCAT 
CAACTATCTG 
GCAGGATGCC 
TGTCGGAGAT 
GTATGAAAGA 
CTGTACCGAT 
AGAGCCCGAA 
AAA GT GGAAT 



CTAACGATTT 

ATGAATGCAT 

GAGAATGGAC 

TGGGGAAGCC 

TCCCATATAC 

GGAGCCAAAC 

GCAGTAATGG 

TCCATAACAG 

GGGACCAAAT 

AAATTGGATG 

ACAGTAATCA 

CAACCGGAAG 

GGTACACTAC 

TCTTTGCGAG 

ATAGAATCTT 

AATTTTGAGA 

TTTAGCTGGG 

TGCTCCTTGT 

ATTACCCCCA 

AATTGGGCAG 

TTCGTCATAT 

ACCATCAACT 

ATATATTTCT 

CCTGTTACCG 

TATCCTAACG 



TGCTCTCGTG 
CCTGTCTGGC 
CTGTTCCCAA 
CATCAGGCTC 
GTTCCGGTAT 
GGGTCAAGTA 
TATCGACAAC 
GGAAACCGAC 
ATATTGCATG 
ATCTCACTGT 
ATATTGGTCA 
GAAAGGGGAA 
TGGCACAAAT 
ACAATCCCTT 
CGACCGTATG 
ATGGACCTGT 
GACACTA'JTT 
CGGCTTCTTA 
AGGTTGAAGG 
CGGAACATTA 
TGTTCGAAGA 
TACCTGAAGG 
TGAAGTTGGA 
ATTTCGTTGT 
GCTACGAACC 



GGCAGCGATC 

TGCTCCGGCT 

TGGCTGGCTT 

TTTCTCTGTA 

CTCAACAGCG 

CTGGGTATGC 

GGGGACTGCC 

TCCTCTTGTA 

GCGACATTAC 

GTATAGGTCG 

GAATGTGGGA 

TGAAGAGTTG 

AAAAGATGTC 

GCAAGTGGAG 

TGGCACGCTG 

TCCCAATGGT 

GAATGCATAC 

TGTTCCGGGT 

AGCCAAACGT 

CGCGGTGATG 

AACCATGACA 

GACTAAATAC 

CGATATCACT 

CTCGCTTATT 

CGATAAGACT 



6C 
120 
180 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
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GATGATAAAG ftCCCATTGCA GCTTGCCGGC 
GTTCACATAC AAGACCCGAC TGTTTTGGAG 
GATCAGGTGG AAGTGGAATA TTGTGTCACT 
TCGGTTTGCG ATAAGCTGAT TTATGATTCT 
5 GAGGCCGGAA GTATTCCTGA AGGCTGGTTG 
TGGGACTATT ATCCTTGGAC TATGTATGGA 
TCGTACTTAC CGATGATTGG CGTTTTAACT 
GAAGGAGCCA AGCTTGTCAA GTATTGGGTA 
CATTATGCTG TGATGGTTTC TACTACGGGA 

10 GAAGAGACAA TGACCGCTAA GGCTAACGGT 
GCAGGAACAA AATATATTGC CTGGCGGCAT 
TTGGATGACA TTACGGTTTA TCGTTCTACT 
GTTGTCTCGC TT ATT GAG AA TAACAAGGGT 
GAACCCGATA AGACTGATGA TAAAAAACCA 

15 AATGGCTCGC TCCTTGTTCA CATACAAGAC 
TATTCTTCAC GAGACGGTCA GGTGGAAATG 
AATATCGAGT CCCAATCGGT TTGCGATAACI 
ATTCAATCTG ATACAAGCTT GAAAATATAT 
GAGGGATTGA GTCGGAGCAA GTCGACAATC 

20 TTAAGGGAAG AGACTCATTC AGAGAAAACG 
GTCTACTTGA TTAAAGTAGT CGGTGGAAAT 
AGGCCG 



TACAATATCT ATGCAAACGG CTCGCTCCTT 1S60 

TATATOGATG AGACTTATTC TTCACGAGAC 1620 

GCCGTTTATA ACGACAATAT CGAGTCCCAA 1680 

CAATCGGACA TTATCTTATA TGAAGGCTTT 17 4 0 

TTGATTGATG CTGATGGCGA CAATGTTAAT 1800 

CATGACAGTG AGAAGTGTAT TGCATCCCCT 1860 

CCGGATAACT ATTTGGTTAC ACCCAGACTC 1920 

AGTGCGCAAG ATGCTGTTTA TTCGGCTGAC 1980 

ACTGCTGTTG AAGATTTTGT CCTCTTGTTC 2040 

GCATGGTATG AGCGAACTAT TACATTGCCT 2100 

TATGATTGCA CCGATATGTT TTTCTTGCTC 2160 

GAGACTGTTC CCGAGCCTGT TACTGATTTC 2220 

CGCCTGAAAT GGAATTATCC TAACGGCTAC 2280 

TTGCAGCTTA CCGGCTACAA CATCTATGCA 234 0 

CCGACTGTTT TGGAGTATAT CGATGAGACT 24 00 

GAATATTGTG TCACTGCCGT TTATAACGAC 24 60 

CTGAACTATA CTATCACATC CTTGGATAAT 2520 

CCTAATCCGG CATCGTATGT GGTAAGGATA 2580 

GAGTTGTATA AT GCGCTGGG AATTTGCATA 2640 

GAAATCGATG TTTCACGTCT CAATGACGGA 27 00 

AAAACAACAA CCGAAAAGGT AGAGATAAAG 27 60 

2766 



25 (2) UJFORIIATION FOR SEQ ID MO: 205 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 63 base pairs 
(B| TYPE: nucleic acid 
30 (C) ST HANDEDNESS : double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPH: DHA t genomic) 

35 (iii) HYPOTHETICAL: I JO 

(iv) ANTI -SENSE; NO 

(vil ORIGINAL SOURCE 
40 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

tlx) FEATURE: 

(A) UAHE/KEY: roiscjeature 

(B) LOCATION 1.-.2763 



45 



(xi) SEQUENCE: DESCRIPTION: SEQ ID NO: 205 



A'i'GAAAAGTA TTGTTTTTAG AGCATTTCTA ACGATTTTGC TCTCGTGGGC AG CGATCACG 60 

AATCCGACTG CTCAAGAGAT CT C AG G CAT G AATGCATCCT GTCTGGCTGC TCCGGCTCAA 120 

50 CCGGATACTA TCTTATATGA AAGTTTTGAG AATGGACCTG TTCCCAATGG CTGGCTTGAG 180 

ATAGATGCTG ATGCTGATGG TGCCACTTGG GGAAGCCCAT CAGGCTCTTT CTCTGTACCT 24 0 

TACGGACACA ATGGCCTTTG CACCTACTCC CATATACGTT CCGGTATCTC AACAGCGGGC 300 

AACTATCTGA TTACACCCAA TATAGAAGGA GCCAAACGGG TCAAGTACTG GGTATGCAAT 360 

CAGTATAGTA CCAATCCGGA ACATTACGCA GTAATGGTAT CGACAACGGG GACTGCCATT 420 

-55 GAAGACTTTG TTTTGTTGTT TGATGATTCC AT AACAG GG A AACCGACTCC TCTTGTATGG - 4 80 

CGTAGACGAA TCGTGGACTT ACCGGAAGGG ACCAAATATA TTGCATGGCG ACATTACAAA 54 0 

GTCACCGACT CACACACAGA ATTCTTGAAA TTGGATGATG TCACTGTGTA TAGGTCGATC ■ 600 

GAAGGGCCCG AACCTGCTAC CGACTTCACA GTAATCAATA TTGGTCAGAA TGTGGGACGA 660 
TTGACTTGGA ACTATCCGGA GGATTATCAA CCGGAAGGAA AGGGGAATGA AGAGTTGCAG . 720 

60 CTTAGCGGCT ACAACATCTA TGCGAACGGT ACACTACTGG CACAAATAAA AGATGTCTCC 780 

ATACTGGAGT ATGTGGACAG CACTTACTCT TTGCGAGACA ATCCCTTGCA AGTGGAGTAC 040 

TGCGTTACAG CCGTTTACGA TGAAAGCATA GAATCTTCGA CCGTATGTGG CACGCTGCAT 900 

TACGCCACGG ATGCCATCCT TTATGAAAAT TTTGAGAATG GACCTGTTCC CAATGGTTGG 960 

CTTGTGATAG ACGCTGATGG AGATGGATTT AGCTGGGGAC ACTATTTGAA TGCATACGAC 1020 

65 GCTTTTCCCG GCCATAATGG AGGCCATTGC TCCTTGTCGG CTTCTTATGT TCCGGGTATA 1080 

GGCCCGGTGA CTCCCGACAA CTATCTGATT ACCCCCAAGG TTGAAGGAGC CAAACGTGTC 114 0 

AAGTACTGGG TAAGCACGCA GGATGCCAAT TGGGCAGCGG AACATTACGC GGTGATGGCT. 1200 

TCGACAACGG GGACTGCTGT CGGAGATTTC GTCATATTGT TCGAAGAAAC CATGACAGCG 1260 

AAGCCGACCG GCGCATGGTA TGAAAGAACC ATCAACTTAC CTGAAGGGAC TAAATACATC 1320 

70 GCATGGCGGC ATTACAACTG T ACCGAT ATA TATTTCTTGA AGTTGGACGA TATCACTGTA 1300 

TTCGGGACTC CT G CAT C AG A GCCCGAACCT GTTACCGATT TCGTTGTCTC GCTTATTGAA 14 4 0 

AACAACAAGG GACGATTAAA GTGGAATTAT CCTAACGGCT ACGAACCCGA TAAGACTGAT 1500 

GATAAAGACC CATTGCAGCT TGCCGGCTAC AATATCTATG CAAACGGCTC GCTCCTTGTT 156C 

CACATACAAG ACCCGACTGT TTTGGAGTAT ATCGATGAGA CTTATTCTTC ACGAGACGAT 1620 

75 CAGGTGGAAG TGGAATATTG TGTCACTGCC GTTTATAACG ACAATATCGA GTCCCAATCG 1680 
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GTTTGCGATA 
GCCGG AAGTA 
GACTATTATC 
TACTTACCGA 
GGAGCCAAGC 
TATGCTGTGA 
GAGACAATGA 
GGAACAAAAT 
GATGACATTA 
GTCTCGCTTA 
CCCGATAAGA 
GGCTCGCTCC 
TCTTCACGAG 
ATCGAGTCCC 
CAATCTGATA 
GGATTGAGT C 
AGGGAAGAGA 
TACTTGATTA 
CCG 



AGCTGATTTA 
TTCCTGAAGG 
CTTGGACTAT 
TGATTGGCGT 
TTGTCAAGTA 
TGGTTTCTAC 
CCGCTAAGGC 
ATATTGCCTG 
CGGTTTATCG 
TTGAGAATAA 
CTGATGATAA 
TTGTTCACAT 
ACGGTCAGGT 
AATCGGTTTG 
CAAGCTTGAA 
GGAGCAAGTC 
CTCATTCAGA 
AAGTAGTCGG 



TGATTCTCAA 
CTGGTTGTTG 
GTATGGACAT 
TTTAACTCCG 
TTGGGTAAGT 
TACGGGAACT 
TAACGGTGCA 
GCGGCATTAT 
TTCTACTGAG 
CAAGGGTCGC 
AAAACCATTG 
ACAAGACCCG 
GGAAATGGAA 
CGATAAGCTG 
AATATATCCT 
GACAATCGAG 
GAAAACGGAA 
TGGAAATAAA 



TCGGACATTA 
ATTGATGCTG 
GACAGTGAGA 
GATAACTATT 
GCGCAAGATG 
GCTGTTGAAG 
TGGTATGAGC 
GATTGCACCG 
ACTGTTCCCG 
CTGAAATGGA 
CAGCTTACCG 
ACTGTTTTGG 
TATTGTGTCA 
AACTATACTA 
AATCCGGCAT 
TTGTATAATG 
ATCGATGTTT 
ACAACAACCG 



TCTTATATGA 
ATGGCGACAA 
AGTGTATTGC 
TGGTTACACC 
CTGTTTATTC 
ATTTTGTCCT 
GAACTATTAC 
ATATGTTTTT 
AGCCTGTTAC 
ATTATCCTAA 
GCTACAACAT 
AGTATATCGA 
CTGCCGTTTA 
TCACATCCTT 
CGTATGTGGT 
CGCl'GGGAAT 
CACGTCTCAA 
AAAAGGTAGA 



AGGCTTTGAG 
TGTTAATTGG 
ATCCCCTTCG 
CAGACTCGAA 
GGCTGAGCAT 
CTTGTTCGAA 
ATTGCCTGCA 
CTTGCTCTTG 
TGATTTCGTT 
CGGCTACGAA 
CTATGCAAAT 
TGAGACTTAT 
TAACGACAAT 
GGATAATATT 
AAGGATAGAG 
TTGCATATTA 
TGACGGAGTC 
GATAAAGAGG 



1740 
1800 
1860 
1920 
1 980 
20^0 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S80 
2640 
27 00 
2760 
2763 



25 



30 



35 



40 



45 



50 



55 



GO 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 206 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 9 base pairs 

(B) TYPF. : nucleic acid 
tC) STRAtlDEDNESS: double 
<U> TOPOLOGY: circular 

<ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

{ ix ) FEATURE : 

tA) HAME/KEY: misc_feature 
(B» LOCATION 1...1779 

(Xlj SEQUENCE DESCRIPTION: SEQ ID NO: 206 



AT G AACAGCA 
TCCGTTTGCG 
CACCTTACTA 
GCATACAAGG 
TTCGGATTTG 
ACGG AAGGTC 
CAGCGAATGG 
CTCTCCTATC 
GATAGACGAA 
ACCGGCTTGT 
GTGGGCAGCA 
AACGTCCGTG 
TTC1TCCGTT 
CCTCAACTGA 
GTTCGTACGG 
TCCGAAACCG 
GACTCGGACA 
GCCAAAGGGA 
TGGCGTGTGT 
CAAGAGGAGA 
ACCGCCCAAT 
GACTTCTCCA 
TCCGACTTGT 
CGCGCAGGTG 
TCCTATGGCA 
TTGAAGGCAG 
CAGCTGAACG 
GGTAATGTAC 
TATCTCAAAA 
CAGCCGTTCC 



TCATGAAATA 
GTCAAACCCC 
TCGTTAATGA 
CCATCGAACC 
TCCCTGAAGT 
ATATGAAGCA 
ATGCCGGCTA 
GTGGGATGAA 
TGATGGCAGG 
ATTATTCGAA 
TCCCTCAGCT 
TATACTTGGG 
CTATTCCCTA 
ACGTGACGAT 
GAGGATTGTT 
ACCGCAACCT 
ATATGCAATG 
GGTTGTTTTT 
ATGCGAAAGC 
TGCCCTACCT 
TAGGGGTGAA 
AGCTGACAGG 
ATCAGTATAA 
GTAAGCTGGA 
AGTGGAATTT 
AAGTAGGTGT 
GACGGTATCG 
ATCTTCTTAG 
TCGATAATAT 
ATTGTTTCGC 



TCAATTATAT 
ACGAAATACA 
CCAGACTGTG 
TCGATTAAAA 
TTCCTCTTCA 
CCGGGGGTAC 
TCGT CTGAT A 
ATCGGCTTTC 
AGTGGACTAC 
CCATTATTTC 
ATCGACACCT 
TGCAAAAAAT 
TCTGGGTACC 
GAGTAATGAG 
TTTTGCCAAA 
GTATTATGTG 
GAACATACAG 
CTGGCCTAAA 
CTTCGGCGGT 
GATGCCCAAT 
GGGGAATATA 
TGTGCCTTTC 
TGTGAGTTTC 
ATACTCTTAT 
GGATGGAGGA 
TCATCCCATT 
Gl'ATTCTTTC 
TGCGGATGTT 
GCTGGCGGAA 
CGGTTTTAGC 



ACGGCCGTCA 
GAAACCAAAC 
GAGATGGAGC 
CCTTTCCGTC 
GGCAGGAACA 
CTGAATATCG 
GATGCAGAGC 
AATACCGGTG 
GAGCAGCGCA 
A AT AACTACG 
GTTACTCCTC 
GATGTGATCG 
GATCCGATGA 
TTGTCCGATG 
AACAGCGAAA 
GAGGGCGCGC 
GCCGGAGTAG 
CTGGATGCTT 
GTGATTCGAA 
ACGATTGTAC 
GCCGATGTGG 
TAT ACT CCG A 
TTG CCG AT AT 
CGCGATATGC 
CTTGTCGCCT 
GCCCCATTGG 
GGCTCGGCTG 
TCATACAAGC 
ACGACAGAGC 
TGGACTTTC 



TAATGGCTCT 
GCCCCGACAC 
ATGCCGATCC 
CGGAATATAA 
ATCTTCCGAA 
GTATCGGCCA 
AGGAGAGACT 
ACTTCGACGG 
GGCCTTCCTT 
GACGGGGAGC 
AGATGGACAA 
ATGCCAGGAT 
AGGCTCTCAC 
ATATTAAGCT 
TGATTCAAAC 
CCACAATCGG 
GGATTTCTTC 
CGCTTAGTAT 
ATGGTCTCGC 
TCCCTTCGCG 
TACGTATGGA 
CTCTACCCTT 
ATGCCGACGG 
TCCGCTTTCT 
CCATGCAGCC 
ATGTCAGACT 
GCTCGGAAGC 
TGAAAAA'jAA 
TTATCGGTTA 



CTCTGTATCA 
GCTGCGCAGG 
GCTTCCGGCT 
CAAGCGTACA 
TATCCTGCCG 
TACGCTAAAC 
GAATCTTTTC 
CGACAGAAAG 
TGTGCTTGCT 
TACCACCAAT 
CGGGACCCAC 
CGACTATCGT 
AGAACATACG 
CGGTGT CGAA 
GGGCGTTCTG 
ATTTGTCGGA 
CCATTT CGG A 
CTTCCCTTCA 
CGATGTTATG 
CAATGCTTTG 
GGTTTATGGC 
ATATAATCCA 
CAGCCGCTGG 
GGTAGACGCA 
CGATCTTATA 
CCGGTATACA 
CTTGGGTATC 
CTTGAGCCTT 
TTATCCTATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1740 
1775 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



:1NB! nib 



WO 99/29870 PCT/AU98/01023 

142/490 
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(2) III FORI 1ATXON FOR SEQ ID MO: 207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1767 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

Ui I MOLECULE TYPE: DIIA {genomic:) 
(iii) HYPOTHETICAL: NO 
(ivl AWT I -SENSE : MO 

(vi> ORIGIMAL SOURCE: 

(A) ORGANISM: PORY PHY ROt ION AS GINGIVALIS 

<ix) FEATURE : 

fA) NAME /KEY ; misc_f eaturfe 

( B ) LOCATION 1...17 67 

<;U) SEQUENCE DESCRIPTION: SEQ ID NO: 207 



ATGAAATATC 

CAAACCCCAC 

GTTAATGACC 

ATCGAACCTC 

CCTGAAGTTT 

ATGAAGCACC 

GCCGGCTATC 

GGGATGAAAT 

ATGGCAGGAG 

TATTCGAACC 

CCTCAGCTAT 

TACTTGGGTG 

ATTCCCTATC 

GTGACGATGA 

GGATTGTTTT 

CGCAACCTGT 

ATGCAATGGA 

TTGTTTTTCT 

GCGAAAGCCT 

CCCTACCTGA 

GGGGTGAAGG 

CTGACAGGTG 

CAGTATAATG 

AAGCTGGAAT 

TGGAATTTGG 

GTAGGTGTTC 

CGGTATCGGT 

CTTCTTAGTG 

GATAATATGC 

TGTTTCGCCG 



AATTATATAC 

GAAATACAGA 

AGACTGTGGA 

GATTAAAACC 

CCTCTTCAGG 

GGGGGTACCT 

GTCTGATAGA 

CGGCTTTCAA 

TGGACTACGA 

ATTATTTCAA 

CGACACCTGT 

CAAAAAATGA 

TGGGTACCGA 

GTAATGACTT 

TTGCCAAAAA 

ATTATGTGGA 

ACATACAGGC 

GGCCTAAACT 

TCGGCGGTGT 

TGCCCAATAC 

GGAATATAGC 

TGCCTTTCTA 

TGAGTTTCTT 

ACTCTTATCG 

ATGGAGGACT 

ATCCCATTGC 

ATTCTTTCGG 

CGGATGTTTC 

TGGCGGAAAC 

GTTTTAGCTG 



GGCCGTCATA 
AACCAAACGC 
GAT GG AG CAT 
TTTCCGTCCG 
CAGGAACAAT 
GAATATCGGT 
TGCAGAGCAG 
TACCGGTGAC 
GC AG CGCAGG 
TAACTACGGA 
TACTCCTCAG 
TGTGATCGAT 
TCCGATGAAG 
GTCCGATGAT 
CAGCGAAATG 
GGGCGCGCCC 
CGGAGTAGGG 
GGATGCTTCG 
GATTCGAAAT 
GATTGTACTC 
CGATGTGGTA 
TACTCCGACT 
GCCGATATAT 
CGATATGCTC 
TGTCGCCTCC 
CCCATTGGAT 
CTCGGCTGGC 
ATACAAGCTG 
GACAGAGCTT 
GACTTTC 



ATGGCTCTCT 

CCCGACAC3C 

GCGGATCCGC 

GAATATAACA 

CTTCCGAATA 

ATCGGCCATA 

GAGAGACTGA 

TVCGACGGCG 

CCTTCCTTTG 

CGGGGAGCTA 

ATGGACAACG 

GCCAGGATCG 

GCTCTCACAG 

ATTAAGCTCG 

ATTCAAACGG 

ACAATCGGAT 

ATTTCTTCCC 

CTTAGTATCT 

GGTCTCGCCG 

CCTTCGCGCA 

CGTATGGAGG 

CTACCCTTAT 

GCCGACGGCA 

CGCTTTCTGG 

ATGCAGCCCG 

GTCAGACTCC 

TCGGAAGCCT 

AAAAAGAACT 

ATCGGTTATT 



CTGTATCATC 

TGCGCAGGGA 

TTCCGGCTGC 

AGCGTACATT 

TCCTGCCGAC 

CGCTAAACCA 

ATCTT TTCCT 

ACAGAAAGGA 

TGCTTGCTAC 

CCACCAATGT 

GGACCCACAA 

ACTATCGTTT 

AACATACGCC 

GTGTCGAAGT 

GCGTTCTGTC 

TTGTCGGAGA 

ATTTCGGAGC 

TCCCTTCATG 

ATGTTATGCA 

ATGCTTTGAC 

TTTATGGCGA 

ATAATCCATC 

GCCGCTGGCG 

TAGACGCATC 

ATCTTATATT 

GGTATACACA 

TGGGTATCGG 

TGAGCCTTTA 

ATCCTATGCA 



CGTTTGCGGT 
GCTTACTATC 
ATACAAGGCC 
CGGATTTGTC 
GGAAGGTCAT 
GCGAATGGAT 
CTCCTATCGT 
TAGACGAATG 
CGGCTTGTAT 
GGGCAGCATC 
CGTCCGTGTA 
CTTCCGTTCT 
TGAACTGAAC 
TCGTACGGGA 
CGAAACCGAC 
CTCGGACAAT 
CAAAGGGAGG 
GCGT GTGTAT 
AGAGGAGATG 
CGCCCAATTA 
CTTCTCCAAG 
CGACTTGTAT 
CGCAGGTGGT 
CTATGGCAAG 
GAAGGCAGAA 
GCTGAACGGA 
TAATGTACAT 
TCTCAAAATC 
GCCGTTCCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
17 67 



60 



65 



70 



75 



(2) II 1 FORI tAT I ON FOR SEQ ID NO: 208 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

Hi) IIOLECULE TYPE; OHA (genomic) 

(iii) HYPOTHETICAL: NO 

(lv) ANTI-SENSE: NO 

t*. 1) ORIGIMAL SOURCE: 

(A) ORGANISM: PORYPHYROtlONAS GINGIVALIS 

tix) FEATURE: 

(A) NAME/KEY: misc_feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



it - , ^, , 



ii- o Os 



WO 99/29870 



PCT/AU98/01023 



143/490 



<B ) LOCATION 1...1038 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 



10 



15 



20 



ATGATGGAAA 
CATTTCGGTA 
AAGCAGGTGT 
AAAGCTATCA 
GGATATGAAT 
ATGGGCAATG 
CGTTTCCTGA 
TTTAGTGCTT 
CGCGGTGGAG 
CTTGGTGTGG 
CTGTTCAAGA 
TGGGATTTCC 
ACGTTGTTCA 
ATGCAAAAGT 
TGGGTCGGGC 
AAATGGGGAG 
TCTGCTGCCA 
GACGATAAGA 



AATSTATCTl' 
TCATTTT GGG 
TTCATTTTCT 
CCATCGTAGA 
CCGGTGGCCG 
CCTGTTATGC 
ACTACGGGTC 
CGGATATAGC 
T C AGCCT AAA 
ATGTCGGTAT 
ACGTAGGGGC 
AGCTCGGCTT 
ATCTGAATCC 
TCCTCCGACA 
TGGGATATAC 
GTCTTTCGGC 
CCTATCATCC 
GCATCTTC 



TGCTCACTAT 
ATTTCTTTCT 
GAACCTTCCG 
CGACAATCCC 
CGCCTTTCTT 
CTCGTCCGTC 
TATGCAAGGA 
TGTACAAGGA 
AGCATTGTAT 
CAGTTATTAC 
GCAACTGAAA 
TTCCCGCAGT 
GCACTATTTC 
CTTCTCGATA 
GCCACAGATT 
CGGCGTCGGT 
TGCAGCTCTT 



CCACATAACC 
CTTGTGTTTT 
GCTACTGCAC 
GGACTGGCTT 
TCCTATTTAT 
GGAGAGCGTG 
TACGATCAGA 
TTTTACAGCC 
TCTTCTATCG 
GACGATGACA 
GGCTATAATG 
TTTATCAATG 
AAGCGTCTTG 
GGAGCAGAAT 
GCACAGGATT 
TTCACTTCAG 
TCGTTCATGT 



TTGTGTTCAT 
CGGCAGGTGC 
AGGCTTTGGC 
TTGAGAATCC 
ATT AT AT GAG 
GCATGTGGGG 
ATGCGATTGC 
ATGAACTGAG 
AGACGTATAG 
AAGGATATTC 
AAGAACGGGA 
CTCCGTTTCG 
TACCACGCGA 
TTACTCCTTC 
TCGAGGTGGA 
GTGTAGTACG 
GTTCGGTAGG 



GATTCGCAAG 
TCAACAAGAG 
TGCCGGAGGC 
GGCTCTGCTC 
TGGTTCGCAT 
TGTTGGCATG 
CACCGGCTCT 
CAACCACTTC 
TTCCTTTGGC 
CCCTTCCGCT 
ACCGCTCGAT 
CTTGCACATC 
TCTGTCCAAG 
CGAGAGGTTT 
AGGCGGCAAC 
TGTAGGCGTA 
TATCCGTTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1033 



25 



(2) IIIFORI1ATION FOR SEQ ID HO: 209 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1035 base pales 
(D) TYPE: nucleic acid 
30 (C) STRAHDEDNESS : double 

(D> TOPOLOGY: circular 

(Ii) HOLECULE TYPE : DMA (genomic) 

35 Uii) HYPOTHETICAL: NO 



40 



45 



50 



55 



60 



65 



(iv) AMTI —SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GI1IGIVALIS 

I ix ) FEATURE : 

(A] IIAHE/KLY: misc_featurc 

(B) LOCATION 1...103& 

<xl} SEQUENCE DESCRIPTION: SEQ ID NO: 209 



ATGGAAAAAT 
TTCGGTATCA 
CAGGTGTTTC 
GCTATCACCA 
TATGAATCCG 
GGCAATGCCT 
TTCCTGAACT 
AGTGCTTCGG 
GGTGGAGTCA 
GGTGTGGATG 
TTCAAGAACG 
GATTTCCAGC 
TTGTTCAATC 
CAAAAGTTCC 
GTCGGGCTGG 
TGGGGAGGTC 
GCTGCCACCT 
GATAAGAGCA 



GTATCTTTGC 
TTTTGGGATT 
ATTTTCTGAA 
TCGTAGACGA 
GTGGCCGCGC 
GTTATGCCTC 
ACGGGTCTAT 
ATATAGCTGT 
GCCTAAAAGC 
TCGGTATCAG 
TAGGGGCGCA 
TCGGCTTTTC 
TGAATCCCCA 
TCCGACACTT 
GATATACGCC 
TTTCGGCCGG 
AT CAT CCTGC 
TCTTC 



TC ACT AT CCA 
TCTTTCTCTT 
CCTTCCGGCT 
CAATCCCGGA 
CTTTCTTTCC 
GTCCGTCGGA 
GCAAGGATAC 
ACAAGGATTT 
ATTGTATTCT 
TTATTACGAC 
ACTGAAAGGC 
CCGCAGTTTT 
CTATTTCAAG 
CTCGATAGGA 
ACAGATTGCA 
CGTCGGTTTC 
AGCTCTTTCG 



CATAACCTTG 
GTGTTTTCGG 
ACTGCACAGG 
CTGGCTTTTG 
TATTTATATT 
GAGCGTGGCA 
GATCAGAATG 
TACAGCCATG 
TCTATCGAGA 
GATGACAAAG 
TATAATGAAG 
ATCAATGCTC 
CGTCTTGTAC 
GCAGAATTTA 
CACGATTTCG 
ACTTCAGGTG 
TTCATGTGTT 



TGTTCAT GAT 
CAGGTGCTCA 
CTTTGGCTGC 
AGAAT CCGGC 
ATATGAGTGG 
TGTGGGGTGT 
CGATTGCCAC 
AACTGAGCAA 
CGTATAGTTC 
GATATTCCGC 
AACGGGAACC 
CGTTTCGCTT 
CACGCGATCT 
CTCCTTCCGA 
AGGTGGAAGG 
TAGTACGTGT 
CGGTAGGTAT 



TCGCAAGCAT 
ACAAGAGAAG 
CGGAGGCAAA 
TCTGCTCGGA 
TTCGCATATG 
TGGCATGCGT 
CGGCTCTTTT 
CCACTTCCGC 
CTTTGGCCTT 
TTCCGCTCTG 
GCTCGATTG3 
GCACATCACG 
GTCCAAGATG 
GAGGTTTTGG 
CGGCAACAAA 
AGGCGTATCT 
CCGTTTGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1035 



70 



75 



(2> INFORMATION EX>R SEQ ID WO: 210 

111 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 
(D) TYPE: nucleic acid 
(C) STRAMDEDNESS: double 
(DJ TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



;;;;i> m 



g.ji'u^ ii:^, -ills iu*ru,lr 



WO 99/29870 PCT/AU98/01 023 

144/490 

Mi) MOLECULE TYPE: DNA (genomic) 
till) HYPOTHETICAL: NO 

5 <Jv) ANTI-SENSE; NO ' . " ' 1 \ ;v 5" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIIONAS GINGIVALIS 
10 (ix) FF.ATURE: 

(A) NAf IE/ KEY : misc feature 
IB) LOCATION 1...990 



15 



20 



35 



50 



55 



60 



70 



(2) INFORMATION FOR SEQ ID NO:211 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 base pairs 
An lB) TYP^i nucl-ic acid 

™ (£> STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

45 (ilii HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 
(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(O) LOCATION 1...97 2 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO:211 

^CG^CG^ T^S TCATCCGAGT TGCTCGAGCA GGAGCTGACT 

X«,w!,?« TTCTGCGGAT GACAAACAAA GCCGGCAATG AGATCTATGT TTTTarcrrr 

SS §T^S S^ OTA ^CGA^C «" SEES 

TACA^A^ TGATCGTATG SS^I ATAGACGAGT TCGACACCAT GCCCGGGAGC 

at^a w ??rc^ 

65 ?^c™a ^" G " C ga ™ tctcc ~ 

cTfir^il^ IHIS CA gt ^ccaatcg acacggatgg gcacaaaggc catttttgtg 

TtcS^A I^S^ ACGG TATCG "»GCA CTCACTGTAG TCAATCCAGA GGCACTctI? 

SSt?S^ " acaaagac tatgatcggc gagctcgcaa tctgS^SS 

APaTHETI^ Sr*****" 1 CTCCGATCCG GAAGGCTTGG TCAAGCCTAT TCATC^A 
iS^*^ ^^^GA GGACGAAGCC TTGTTCTCCT CATCCGACTT TGACAcSS? 

™?S ag ctgggtatca ^atcc^c E5S£S£ 

GGAGA™ iSnSSS ^tl^ 2T™2« ^^GAA TGAGTCTTTC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:Z10 

ATGATTCGCA AGCATTTCGG TATCATTTTG GGATTTCTTT CTCTTGTGTT TTCGGCArrr 

GTGAACCTTC S£SS£ AC^CT^C i^ 

cSgSS?p ££55SS t caccatcgta gacgacaatc ccggactggc ttttgagaat 

^ES^?ISI' jC TCGGATATGA ATCCGGTGGC CGCGCCTTTC TTTCCTATTT ATATTATATr 
TGCG^CC? EES"" GCCTC <™ TCGGaS £££££ 
SAnAA^™ TGCGTTTCCT GAACTACGGG TCTATGCAAG GATACGATCA GAATG-GATT 
Sc^S SSSSSS^ f™"™ GATTTTA^ SS£££ 



120 
180 
240 
300 
360 
420 
480 



ACCAA'-CA-T TrrrrrrTrJ- irZZZZ gattttaCAG CCATGAACTG 

TCOoCGGTUrG AGTCAGCCTA AAAGCATTGT ATTCTTCTAT CGAGA^RTAT 

SSSS rrrrecc™, .mmcn toctcct „ 



60O 
660 
7?0 



ssssss s^sss ssss; sssss ksessj 
30 ====== a 

= iSS = = = a 

990 



75 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



WO 99/29870 



PCT/AU98/01023 



145/ 490 



[2> INFORMATION FOR SEQ TD HO: 212 



10 



15 



20 



25 



30 
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50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1641 base pairs 

(B) TYPE: nucleic acid 
tC) STRAW DEDNESS: double 
(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

(ivl ANT I -SENSE : NO 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOI IAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : ro±sc_ feature 

(B) LOCATION 1...1641 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:212 



ATGAAAACCA 

GCTGTGCACT 

ATCGTCCTGT 

AGGGGGGCCG 

GGCTCGGCCT 

TCGCTCTCGC 

AGTGCTT CCT 

GAAGCCTACT 

TATCGCCTTG 

TCATACAGGG 

GCATTGGAGC 

TCGGCTGCGT 

CAGGACAAAT 

TGGTTCGGTA 

ACCCGTAGGG 

AGGTCGTCCA 

CATCTGTCGG 

CGGATATACG 

GCCATTCGCC 

ATTCGCACGG 

GATGAGACGT 

GGTATAGCCT 

TATCGAATGG 

TATCAGCTGG 

TCTCTGTACG 

TATGGCGACC 

TCACATATCC 

TGCAATATCT 



TTGTAAGATA 
TGTCTGTCGA 
CTCCCACCGA 
ATCTTTTCCC 
AT GGCGAT AT 
TCGAATCGGG 
ACTCACGTGG 
AT CCCTATTT 
CC GG CT ACT A 
GCGAAGTTGC 
TTTCTTGTGC 
ATCTCTATCA 
TCTTCGTCAG 
TCTCCAGAAT 
GCGATGCCAT 
TCAATCTCTT 
ACTTCGATTT 
AAGACTACAA 
GCT GGT AT AT 
ATAGAGGTTG 
ATCGCAAGCA 
ATGACCATGC 
TGCTGACGCA 
CCTATTTGCC 
TCTCGATTCC 
TGATGAAAAG 
TGTCCGATCC 
CCTACCTCTT 



CAGCCGCCTT 
GGCGCAGATG 
AAGGCAATAC 
GTT GAGCGAT 
TGCGGGCGAC 
TGGTCGCATC 
CATGCACAAA 
GGTGTCCGAT 
TTCTTTTCGC 
TTATCGGCTG 
TACCTCT TTG 
TAGACAACAC 
CTACGGTTTC 
GAACTACGTC 
CGGTCTCGAC 
TGCTTTGCTT 
TTCATTTTCA 
GCCGGATGAT 
CCTCAATGAG 
TGCCCTGAGA 
TGGACATCAT 
CGGATCCAAA 
TTCGTATAAG 
CTATGCCTAT 
GATGCAGAAT 
AAAGGACGGT 
GCAAGCCGAA 
C 



CCGGTCGCTC 

CrCAATACTC 

AGGGAGATTT 

AAGCTGCGCG 

TATCTT COGT 

AGTGTCCGTA 

CGCATCGGCT 

TCGACCGGCG 

GCCGGCCGCT 

ACCGATCCGC 

ACGCTGCCTC 

CTCACACAGT 

GGTCAGGTGG 

AACGGATGGA 

TACAGCGGCT 

TACAATCGCC 

GCCGACTATG 

AATT AT CAT A 

TTTTCTGCCC 

GTGAGTGCCG 

ACCATGAGCG 

TTGGATTTTG 

ATTCGTACCA 

CGTAATAGAG 

ACCCACCGCC 

ATAGCCTATG 

CGAACGTCCG 



TCTTCTTTTG 
CTTTCGAGCT 
GTGTGCAAAC 
ATTCGGCCTA 
ACAACGGCAA 
ACTATGGCAC 
GGAATGCTCT 
GAGACTATCA 
TGCCCCTCGG 
GTACGACCAA 
GAGAGAACAG 
ACAACTGGCG 
ATGTCAGCAA 
AGCTTAGCTC 
ACTTCCTCGA 
TGCGACTCTA 
CTCTGCGCCA 
TCTACGACCT 
AAGCCCAAGC 
GTAGTGATTT 
GAATGCTACG 
GACTTTCGCT 
TCCAGAAAGA 
AAGGCGTGGA 
TGATGACAGA 
GCAAAACGCC 
GCCAT ACCAT 



CCTTTTGGGA 
GTCGGATCAG 
GAAAGAAAAA 
TGTTCGTTTC 
TAACTACTCC 
ATTGCAGGGC 
GCGCAACGCC 
TTTCGAAGAC 
TATAGGCTTC 
TACGACCGGT 
GCTATCGCTT 
TCCCGGGCAG 
CAGCCCTATC 
CCGTCTGGAT 
TACCGAa GAG 
TGGTAGCTGG 
AGGGATAGAG 
CCGTATCTTG 
CTCCTACCGT 
CTACGG CT AT 
TCCTTTTGCC 
TTCGGCTGCT 
GCAGCTCGAC 
GGTGCGTTCC 
GCTGCGGTTG 
CGGTGTCATC 
CGGGGCTATC 



60 
120 
180 
240 
300 
360 
420 
400 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1641 



55 



60 



65 



70 



75 



(2) I H FORMAT I ON FOR SEQ ID NO: 213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2250 base pairs 
(R) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLF.CULC TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

tiv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIIIGIVALIS 

Ux) FEATURE: 

(A) NAME /KEY: taisc_f ea ture 

(E) LOCATION 1...2250 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



146/490 



(Xil SEQUEI1CE. DESCRIPTION: S] 

ATGAAAAAAC TTCACATGAT TGCCGCCTTA 
GCACCCGTCT CCAACAGCGA GATAGATAGT 
5 GTCGTAGCTA CTCGCGCCAC GGCGAAAACC 
GAACTTTCCA AGTCCAATTA TGGTCGTGAC 
GTGGTAGCCA CCAGCGATGC CGGTACGGGT 
ACCGATGCCA ATCGCATCAA CATAACTACC 
CAGTCCGTCT TTTGGGTGAA TAT GCCCGAC 

10 CAGCGAGGTG TGGGTACTTC CACCAATGGT 
CGTACGGATA ATTTGGGACT GGCTCCTTAT 
GGCACATTCC GCCGATCGGT CAAACTCGGT 
GATGCCCGCC TGTCCAAAAT CGGTTCGGAC 
AAATCCTATT TCGCACAGGT GGGCTATTTC 

15 TTCGGAGGAA AAGAAGT T AC GGGTATCGCA 
AAATATGGCC GCCGATACAA CAGTGCCGCT 
CACTACTACC ACAATACCGA CAAT TACGAG 
AGCTTCTCTC CTTCCGTTAT CCTCAACCTC 
ACGGACGAAT ATCGTACCGG ACGTAAACTA 

20 AACAGTGTGA CCGTGAAGAA AACGGATCTC 
GGAGGACTCA TCGGTTCGCT TAACTGGCAC 
TCGGGCAATA TCTATAAAGG AGACCACTTC 
CAGCCCTTAG CTCCCGACTT CGAATATTAT 
GCCTTTGCCA AAGCCAACTG GCAGATCACT 

25 TATCGTACCA TCGGCTACAC GATAAACGGC 

AG r ATGCAGC acatcgattt ggacaagacc 

ACCTATAGTT TCGACGATGC TCATACTGCC 
CCTAACAGAA CCAATTACAC CGAAGCCGGA 
ATCGACTATG AGCTGGGCTA CCGCTATGCT 

30 TATTATATGC AATACAAGGA CCAACTCGTG 
ATGCTCACAA GCAACGTCCC CGACAGCTAC 
CAGATCCTTC CTCGTTTGCT GCGTTGGGAT 
GACCGCTACG TACAATATAC ATCCGTATAT 
GAGGAGACCC TCGAAAGCAC GGATATAGCC 

35 CTTACCCTCT CTCATGCCGG TTTCGAAATG 
TATCTGGACA ATACACAGCG CAGCGATCGC 
CGCCTCGGCT ATGTGCTGCC GGTTCACTTC 
AATAATCTCT TCAACCTCAT GTATGCGTCC 
CAGGCATCCG GAGAACTAAG TGCATATGCC 

40 AATGCACTGG GTAGTCTGAC AATCGATTTC 



iQ ID HO: 213 

GCCGTCCTGC CTTTCTGCCT GACGGCACAA 60 

CTTAGCAATG TGCAGCTCCA GACCGTACAG 120 

CCTGTCGCTT ACACCAACGT TCGCAAGGCC 180 

ATCCCCTATC TGCTGATGCT GACTCCCTCC 24 0 

ATCGGATATT CCGGCTTTCG CGTGCGTGGC 300 

AATGGAGTAC CCCTCAACGA CTCCGAATCT 360 

TTCGCCTCTT CCATCGAAGA CCTTCAGGTG 420 

GCCGGAGCTT TTGGGGCAAG TGTCAATATG 4 80 

GGCCGTGTCG ATTTGAGCGG AGGTTCGTTC 540 

AGCGGACGCA TCGGTCGCCA TTGGGCAGTG 600 

GGCTACGTGG ATAGAGGAAG CGTGGATCTG 660 

GGTAGCAACA CGGCTCTCAG GTTCATCACT 7 20 

TGGAACGGTC TTTCCAAGGA GGATGAAGCC 760 

CTTATGTACG TGGACGCGCA AGGAGTACCG 84 0 

CAGCGTCACT AC CAT GC CAT CATGACGCAC 900 

ACGGCACACT ACACGGCCGG ATATGGCTAT ?60 

AAGGAATATG CACTGCAGCC CTATGTGGAA 1020 

ATCCGTCAGA AGTATCT GG A CAATGACTTC 10BO 

ACCGGTGCAT GGGATTTGCA GTTCGGGGCC 1140 

GGCCGTATCA CTTACATCAA AAAGTACAAT 1200 

CGGAACAGGG CAGACAAAAG AGAAGGTGCA 1260 

COGGAACTGA ACATGTATGC CGACCTCCAG 1320 

ATCACGGACG AATATGATGA GGTACAGGGA 1380 

TTCCGCTTCC TCAATCCGAA GGCCGGTCTT 144 0 

TATGCTTCTG TTGCGGTAGC ACACCGCGAG 1500 

ATAGGACAGT ATCCTACGCC TGAGCGACTG 1560 

TCGCCCCTCT TGTCGGCCGG AGTAGGTCTC 1620 

CTGGATGGCC GTTTGAGCGA TGTGGGACAG 1680 

CGTATGGGAC TGGAGCTGAC TCTCGGTTGG 1740 

GCTTCTTTCA CTATGAGTCG CAACAAAATC 1800 

GATGCGGACT ACAACTGGCT CGAACTCAAG 1860 

TACTCGCCCA ATGTCATTGC CGGCAGCATG 1920 

GCTTGGACGA GCCGCTTCGT CAGCAAGCAA 1980 

ATGCTTTCCT CCTATTGGGT GAACGACCTC 204 0 

GTTAAGAGAG TGGCACTGGG CGTACAGCTC 2100 

AATGCCTACA TCTACGATGC CGGTTACGTA 2160 

GATCTGCGTT ATTATCCTCA GGCCGGATTT 2220 

2250 



(2) INFORMATION FOR SZQ ID NO: 21 4 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 02 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
tDV TOPOLOGY: circular 

50 

(11 J MOLECULE TYPE: DMA (genomic) 

(111) HYPOTHETICAL: NO 

t»5 (ivj ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: PORYPHYROH0NAS GINGIVALIS 

60 iix) FEATURE: 

(A | NAME/ KEY : mi.icjeature 
(B) LOCATION 1...14B2 



65 



<xl) SEQUENCE DESCRIPTION: SEQ ID NO: 214 



ATGAAAAGAA GGTTTCTATC GCTGTTACTG CTGTACATAC TCTCTTCCAT CAGCCTTTCT 60 

GCTCAGCGGT TTCCGATGGT GCAGGGAATC GAGTTGGATA CCGATTCGCT TTTCTCTCTG 120 

CCCAAGCGTC CTTGGCGCGC CATCGGTAAA ACGATAGGCG TCAATCTGGC CGTATGGGGC 1«0 

TTCGATCATT TCATCATGAA CGAGGACTTT GCAGACATCA GTTGGCAGAC TATCAAGAGC 24 0 

70 AATTTCCAAA CAGGCTTTGG CTGGGACAAT GACAAGTTTG TCACCAACCT CTTCGCACAT 300 

CCTTATCACG GATCGCTCTA TTT CAATGCA GCGAGGTCGA ACCGTTTGAG CTTCAGGCAC 360 

TCTGCTCCGT TTGCCTTCTT TGGCAGTCTC ATGTGGGAGC TGCTTATGGA AAACGAGCCA 420 

CC GAG TAT CA ACGACCTCTG TGCCACCACC ATAGGCGGTA TAGCTTTGGG GGAGATGGGG 4 80 

CACAGGCTGT CGGACCTGCT CATCGACAAT CGTACCACAG GGTGGGAACG TATGGGGCGC 540 

75 GAGGTGGCTA TCGCTCTGAT CAATCCGATG CGCTTTCTCA ACCGTCTGAC AGCAGGAGAG 600 
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10 



15 



GTGACTTCTG 
GTCGATGCCG 
CTGACCCTGA 
TACGATTTCT 
CAGAT CAATC 
TT'JGTGGGAG 
AATTCGGAGG 
G6TCTTATCT 
ACCTACCTGA 
CGGGACTATA 
GATCTGTGGA 
GAAGAGCCGC 
AAGGCGCGCC 
GCCATCGTCG 
TTCGATACCG 



TCGGGAGTCG 
GCTTTCGCTT 
ATCTGAGATT 
TCCAATTCAA 
TGATCGGAAT 
GTCTCTTTCA 
AGGTACTCGT 
TCCAGCACCA 
ATGTCGTCCC 
ACCTCGGCAG 
GCTGGCTCTT 
ACCAGAAAAA 
TACTGGTGAC 
CTCGCCGTTT 
GCGACATACA 



CAGCGGACAG 
TTTGGCAGAC 
CGACTACGGC 
AGCCGGATTG 
CCTAAGCGGA 
GCACTTCGAC 
CACCCCATAC 
CGGAAAATTT 
GATGGGAGCC 
CGGATTGAGC 
GGGAGTCGAA 
TACCGATGTC 
G AGTT CCGAG 
CATCCGCAAA 
GCTGCGTGTC 



ATATTTCAGT 
AAGCGGCATG 
GATCCATTCC 
AGTTTCTCCG 
TGCCAACTGC 
TACTACAATT 
CGTATCrCGC 
CGACGACGTC 
AGTCTGTCGG 
GGCAAGCTAT 
AGCTATCGGC 
AGCTCTTTTA 
TTCGGATTTC 
ACAGCCTATC 
GGATTTCACT 



CTGTCCCCAT 
CCCGAACCGG 
GAAGCGAGAC 
AATCGCAACC 
TCGCACACGA 
CGGAAAAACG 
AAGTGGCAGC 
CTCTGGAGCT 
ATCACTACAA 
ACCTTGGTGC 
TCTACACATG 
TGGTGCAGGG 
ATCCTGGCCC 
AATTCTACCC 
TC 



AAACATAGTC 
TGCCACGGCT 
TTTCTCTCCA 
TCTGCTGAGC 
ACGAACGGTT 
AATAAGCAAA 
TCTGGGAGGC 
ATATGCCGAG 
CGTGGACAAT 
TACGTACAAT 
GATCGGGTAT 
GGACGAAA3C 
CTGGCATGTA 
TAACGTATCA 



660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1482 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID MO: 215 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 0B2 bass pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

iii) MOLECULE TYPE: DNA (genomic) 

(Iii) HYPOTHETICAL: HO 

(iv) ANTT -SENSE: HO 

Ivi) ORIGINAL SOURCE: 

<A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

Uxl FEATURE : 

(A) NAME /KEY: mlsc_feature 

(B) LOCATION 1...882 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 



ATGAAACGAC 
CAAGAAGCAA 
GGACTAAATG 
GCAGGTAACC 
AACGGTTTGC 
GTAGACAAGA 
AGTGCGCTTT 
TTGACAGGAG 
GGTGCGGACT 
CTGACTGTAG 
GAAAAGACAA 
GAGAATGTCA 
AACATTGACA 
ACGATAGCCA 
CAGTTCAAAG 



TGATTGTTTT 
ACACTGCATC 
CCTCTCAGAC 
TCTATTTGAA 
GTACAGACTT 
TCGAACTCTT 
TCACTTTCCT 
TCAAGCATAT 
ATAAGCCCAA 
TAGCAGACGA 
TGTTCGAACT 
ATTTGATAAC 
TCAATTGGGA 
CCAATCTTAT 
AAGTTGTGGG 



TCTGGCAATG 
TGACACTCCC 
TTCTCTGACC 
CATAGATGCC 
CGGTCTGACC 
CACGAAGGCC 
CTCACAGTAT 
CTCTAATTTC 
TGAGAAGTTC 
CTACCTCTCA 
TGGTGCTTTG 
CAAGGCTTCA 
GGCTATGCTG 
CTACGACGAT 
CGTGGGTGTT 



GGTGGCTTGC 
AAAAAGGACT 
AACTGGGCTG 
AACTACCTGA 
TACACAACAG 
GGCTATGAGA 
GCCAAAGGAT 
TTCGCTCCTG 
TCTCTCTACC 
AGTTTGGGAG 
GTAGTGGGTT 
TTCTTCTCGG 
GCCATGAAGA 
GATGTGAAGA 
GCGTACACTT 



TGTTCACCCT 
GGACT AT AAA 
CCGGTGGAGA 
AAGATAAATG 
CCAACAAGTG 
TCGGCAAACA 
ATGAGAAGCC 
CATATCTCAC 
TCTCTCCTAC 
CCTTCGGGGT 
CGGCCAATAT 
CTTATACGCA 
TCAACAAGTT 
TCAACGATGG 
TC 



TGCGAACGCA 
AGGTGTGACC 
AAACACGGTG 
GAGTTGGGAC 
GAACAAAAGT 
TTGGTACGGA 
CTCGGATCAC 
TCTCGGTATT 
AACGGGCAAG 
GAAAGTTGGT 
AAATCTGATG 
CGACTTTGGC 
CCTCACGGCT 
CCCGAAAATC 



tZ) INFORMATION FOR SEQ ID NO: 216 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 612 base pairs 
t B ) TYPE: nucleic acid 
IC> STRANDEDNESS : double 
(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

Ivi) ORIGINAL SOURCE: 

(A) ORGAN I St 1 : PORYPHYROHOMAS GINGIVAL IS 

(Ik) FEATURE : 



60 
120 
1Q0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
882 
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(A) NAME/ KEY: misc_feature 
<B» LOCATION 1. . .612 

(xlj SEQUENCE DESCRIPTION: SEQ ID NO: 216 

ATGAAGAAAA TGATTTTGGC AGCTACTATG CTGCTCGCAA CAATCGGTTT TGCAAATGCT 
CAGAGTCGTC CTGCTCTTAG ACTGGATGCT AACTTTGTCG GTAGTAACTT AATGCAAAAA 
GTCGCAAACA CGAGCGTGAA CAATAAGATG ATCGTAGGCT TACGTGTTGG TGCTGCTGCT 
GAGTTCGCTC TTAGCAATGA TGGATTCTAT CTCGCCCCCG GATTGGCCTA TACGATGAGA 
GGTGCTAAGA TGGAATCACT AAGTGAAACG ACAACTCGCT TGCATTATCT GCAAATACCG 
GTGAATGCCG GT AT GAG ATT TAGCTTTGCT GACAACATGG CTATTTCATT GGAAGCAGGT 
CCCTATTTCG CATATGGTGT CGCCGGAACG ATTAAGACTA AAGTTGCAGG CGTTACGGCT 
TCTGTAGATG CCTTTGGTGA TAACGGATAT AACCGTTTCG ACTTGGGCTT GGGCTTGTCT 
GCTGCCTTGA GCTACGACCG TTATTACGTA CAAATTGGAT AT GAG CAT GG ATTGCTTAAT 
ATGTTGAAGG ATGCTCCGGA TAAGACTTCT TTGCGTAATC ATGACTTCTT TGTGGGTCTC 
GGTGTTCGCT TC 



(2) INFORMATION FOR SEQ ID NO: 21*7 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv> ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: PORY PH YROMON AS GINGIVALIS 

tlx) FEATURE : 

(A) NAME/ KEY: misc_f-ature 

(B) LOCATION 1...72 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21** 

ATGAAAAGGA TTTTTACTGT AGCCCTTGTG CTACTTGCTT CGGTCACTAT GGCCATCGGA 
CAAAGCCGCC CGGCACTTCG CGTAGATGCC AACTTCGTAG GCAGCAATCA GAGCATGAAA 
AGAGACGGAT ATGTGTGGGA CACCAAAATG AATGTCGGCC TGCGGGTCGG TGCCGCTGCC 
GAATTCATGA TCGGATCAAG AGGATTCTAC TTGGCTCCGG GTCTGAACTA TACGATGAAG 
GGCTCCAAAA CCGAATGGGA TATACCCGAA ATGGTTCCTG GTACCTATAT TACGATGGTT 
TCCACTCGCT TGCACTATCT GCAACTGCCG ATCAATGCCG GCATGCGGTT CGACCTGATG 
AATGACATGG CGGTTTCGAT CGAAGCGGGT CCTTTCCTTG CATACGGTAT ATATGGTACA 
TATCGGCAGA AGTTGGAAGG ATGGAAGCCG AACAACTACA GCACAGAGTT TTTTGGCCCA 
ACGCTTGGTG GCCCAACAAA TATCCGCTGG GACATCGGGG CAAACATAAT AGCCGCATTC 
CACTATAAGC GTTATTATAT ACAGATAGGC TATGAACATG GATTTGTGGA TATTGTGTCA 
GGTGGAGGTT CTGATATTCC CCGACTGAAC GACAATAGGC AATCCTCTTC GACGACCGCT 
CTAAGAGAAA AGGGAAATAA CGAATACGCT TATAATCGTG ACTTCTTCGT GGGCATAGGT 
TACCGCTTT 



(2) INFORMATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
( 0) TOPOLOGY: circular 

<li> MOLECULE TYPE; DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI— SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMON AS GINGIVALIS 

tlx) FEATURE: 

(A) NAME/ KEY : mlsc feature 
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|B) LOCATION 1. 



.621 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 



ATGAAAAGAA 
GCACAAGACG 
AACATCCCGG 
GAAGTGGGGC 
GCATCGAACG 
TCATTCTTCG 
CATCTGTTTT 
AATAAGCACA 
AATTCGAATA 
AACGTGGGGG 
GCCACGCTCT 



TGCTGCTGCT 
TTAT CAGACC 
GAGGATTCAC 
TGTCCATGTA 
AAGGAGACGG 
ATGCAGGCAG 
GGCAGAATAG 
ATATTCATTT 
CCTACATCGG 
CGGGTGCCGC 
CCACTCATTT 



TCTCGTTGTA 
ATGGTCATTG 
CTATGGTTTC 
CAACTCCACA 
ATCTTTTCAG 
TGCCAACTGC 
CCGGCACAAT 
CATCTATGGA 
TTACGGAGCA 
TGTAATGTAC 
T 



TTATTATATG 
CAGGTOGGAG 
TATTTGGGAA 
CGTCAAACAG 
GTAAATATGT 
TATATGATCG 
TTGTTTCTGG 
GACAAGGGAG 
CGTGTAGCCT 
GACCACGGCA 



GAATTGCAGG 
CGGGATACTC 
AGCGTATGGG 
CCAACAATGC 
CTTCTCCGAA 
TCGTCGGAGT 
CAGTACAAGC 
CCAAAGTCAG 
ACGAATATCA 
ATAAGATGCT 



CCGATTGGCT 
CGATACGGAG 
GAGCTTTCTG 
AGACTCCTTT 
TGAGAAGTGG 
CAATCCTCTC 
CGGCCTGTCC 
TATCTACACC 
AATTCATAAA 
TACGGCCATG 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
621 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) 1NFORIIATI0II FOR SEQ ID IIO:219 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2853 base pairs 
<BJ TYPE: nucleic acid 
(C) STRAHDEDtiF.SS: double 
<D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DMA {genomic) 

(Hi) HYPOTHETICAL: NO 

<iv> ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

{A> ORGANISM; PORYPHYROMONAS GINGIVALIS 

t ix J FEATURE : 

(A) NAME /KEY : miscf eature 

(B) LOCATION 1...2853 

(xi) SEQUENCE DESCRIPTION: SEQ 10 MO: 219 



ATGAGAGTAT 
TTGCTCTGTG 
CACCGTCGGC 
CCTATTCCTG 
CAAGATAGCG 
GATTCCATCG 
TATCAAAAGA 
TATACTCGCT 
GGAGAGCAGT 
ATCAGCGGAG 
ATGCCCGACA 
CATCCTCACT 
ACAGGTCCGG 
TATTTTCCCT 
AATCGCTATG 
GATTTGGCAT 
AAATATAAGA 
TCCGGCGACA 
ACACACAGTC 
GCCACCGGGA 
GCTACGACAC 
ATTACGGGT A 
CCGAAT CTTT 
CCGGAGCGAT 
TTGACAAAAG 
CGTCATTCCG 
GGGGTTAACT 
AAGAAAACAT 
CTGTCGGCAG 
TTCGGAGGCA 
ATGCCGGACT 
AACGGCAAGC 
TCCATGGGCA 
AAATCCAAAT 
TCTACATCC1' 



CCGATCTCTG 
CTACTTTGGT 
ATGCTGCGCT 
ACAGTGCTTT 
TGTATGACGA 
TTT TGCTCGG 
GTCGCTTGGA 
ATGTCCTCGA 
CGTTCGAAGC 
TGATCACGCA 
ATATCATGTT 
TCTATATCAA 
TCAAT CTGGT 
TTTCCAACAA 
GATTTTATTT 
TGCGTGGGGA 
AGAGGTATAA 
AATACGTGCC 
AGGATCCGAA 
GCTATTTCCA 
GAAGTTCGGC 
GCATGGATAT 
CGATTAATAT 
GGTACGAGAA 
AGAAAGATTT 
TACCGATCAG 
ACAATGAGTG 
TCCTGCCTTC 
GCTTATCTAC 
ATCTCATTAT 
TCACGAAACG 
TGCATACGCT 
ATGCAGGATC 
CGGATTCGAC 
ATAATATGTT 



TTCCAGACTT 
CGCTGCGGAA 
GTCTGATTCT 
TCGAGATTCC 
TGAATTCGAA 
ACAGAATCGT 
GGCAAACTTC 
TACGGCCGGT 
CAAGAACTTT 
GCAGGGCGAA 
TATGCAAGGA 
TCTTTCCAAG 
TATCG CCGAT 
ATACTCTTCC 
GAGGAATGGT 
GATCTTTTCC 
GTACAACGGC 
CGGAGACTAC 
GGCCAATCCT 
GAATTCGCTG 
CGTGAGCTAT 
CAGCCAGAAC 
GTCCACGCGT 
GTTGAGTGTG 
GCTCCAGAGC 
TTTGACTGTC 
GTGGTACACG 
GGACACGACC 
CACATT GTAC 
GATCCGTCAT 
CCGAT ATGGC 
GCTCTACTCT 
TGTCAATTTC 
AG GO ATCAAG 
TGCCGATTCG 



TCATGGTTAT 
CGTCCTATGG 
ACAGCGAAAG 
CTTCCTGCCG 
TTGGAAGATA 
GCCTATCTTT 
ATGTATCTCA 
TATCCGATGG 
ACCTACAACT 
GGCTATCTGA 
GGGCGTTAT A 
GCAAAGGTGC 
ATGCCGCTGC 
GGTATATTGA 
GGATATTATT 
AAAGGGTCAT 
TCGTTCGAAG 
AGCAAGACCA 
TTGCAAACGT 
AATACCACCT 
TCGCGCAAGT 
ATGCGCGATA 
TATCCTTTCA 
GGCTATTCCG 
AATCTCGTGC 
CCTTTGTTGG 
AAAGGCATAC 
TATAAATTCC 
GGTATGTTCA 
CGCTTCACGC 
TTTTGGGAGC 
CCTTATTTCG 
TCTTTTGACA 
AAGATCAGCC 
ATCCGATGGA 



TACCCGTAAT 
CCGGAGCAGT 
ACACGGTGCC 
ATTCCACCGG 
TAGTGGAGTA 
TCGGCAAGAG 
ATACCGACAG 
CCTTTCCTGT 
TCCGCACGGA 
CTGCCGGTAA 
CGACCTGCGA 
ATCCGGAGAA 
CGATAGGTCT 
TGCCCACGTA 
TTGCCTTCAG 
GGGGCATTTC 
CCAATTATCT 
CCAGTCTGAA 
TGTCGGCCAA 
ATGATGTCAA 
TTCCGGGTAC 
CGACGGTGAG 
ACCCGAAGAC 
GTCAGCTTCG 
GCGATTGGAA 
ATTATATCAA 
GGAAGTCGTG 
GCAGACTGTA 
AGCCTTGGAA 
CCACTGTCAG 
TTCTTGAGCA 
AGCAGATATT 
ACAACTTAGA 
TGATAGATCA 
GCAATATCTC 



CCTTGTCGGA 
CGGATTGCAC 
TCTCGCAAAA 
ATCGATGCGG 
CGAAGCTGCC 
CTATGTGAGC 
CAGTACGGTT 
TTTCAAGGAT 
GAAGGGGATT 
GACCAAGAAG 
CAATCACGAT 
AGACATCGTC 
TCCTTTCGGC 
CGGAGAGGAC 
CGACTATATC 
AGCCCAATCG 
GGTATCGAAG 
TATCCGATGG 
TGTCAATTTT 
TGCCCGTACT 
TCCTTTTTCG 
CCTTACCTTG 
CCGTGTAGGA 
CAATAGTATC 
GAATGGTATG 
TCTGACTATG 
GAATGAGGAT 
CGATTACAGT 
ACCTTTTTCC 
TTTCTCCTAT 
TACGGATCAG 
CGGTGCTCCC 
GGCCAAGATC 
GTTCACATGG 
GGCTTCGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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GCACTTCGCC 
ACGAAGTATT 
ATTTTTAACG 
CTCAACAAAG 
5 GATGAAAAGA 
GAGGGAGGAA 
GG AGCAGT GG 
TATAGTTGGA 
TACCGGGTAA 
10 TTCGGATTCA 
GTCACTCGCG 
AAGTCCTATA 
CAGAGCAATC 



15 



20 



5 



0 



5 



TCTCCAAGAG 
ATGAGGGAGA 
GCAAGGGATT 
AGTCGCTCAG 
AAAACACAGG 
GACCGCAAAA 
ATCAGGATGG 
ATATTGCTAC 
CGCAGAATCT 
ATGCGAACTA 
ACATGCACTG 
ATTTCGTCAT 
GTCCCATCAC 



CTTTACCTTG 
AGAT GGGAAG 
GGCACGCCTG 
CGG ATT GAT A 
GGCTACTCGT 
TGAAAGTGGG 
TTACTTCGCA 
CGACTACAAT 
GAGCTTTCGC 
CAATTTCGAC 
CTGGGCTATC 
ATCGGTGAAG 
GAATACTTGG 



CGCTTGTCCG 
ATCATTCCCT 
ATCAGTACGG 
GCTCTTTTCA 
CAT GAAGG AG 
GGGTCGCTCC 
TATTCGATCC 
AGGTACAATG 
GG CAAT AT CC 
TTGAAGAAAA 
TCGGCCAGTT 
AGTTCACTCT 
TAT 



GTCTGTTCGA 
ATAAGAGCAA 
GTACTTCTTT 
GTGGCAAAAA 
ACGATGCTGC 
TCGAGCGCAA 
GAT GG A GC CT 
TCAATAAGAT 
AGCCTACACC 
TAACATCGCT 
TCATCCCTAT 
TGCAGGATCT 



TCCCTATTTG 
CGACCTGCGC 
CAGCTATACG 
GGAGCGGAGA 
CGATATACTT 
CCGTCAGGGC 
GTCCTTCGAC 
GGAGCACTAC 
GAACTGGAGC 
TACCTGCAAC 
AGGAGCATAC 
GAAGTATCAG 



(2) INFORMATION FOR SEQ ID NO: 220 

U) SEQUENCE CHARACTERISTICS: 

(A> J.EliGTM: 3G7B b<* £ s pairs 
<B} TYPE: nucleic acid 
<C) STRAIJDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) AWT I -SENSE: NO 

(vl) ORIGINAL SOURCE : 

(A) ORGANISM: PORYPHY ROMONAS GINGIVALIS 
I ix ] FEATURE : 

(A) NAME/ KEY : mi£rc_feature 

(Bi LOCATION 1...367B 



txi) SEQUENCE DESCRIPTION: SCQ ID HO: 220 



ATGATGAAAC 
J TTTCAAATAA 
AAACATTCAA 
ATTGTATACC 
AGGCCTGCAA 
TCTCCCCCCG 
► GATCCTGAAA 
GTACCTGTCG 
GC C AG ATT AA 
GGGACGGCTA 
TTATCTGATT 
• ATTTGGGGAA 
ACTGCTGTGG 
GTTAAAAATG 
GATT TGGTT A 
ACACCTCTTA 
GAATCCAAAT 
GCATTGGTCT 
GCTATCAATG 
GACCTGGGAG 
CACTTCTTTA 
ATTAATATAG 
AAAAATCTGA 
CACTATGATG 
GAAGCTACCA 
CCGGTAAATA 
GCAAAATTGA 
CCGACGGTTA 
GGTCAACCCA 
CAGTCACAAG 
TTGGACAAAA 
ACTACTGCCG 
GGCCAAATCG 
CTCAATTATT 
TGGACAGGAG 
TTCGGTTCTC 
AACCTTATCA 



GATATACAAT 
AAGCTCGCCC 
TGGATTCTAA 
AATCTGTTTC 
ATCGTTTCCC 
TAGTTCCCGT 
ATCCCAATGC 
ATGTGGTGAT 
AGTCTGCCAT 
CAGAAGGGGT 
TTACCAAAGA 
CACATACCCA 
ATAAGCATAT 
TAACTACTGC 
TACAAGGAGC 
CCCCAAATTA 
TCGATTATAG 
ATGAACCGAG 
AGGCTCAGTT 
ATTTTGCCTT 
CGGCGACACC 
GTATACAGAG 
CGCAATCGGG 
TCTCTACTAA 
T AACTTAT CG 
CTACTTCTGC 
CCTATACCAA 
AGTTAGGTTA 
TACAGGCAAA 
ATTTCTTTTT 
CGACCGAAGC 
ATGGTAAACG 
GTATCAGTTG 
GGATGGGAGG 
CCCAAGTACC 
CGGCGGTAGC 
ATAATTCCGA 



AATTCTTGCA 
TTATGAAAGA 
ATTGGTGCCT 
TGAACATAGT 
TTCGCATAGG 
TGCCGTCGAC 
CTGGGATGTG 
GGTTATCGAC 
TGCATCGGGA 
GCGTATCGCT 
CACTGCTTTT 
GGGGGGGCTT 
CATATTGATG 
AGACTTCATT 
AATTAATTTC 
TCCAACTCAT 
TAATCTGAGT 
GTTTCCTCAT 
TGCGAAAAAC 
GGCCAACAAT 
GGCCAATTTA 
GGGGGAGGTG 
AGATGTTACT 
AAAACTGACA 
TATTTATCCC 
TATCGGCCCG 
TTCCAATGGC 
TGGTGTTATT 
TGGAACAGTT 
GCCCTCAGGT 
ATTACAGTAC 
TTATCGTTTT 
GAAAAAACCG 
AACAACAGAC 
GCTCACAGGA 
CGATTTGCAT 
CAAGGATTTA 



GTTTTTCTTT 
TTTGCAGATG 
GCAAATAAGG 
GACTTAGTTA 
AAGTCTTTTT 
AAGTATGCGG 
ACGCTAAAAA 
CAGTCI'TCGT 
CAGCGTTTTG 
CTTGTGAGTT 
CTCTGTCAAA 
AAAATGGCGA 
TCTGACGGGT 
GGCAAAACTG 
CCTACAAATT 
TCTTCTAAAG 
GCAAGGATTA 
CCCTATTATT 
TCTGGTTATA 
TCGTTGAAAC 
GCTGCAGOGT 
ACGGACTTTG 
CATTTGCTAA 
TGGACTACTG 
GATTTGGATT 
GATCTTGGTG 
GAACCGAATC 
AAGCGGCACT 
GTCAGTTCCC 
GGAGGTCATA 
TATTCCGTAC 
GTCGAAGTCC 
GCAGGAAACG 
CAACAGAGTG 
GAAGATGTAG 
GTCCCGACAA 
GTTGTTACCA 



TATTCTGCAC 
TAGAGAAGCC 
GTAACTTAAT 
TTTCACCTGT 
TTGCAGAAAA 
TACCGGTTGC 
T CACT ACT AA 
CAATGGGAGG 
TGAAAAAAAT 
ATGACCATGA 
AAATCCGGGC 
GAAACATTAT 
TAGCGACGGA 
GAAATGCGAA 
ATGTTTCCAA 
TTGGACGGAG 
CTTTTGATGG 
ATTATTTCCC 
CAATCCATAC 
TAACCGCTAC 
TTGATAATAT 
TAGCTCCTGG 
ATGTTTCAAA 
GTACTATCCT 
ATATACAGAA 
GATTCGATAC 
AGCAGTTAAT 
ATGTATTGGT 
TAAGCGAGGC 
TTGTTCCCAA 
CGCCGACTAA 
CAGGCTCCAC 
CTTACTTCGC 
AATGGGATGT 
AGTTTGCAAC 
CCAACCCCAA 
CAAGCAGTCA 



GGTATTTACC 
TTGGATTCAG 
TCAAGCl'GAA 
GAACGAAATA 
TCTACGGGCA 
CAATCCAATG 
AGCGGTAACA 
CCAAAACATT 
GTTGCCTAAG 
GCCTCATCGC 
TTTGACTCCT 
GGCCACTTCT 
GCAGTATCCT 
TGATCCCATT 
CAATCCATCT 
AAATCTGCCG 
TGTTGCTGGC 
TTGTAACGCT 
TATTGGCTAT 
AGACGAGAAT 
TGCCCAAACT 
TTTCATCGTT 
TGGAACGGTG 
GAGCTCATCA 
CAATGATATT 
CAATACCGAG 
TTTCCCACGT 
AAATAAAGAC 
TCATGTTCTA 
ATGGATAAAG 
CACGGTCATC 
GCCGAATCCG 
TTACAAGCTC 
GACGTCCAAT 
GACAGAAAAT 
AATTATCGGT 
ATTGACGATC 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2853 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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AACGGCGTGG TTGAGGATAA CAATCCGAAT 
GACAATCCTA CGGGGACATT GCTTTTTGCC 
ACCGTCGAGT TTTACAATCA GGGATATGAT 
TGGCAGTATT TCGGTATCCC TGTCAATGAA 
5 AACGAGACCG TCAACCAATG GGTTGAGCCT 
TATGCACCTG ATACAGAGCT TCAAAAATTC 
GCACAGOCTA CGGGAGTTTA CAGCTTCAAG 
AATCTGACAC GCACGTCCGG TGTCAACTAC 
ACTGGAGCCA TCGACATCAA GCAGGGTATT 

10 TATCTGTTCA AGACGGGAAC ACGCGACCAG 
GGCTATCGAG CCGGTCAGTA CCTCTCTGTA 
CCGGATCGTA TTCCATCGAT GCATTCCTTC 
ACGTTGCANA TCTTGTACGA TAAGCTGCTC 
ACGCAGATCA CATGGCGATC CGGCAACTCC 

15 GATGTTCTTG GTAACGAGTC GGCCGACCGT 
TTCGGATTCG ACAACGGCTG GGATGGTCGC 
TATGCGATGT CTGACATCGG TAATGATAAA 
AACCTGCTGA TCGGCTTCGA TGCGGATAAG 
TCGGATCATT TTGCGAAAGG GGCTGTTTAC 

20 CGTATTACGA ATTCTACGTC GTA1TCATTC 
TTCCGCTTGT CATATGGATG TGATGAGAAC 
GGCCGTGAAA TTATAATTCT GAATCAAGAT 
ATAGAAGGTA AGCTTCTTCG CCGCTXGAAA 
GTGCAGACCG GAGGGGCCTA TATTGTGCAT 

25 AAGGTGCTTG TTGAGTAT 



GTCGGTACGA TCGTCGTGAA GTCGTCGAAA 2280 

AATCCGGGCT ATAATCAAAA TGTAGGGGGG 234 0 

TGTGCCGATT GTGGTATGTA TCGCAGGAGC 24 00 

TCAGGTTTTC CAATTAATGA TGTGGGCGGA 2460 

TTCAATGGCG ATAAGTGGCG GCCAGCACCT 2520 

AAGGGCTACC AGATCACGAA TGACGTGCAG 2580 

GGTATGATTT GTGTGTGCGA TGCCTTCCTG 264 0 

TCGGGCGCCA ACTTGATCGG CAACTCATAC 2700 

GTCTTCCOGC CGGAAGTCGA GCAGACGGTG 2760 

TGGCGTAAGC TTAATGGAAG CACGGTTTCA 2820 

CCTAAGAATA CAGCGGGTCA GGACAATCTT 2880 

TTGGTGAAGA TGCAGAACGG AGCGTCTTGT 294 0 

AAGAACACGA CTGTAAACAA CGGTAATGGT 3000 

GGATCGGCGA ATATGCCGTC ACTTGTGATG 3060 

TTGTGGATCT TTACCGATGG GGGTCTTTCT 3120 

AAGCTGACTG AAAAAGGTTT GTCACAACTT 3180 

TTCCAGGTTG CAGGGGTTCC GGAGTTGAAT 324 0 

GATGGTCAAT ACACGTTGGA GTTTGCTCTT 3300 

CTGCACGATC TTCAGTCAGG AGCCAAACAC 3360 

GATGCCAAGC GGGGAGATTC CGGGGCTCGT 34 20 

GTAGATGATT CGCATGTCGT GAGTACAAAT 3480 

GCTCTTGACT GCACTGTAAC CTTATTCACA 3540 

GTATTAGCTG GTCATAGAGA AGT CAT GAAA 360O 

CTTCAAAATG CTTTCACTAA TGATGTGCAT 3660 

3678 



(2) INFORMATION FOR SEQ ID NO: 221 

30 (i) SEQUENCE CHARACTERISTICS: 

<A) LEHGTK: 3675 base pairs 
(BJ TYPE: nucleic acid 
(CJ STRANDEDHESS : double 
(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

40 (iv) ANT! -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

45 FEATURE : 

(A) NAME/ KEY : miscfeature 
( 3 ) LOCATION 1 . . .3675 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:221 



ATCAAACGAT ATACAATAAT TCTTGCAGTT TTTCTTTTAT TCT CCACGGT ATTTACCTTT 60 

CAAATAAAAG CTCGCCCTTA TGAAAGATTT GCAGATGTAG AGAAGCCTTG GATTCAGAAA 120 

CATTCAATGG ATTCTAAATT GGTGCCTGCA AATAAGGGTA ACTTAATTCA AGCTGAAATT 180 

GTATACCAAT CTGTTTCTGA ACATAGTGAC TTAGITATTT CACCTGTGAA CGAAATAAGG 240 

55 CCTGCAAATC GTTTCCCTTC GCATACGAAG TCTTT TTTTG CAGAAAATCT ACGGGCATCT ■ 300 

CCCCCCGTAG TTCCCGTTGC CGTCGACAAG TATGCGGTAC CGGTTGCCAA TCCAATGGAT 360 

CCTGAAAATC CCAATGCCTG GGATGTGACG CTAAAAATCA CTACTAAAGC GGTAACAGTA 420 

CCTGTCGATG TGGTGATGGT TATCGACCAG TCTTCGTCAA TGGGAGGGCA AAACATTGCC 4 80 

A3ATTAAAGT CTGCCATTGC ATCGGGACAG CGTTTTGTGA AAAAAATGTT GCCTAAGGGG .54 0 

60 ACGGCTAGAG AAGGGGTGCG TATCGCTCTT GTGAGTTATG AC CAT GAGCC TCATCGCTTA 600 

TCTGATTTTA CCAAAGACAC TGCTTTTCTC TGTCAAAAAA TCCGGGCTTT GACTCCTATT 660 

TGGGGAACAC ATACCCAGGG GGGGCTTAAA ATGGCGAGAA ACATTATGGC CACTI CT ACT 720 

GCTGTGGATA AGC ATAT CAT ATTGATGTCT GACGGGTTAG CGACGGAGCA GTATCCTGTT 780 

AAAAATGTAA CTACTGCAGA CTTCATTGGC AAAACTGGAA AT GCGAATGA TCCCATTGAT 84 0 

65 TTGGTTATAC AAGGAGCAAT TAATTTCCCT ACAAATTATG TTTCCAACAA TCCATCTACA 900 

CCTCTTACCC C AAATT AT C C AACTCATTCT TCTAAAGTTG GACGGAGAAA TCTGCCGGAA 960 

TCCAAATTCG ATT AT AG T AA TCTGAGTGCA AGGATTACTT TTGATGGTGT TGCTGGCGCA 1020 

TTGGTCTATG AACCGAGGTT TCCTCATCCC TATTATTATT ATTTCCCTTG TAACGCTGCT 1000 

ATCAATGAGG CTCAGTTTGC GAAAAACTCT GGTTATACAA TCCATACTAT TGGCTATGAC 114 0 

70 CTGGGAGATT TTGCCTTGGC CAACAATTCG TTGAAACTAA CCGCTACAGA CGAGAATCAC 1200 

TTCTTTACGG CGACACCGGC CAATTTAGCT GCAGCGTTTG ATAATATTGC CCAAACTATT 1260 

AATATAGGTA TACAGAGGGG GGAGGTGACG GACTTTGTAG CTCCTGGTTT CATCGTTAAA 1320 

AATCTGACGC AATCGGGAGA TGTTACTCAT TTGCTAAATG TTTCAAATGG AACGGTGCAC 1380 

TATGATGTCT CTACTAAAAA ACTGACATGG ACT ACT GGT A CTATCCTGAG CTCATCAGAA 1440 

75 GCTACCATAA CTTATCGTAT TTATGCCGAT TTGGATTATA TACAGAACAA TGATATTCC3 1500 
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GTAAATACTA 
AAATTGACCT 
ACGGTTAAGT 
CAACCCATAC 
TCACAAGATT 
GACAAAACCA 
ACTGCCGATG 
CAAATCGGTA 
AATTATTGGA 
ACAGGAGCCC 
GGTTCTCCGG 
CTTATCAATA 
GGCGTGGTTG 
AATCCTACGG 
GTCGAGTTTT 
CAGTATTTCG 
GAGACCGTCA 
GCACCTGATA 
CAGCCTACGG 
CTGACACGCA 
GGAGCCATCG 
CTGTTCAACA 
TATCGAGCCG 
GATCGTATTC 
TTGCAHATCT 
CAGA'i'CACAT 
GTTCTTGGTA 
GGATTCGACA 
GCGATGTCTG 
CTGCTGATCG 
GATCATTTTG 
ATTACGAATT 
CGCTTGTCAT 
CGTGAAATTA 
GAAGGTAAGC 
CAGACCGGAG 
GTGCTTGTTG 



CTTCTGCTAT 
ATACCAATTC 
TAGGTTATGG 
AGGCAAATGG 
TCTTTTTGCC 
CCGAAGCATT 
GTAAACGTTA 
TCAGTTGGAA 
TGGGAGGAAC 
AAGTACCGCT 
CGGTAGCCGA 
ATTCCGACAA 
AGGATAACAA 
GGACATTGCT 
ACAATCAGGG 
GTATCCCTGT 
ACCAATGGGT 
CAGAGCTTCA 
GAGTTTACAG 
CGTCCGGTGT 
ACATCAAGCA 
CGGGAACACG 
GTCAGTACCT 
CATCGATCCA 
TGTACGATAA 
GGCGATCCGG 
ACGAGTCGGC 
ACGGCTGGGA 
ACATCGGTAA 
GCTTCGATGC 
CGAAAGGGGC 
CTACGTCGTA 
ATGGATGTGA 
TAATTCTGAA 
TTCTTCCCCG 
GGGCCTATAT 
AGTAT 



CGGCCCGGAT 
CAATGGCGAA 
TGTTATTAAG 
AACAGTTGTC 
CTCAGGTGGA 
ACAGTACTAT 
TCGTTTTGTC 
AAAACCGGCA 
AACAGACCAA 
CACAGGAGAA 
TTTGCATGTC 
GGATTTAGTT 
TCCGAATGTC 
TTTTGCCAAT 
ATATGATTGT 
CAATGAATCA 
TGAGCCTTTC 
AAAATTCAAG 
Cl'TCAAGGGT 
CAACTACTCG 
GGGTATTGTC 
CGACCAGTGG 
CTCTGTACCT 
TTCCTTCTTG 
GCTGCTCAAG 
CAACTCCGGA 
CGACCGTTTG 
TGGTCGCAAG 
TGATAAATTC 
GGATAAGGAT 
TGTTTACCTG 
TTCATTCGAT 
TGAGAACGTA 
TCAAGATGCT 
CTTGAAAGTA 
TGTGCATCTT 



CTTGGTGGAT 
CCGAATCAGC 
CGGCACTATG 
AGTTCCCTAA 
GGTCATATTG 
TCCGTACCGC 
GAAGTCCCAG 
GGAAACGCTT 
CAGAGTGAAT 
GATGTAGAGT 
CCGACAACCA 
GTTACCACAA 
GGTACGATCG 
CCGGGCTATA 
GCCGATTGTG 
GGTTTTCCAA 
AATGGCGATA 
GGCTACCAGA 
ATGATTTGTG 
GGCGCCAACT 
TTCCCGCCGG 
CGTAAGCTTA 
AAGAATACAG 
GTGAAGATGC 
AACACGACTG 
TCGGCGAATA 
TGGATCTTTA 
CT GAd'G AAA 
CAGGTTGCAG 
GGTCAATACA 
CACGATCTTC 
GCCAAGCGGG 
GATGATTCGC 
CTTGACTGCA 
TTAGCTGGTC 
CAAAATGCTT 



TCGATACCAA 

AGTTAATTTT 

T ATT GGT AAA 

GCGAGGCTCA 

TTCCCAAATG 

CGACTAACAC 

GCTCCACGCC 

ACTTCGCTTA 

GGCATGTGAC 

TTGCAACGAC 

ACCCCAAAAT 

GCAGTCAATT 

TCGTGAAGTC 

ATCAAAATGT 

GTATGTATCG 

TTAATGATGT 

AGTGGCGGCC 

TCACGAATGA 

TGTGCGATGC 

TGATCGGCAA 

AAGTCGAGCA 

ATGGAAGCAC 

CGGGTCAGGA 

AGAACGGAGC 

TAAACAACGG 

TGCCGTCACT 

CCGATGGGGG 

AAGGTTTGTC 

GGGTTCCGGA 

CGTTGGAGTT 

AGTCAGGAGC 

GAGATTCCGG 

ATGTCGTGAG 

CTGTAACCTT 

ATAGAGAAGT 

TCACTAATGA 



TACCGAGGCA 
CCGACGTCCG 
TAAAGACGGT 
TGTTCTACAG 
GATAAAGTTG 
GGTCATCACT 
GAATCCGGGC 
CAAGCTCCTC 
GTCCAATTGG 
AGAAAATTTC 
TATCGGTAAC 
GACGATCAAC 
GTCGAAAGAC 
AGGGGGGACC 
CAG GAG CT GG 
GGGCGGAAAC 
AGCACCTTAT 
CGTGCAGGCA 
CTTCCTGAAT 
CT CAT ACACT 
GACGGTGTAT 
GGTTTCAGGC 
CAATCTTCCG 
GTCTTGTACG 
TAATGGTACG 
TGTGATGGAT 
TCTTTCTTTC 
ACAACTTTAT 
GTTGAATAAC 
TGCTCTTTCG 
CAAACACCGT 
GGCTCGTTTC 
TACAAATGGC 
ATTCACAATA 
CATGAAAGTG 
TGTGCATAAG 



1560 
1620 
1680 
1740 
J 800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3400 
3S40 
3600 
3660 
3675 



40 



(2) INFORMATION FOR SEQ ID WO: 2 22 



45 



tl> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 



(11) MOLECULE TYPE: UNA (genomic) 
50 (iii> HYPOTHETICAL; NO 

(lv) ANTI-SENSE: NO 



55 



60 



65 



70 



75 



tvi) ORIGINAL SOURCE : 

(A) ORGANISM: PORY PHYROMON AS GINGIVALIS 

tlx) FEATURE : 

<A> NAIIE/KEY : mlac_feature 
( B } LOCATION 1...1275 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 222 



ATGGAAGTGA 
CTCTTCCACT 
AAAGGATTCG 
TCGCGTACGA 
TCGGTCAATG 
GCCTTCTTCG 
TGGGGGGCTG 
TTTCTGGCAC 
TTCAACGAAT 
CCTGTGGATC 
AATATCGTCT 
GGTGCGCGAT 
TGGAACAAGA 



AGAAAAACAC 
CCTCACGGCT 
TGGATACCTA 
GAGTGAGAGG 
CCACCTACAA 
AGCATCAGGA 
CCGACGGTGT 
AGGATTACGA 
CGATGAAAGT 
CTCGCAATCC 
GGAAAGAAGA 
GGAGCACTAC 
TGCCCGTCAT 



AGTGGTGCTA 
GTGGGGACAG 
CCATGCCGTA 
TGAGCTGGAG 
TGCTCTACTG 
AG AG CATTGG 
GCGCATCACG 
TGATATTCGT 
GGAAGTCGTG 
TTGGAATATC 
AGCCGGCAAA 
GCTCTCCGGT 
CGAAGTACAG 



CGCCTTCTGA 
GAAGGGGAGG 
CGCAGCTCTT 
AGGTCGTTCG 
AAAGACGAGA 
GGGTTGCGCC 
GATCTGATCT 
AT GCCGGTCA 
GTACTGCCTG 
TTCTCCCTTT 
CCGGCCTTCA 
ATCGACTTCG 
GGCATTGTGC 



TTTGGTTCGT 
GGAGTGCCCG 
CTCCTTTTGA 
GTAATTCGAA 
CCGGCTTACG 
TCGGACGACA 
CCCCGATGGA 
ATGCATTGCG 
TATTCGAGGG 
CGCCCATTGC 
AGGTTGCCAA 
CTTTGGCT GO 
CGACGGAAAT 



GGCCATTCTT 
ATACAGATTC 
TTTCATGAGC 
AGTAGCCGTA 
TTTACGTGAA 
GATTGTCATT 
TATGACCGAG 
TTTCTCTGTC 
Gl'ACCGTCTG 
TCAGGGGATG 
TATCGAGTAC 
ATTGCATACA 
CATCGTTAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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CCTCGCTATT 
GTTTTCAGGG 
GAGCAAGAGG 
GAATGGATGA 
TCCCAAAGAC 
ACACTCCAAC 
TTTGCAGCTG 
AGTAGTAAGG 
GCCCGCTACA 



ATCGTATGGG 
GAGAGGCTGC 
GTTTCCAAAC 
TCTCAGGACA 
AACATTCTAC 
TTTCGGACTT 
ACTATGCCTT 
GCAGCGGTAT 
GCTTC 



ATTTGTCGGC 
GTTCAATATC 
AATCAATTGG 
ATTCTCAATG 
CCTGATTACT 
CACCTACTAC 
GAACGATCAG 
ATTCGATCGC 



GGCGACCTCT 
GACAAACACT 
TTGGCCGGAG 
GAAAGCATAT 
CTCAATGTTT 
GACCTTACGG 
ATACATCTGA 
TACAAAGACA 



CCGTACCCGT 
TCACCTATAA 
CCGATTGGTA 
TCAGGTATAG 
CCAAGAAATT 
GCAAAGGATG 
TGGCCGGATA 
ATTCCGAACT 



CGGACAGTTT 
GAGTCATGCC 
TGCTCCCGGT 
GGATTTCATC 
CTT CGGCAGT 
GTTCAGTCGC 
TGACTGGTTC 
CTGGTTCAAA 



(2) IN FORMAT I CM FOR SEQ ID NO: 223 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

Hi) MOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAL: HO 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1275 



25 



(iv) ANTI-SENSE: NO 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



30 



35 



40 



45 



50 



55 



U>:> FEATURE: 

<A) tlAME/KEY : misc f ea ture 
(B) LOCATION 1...1212 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO:223 



ATGTCCTCCT 
GAACGGAATC 
TTAGCTACTA 
TAT GC CAT CG 
ATACTCCTGA 
AAGAATCCGC 
TTGTCACCGT 
AAGAAGAAAT 
GAGGGAGAGC 
GCCTGCGAAA 
CGTAAGATGC 
TCAGAAGACA 
GGCGATGAAT 
CGCATAGATG 
GATGAGTTCG 
GGCGAGATTA 
AGTTATCTTT 
GAAAATGCTT 
ATCAAGCAGG 
GTGACCCAAA 
TGGGAGGTCG 



GTGAGGTGGC 
ACTCTTCCGA 
TTCTGATCGG 
AGCAGACATT- 
CCACTGTTCT 
TGCAATACTC 
TTTCAAAATT 
ACGATATGTC 
CG GAG GAG AA 
TCATGGTT CC 
TTGACTTCGT 
ATATCAAAGG 
TCGACTGGCA 
ATTTGCTCGA 
GTGGCACTTG 
CGGACGAGTA 
TCGAAGGAAA 
TCGGTGAATT 
AACTCCCCCA 
T GG ACAAGCG 
AA 



TTATTTTTCA 
CATCGCGCTT 
GAATAATGTG 
CGTTTTCTCT 
TTTGCTGTTC 
GCGCTTTTCT 
GCTGGTCAAA 
CGTGOATGAG 
AGAAATGATT 
GCGTATCGAT 
TGTTTCGTCG 
GGTGATTTAC 
TCCTCTGATT 
GGAGTTCAGA 
CGGACTGATC 
CGATGAGGAA 
AACGTCTCTC 
GGGGGACGAG 
TGTGGGCGAT 
CCGAATCATC 



CTAAAGCCGA 
TCCAATTTAT 
ATTAATGTAG 
TCTCCGATCA 
GGAGAGATTC 
GCTGCAGCTA 
AGTACCGGCA 
CTCTCGAAAG 
AACGAAATCA 
ATTGTGGATG 
GGTTATTCCA 
ATCAAAGATC 
CGTAAAGCAT 
GCCAATAAGG 
ACAATGGAGG 
GAACTCCCCT 
TCCGATGTTC 
GTAGATACGC 
ACAGCAGTGT 
GAAATCAAGA 



TCGATCTGCA 
TAGACAATTC 
CCATCGTTAT 
TTGGATTTCT 
TGCCGAAAGT 
TGTCCGTTAT 
TCGTTACCAG 
CGGTAGCCCT 
TCAAATTCTA 
TGGATCTGAG 
GACTTCCCGT 
TAATCCCACA 
ATTTTGTCCC 
TGCATGTCTC 
ACATATTGGA 
TTAAGGTTTT 
GACACTATCT 
TAAGTGGGCT 
ACGAGCCATT 
TTTTCCCTTT 



GAACATCCGC 
GAATCAGCTA 
CCTTTCCAAT 
GATCCAGACG 
GTATGCGCGG 
CTATAAGATA 
AGGTATCAGC 
CACCACTACG 
TAATAAGACA 
CTGGCCATTT 
TTCAGAGGGG 
CAT GG AT AAA 
CGAAAACAAG 
CAT CGTTGTG 
AGAGATCGTC 
GGGGGATGGC 
TGACCTTCCG 
CTTCTTGGAA 
CCGCTTTCAA 
CGAGCGCACT 



60 
120 
100 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1212 



(2) INFORMATION FOR SEQ ID NO: 224 

60 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH J 780 base pairs 
<B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

65 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

70 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : PORYPHYROMONAS GINGIVALIS 

75 (ix) FEATURE : 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(A) NAME/ KEY: miscjeature 
(B> LOCATION 1. . .780 

Ui> SEQUENCE DESCRIPTION: SEQ ID NO:224 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



ATGAAACTAT 
ATGCTCTTCT 
AT ATT GCCCG 
GCCAATCTCA 
TTTTCTACCT 
TATTTGTTGC 
ACAGGAATGA 
GATAACTCTTC 
CCAGCCACTA 
TCTATCAGTT 
CAGGGAAAGC 
TACCATATCA 
CCTATTGTAC 



TACTTTATCT 
CAGAGAATCT 
TACTGGATTT 
ATCTGCTGAT 
ATGGCAATAA 
ATCATGTTTT 
AATATAAGGT 
TCATGTTTGC 
GCCTCTTTGC 
TCAGACATCG 
CTGACAG TT A 
CACCTACGAT 
CTGTGCGGAA 



CCTATTGGTC 
CACAATGAAT 
CAAAACCGAA 
AAAGCACGGT 
AGTAACCGTA 
TGAGGTTTAT 
TTCTACGGGA 
AACATTGGGG 
AGGAACGTAT 
GTTGCGTGAA 
TTTTAAGAAG 
AGGAATACGC 
AGATTACAAC 



TTGTCGACTC 
ATAGACAGCA 
AAGGAAAATG 
CAAGTAATCA 
AGTGGAGGAT 
CCTTATGTCG 
TTACAGTCGC 
GTATTTTTCG 
GCATACAGCC 
CATTGGGAAT 
GCACGTTTTG 
GGGGCCTATC 
ACCGTTGATG 



TATCCCCGAT 
CCAAAACCAT 
TGTTCACCTT 
ACTTAATTAA 
ATGTACACAC 
AGTCGCAATG 
GTTATCGGCT 
AATTCGAAAA 
GAAGTATCAA 
TTACAACTAC 
GAGGAGCTAT 
GGATCATCTA 
TTGGTATCGA 



GTATTCGCAA 
ACAAGGAACG 
CAAAAATACT 
TAAGCTTGAG 
CGAATACCGC 
GGCAGAAAGT 
GGTAAATAGT 
GTGGGAACAG 
AAGCCACCTG 
GGCTATTCAC 
CGACCTCAAA 
CGATACTGCC 
TATTTCGTTT 



12) ItiFORIlATION FOR SEQ ID 110:225 

U) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 2502 base pairs 

(B) TrPF.: nucleic acid 

(C) STRAIIDEDNESS: double 
(O) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DtIA (genomic) 

(iii) HYPOTHETICAL; NO 

(iv> ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: PORY PH YR01 Id IAS GINGIVALIS 

tlx) FEATURE: 

(A) NAME /KEY : mi 3C _featuc- 

(B) LOCATION 1...2502 

(Kl) SEQUENCE DESCRIPTION: SEQ ID NO:22S 



ATGAAACGAA 
CAGAACAATA 
CCAGTGCCCT 
CGACAAGTGA 
CACCTGACAG 
GGACAGCACG 
ACCGTCACCG 
AATATGAAAG 
CCTTTGGTAA 
ATCCACCTCA 
TCCATTCCTG 
GATGCGGAAG 
GGATATTCAG 
TTTCTGACGG 
AAAAACAAGG 
GAAGGCAAAG 
ATAGATTCGC 
GACCGGAACA 
AAACTCAAAA 
ACTCGCCTGC 
AGCGAGACCT 
TACGCCGGCC 
TATACACGTC 
CATGCCACGA 
GGCTCTCTAT 
GCAGCCTATG 
GTGGAAAGCA 
AACTCGTTCG 
CTTAAGCTGG 
CGGCTACAGA 
CGTCACCACG 
CTCGACTACG 



TCGTTTTATC 
CCCTCGATGT 
ATGCCACTGT 
CTGACGGCAA 
CTTCGTTCGT 
ACATCAAATC 
TATCGGCAGC 
ATGACCCCGC 
CGGTGGATGG 
ATGGCAGGCC 
CCCATACGAT 
GCACAAGTGC 
GTTCCATCAC 
CAAAGTCCGG 
GCTCTCGCTA 
GGCAAGAAAC 
TCAATCTCTT 
GCGTAGAAAA 
CACAGATGGA 
CGGGCGAATT 
TCATTGACCA 
AGCACTCCAA 
CCTTAGGACA 
GCGATCCTCT 
ATGCACAGAA 
CCGGCTACAA 
GCAGGCTGAA 
ACTGGGTGCC 
CCTATAACTT 
CCAACGATTA 
TCGGTCTCTC 
ACTTCTGCAA 



ATCTTTCCTG 
ACACATATCC 
AAGCATCCGG 
CGGCTACTTC 
AGGTATGAAA 
CATCGACATT 
ACGACCACTG 
AGCCAAGACG 
TCAGGGCAAT 
CTCGACCATG 
CAAACGGGTG 
CATCCTGGAC 
GGCCAGTGTC 
CAAAGTCGGG 
CTTTACCGAA 
CTTTGGCGGA 
TACGGTGGGC 
AAGCTTTGCC 
TGCCGGATCA 
GCTCACCGTT 
ATGGAAGCGC 
ATCCGATGCG 
AGCACATTCT 
CTATGAGATA 
TCCGTCGAAC 
CTATCGTAAG 
AGCACT CTTT 
ACAGCTCACG 
CCGAATCCAA 
TCAAGTACAG 
CTATAATCAA 
CAACGCCATC 



TTCGTTCTGT 
GGTACGATCA 
CTGACAGGAG 
GTCATAGGCC 
ACCCATACCA 
TCTCTCGAAT 
GTGAAGATGG 
AACAACCTGC 
ATCCAGGTGA 
GTGAGCAGCA 
GAGGTCATCA 
ATCGTCACGG 
AC CAACAATC 
CTGACTACCA 
CGTACTACAT 
CACTTCGGCA 
GGCAATGTAC 
GGCAGCAACC 
TACGAGCTCA 
TCCTACCGCT 
GATCCGCTCA 
GGCATGGACG 
TTGGAAGCAG 
CGACCATCCG 
GGAAAGTTCC 
GATCAGTATT 
CCCGAAAACG 
CTCGGCTATA 
CGTCCTGCAA 
TAT GGTAATC 
TACGGAGCCA 
CAGAATTACA 



CCATACTTTC 
AGGATGCCTC 
CAGATACCAC 
TGCCGGCAGC 
TGCACATTAG 
CCGAGGACAA 
AGATAGACCG 
TCGAAATGCT 
AAGGATCTTC 
ACCCGAAGGA 
CCGATCCGGG 
AAGAAGGTAA 
CCACAGCCAA 
ACTATAACTA 
CCATGCTCCA 
ATGCCCTCCT 
GCCTTTGGGA 
TCAT GTCCT A 
ATGCCGACTA 
TCACTCACAA 
ACACAGCTAA 
AACATACGGC 
GGCTGAAGTA 
AAGATGCTCC 
GCCACGATCA 
CTTTGCAAAC 
CAGCAGCAGA 
CCCCCTCGCC 
TCGGCCAACT 
CCGACCTAAA 
AGGTCATGCT 
CCTTCTCCGA 



TTTGATGGCA 
CTCCGGCGAA 
ACAGGTGTTC 
TCCCTCCTAT 
TCGGGGAAAT 
ACAACTCTCC 
CCT GTCCT AT 
GCGCAACGTT 
CAACTTCAAA 
GGTCTTTCGC 
TGTAAAGTAC 
GAAGCTGGAA 
CGGTAGTATC 
CTACGGTGGC 
AACGATAGAA 
CTCATTCGAG 
GATGACCACC 
CATAGACAGA 
TCAGCACAGC 
TCCTAATAAT 
TACGATCCAG 
ACAAGTGGAC 
CATCTATCGT 
GTGGCAGCCC 
ATACATCGGA 
CGGCCTCCGA 
TTTCTCCCAC 
CATGAAGCAG 
GAATCCCTAC 
GTCGGAGAAG 
TACAGCATCG 
CCCGGCCAAT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1600 
1740 
1800 
1860 
1920 
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CCCAATCTCT 
ACCTATGCCA 
CGCACATTCC 
TACTCAGGCC 
TATCATCGGG 
ATAGCCAAAC 
GCGAAGTATT 
AATGCCGGTA 
ACACAAGTGC 
CAAGGACAGC 



TCCACCAGAC 
TGTACACGCC 
AAAAGAGCGA 
TGATGTTCAC 
GAAGAAGCTA 
AGCTTTTCGA 
CGACATGGAA 
TACAACGGAG 
GCAAGGTAGA 
AGGGTGGCGG 



CTATGGCAAT 
GGCCGTATGG 
AGCACTCGGC 
CCTGCCGAAG 
CCAGACGAAG 
CAAAAAATTG 
GAGCCGGACC 
TGTTTCCCTC 
GCGTACGATC 
ACAAGGAAAT 



ATCGGACGAG 
GTCAGGATTA 
ATTGATGTCA 
GATTGGACTG 
TATGATGGCA 
AGAGTCTCGC 
ATCGGCAATG 
AGCCTCACCT 
GTCAACGACG 
CCTACCGGCA 



AGCATTCTTT 
TGCTCAACGG 
ATTCATGGTC 
TGAATCTCTT 
ATGTATTCAA 
TGAGCGCAAA 
GATTTACTAT 
ACAGCTTCGG 
ACCTCAAGCA 
AT 



CAGCTTGAAT 
AAATATCGAT 
CGGCATGGTA 
CGGAGGTTAT 
CAATATCGGT 
CAACATTCAT 
TTATTCGGAA 
TAAGATGAAT 
AACCTCATCC 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2502 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID MO; 226 

<i) SEQUEHCS CHARACTERISTICS: 

(A) LENGTH : 1197 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

tlii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GTNGIVALIS 

(lx) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...H97 



txi» SEQUENCE DESCRIPTION: SEQ ID NO:22€ 



ATGAGACTCT 
CAGCATTATT 
CTCTCGGCAT 
GTCAAAGGTT 
GCAGCACATG 
CGCTATCCGG 
GTACGCTTGC 
CTCATGCTGG 
CCTGAGTTGG 
CGTTTCCGAA 
CAGGAGTCGT 
TGGCGACTCG 
GCGCAGCAGG 
CGCCCTCGTA 
TCAAGCGGAG 
GACTTGGAGC 
TTTGCTGCAC 
GGCGATTATA 
GCGGCTGTTG 
GAACTGACGA 



CTGCCATTCT 
ATTCCATGGC 
CGATCCGAAG 
ATACGTTGCC 
GTGTGAAGCT 
CCGGTATCGC 
GTATCCT GCC 
GCAATATAGC 
ATTTGACGGC 
TGGATGTTTG 
TTGTCTTTGG 
AACTGCCCTT 
ATACCGTGCA 
CCGACAAACC 
GATACTTCCC 
ACTTCGCTTT 
CTTTCGCCAA 
TTCGTCTCTA 
CTCGGGTATG 
TGCGTATC7GA 



TATCGCTTTG 
GGGAGAGCGA 
TGCGCTTTTC 
GGGTGCACGG 
TTCGCTCGGA 
TT ATT CCG AT 
TTATGTACAG 
CGGTGGTACG 
TGATCCTGAA 
GGTCAATTGG 
CTTGTCCACT 
GCAGGCTATT 
TACATGGGTC 
CATGCAGATT 
TTACGAAAGA 
TCGTACCGAC 
TTCCCTGACG 
TGCCGACTAT 
GTTCCAGCCT 
TCCCAAATTC 



ATTGTGATGC 
CTGGAGACGG 
TTTCGGAACA 
GTTTCCGCTT 
GTATCTACCC 
TTACCTTATT 
GCCATGCTGA 
GCTCACGGAC 
GCCGGTGTGC 
ATG AG CATG A 
ACTTCGAAAT 
GCCACGCATC 
AATGGAGCTG 
TGGGGATCTG 
GGGTGGGGCG 
TATTGGTACG 
TATGACAAAC 
TCGTGGCGCA 
TCGGATCGTT 
CCAATAGCTT 



TGCCTGCTGT 
ACAGCA1TCG 
ATGAATACAA 
TTGCCTCT T A 
TGAACTACTG 
GGACGGACTA 
AGCCGACGGC 
TGATCGAACC 
AATTTCGGGG 
TTTTCAAAAA 
TGTTATCGGC 
GCGGCGGGGA 
TCGGACTTAA 
CTTATGGTGT 
GTTATCTTTC 
GCAGGCATTA 
AGCCTCTTAC 
TGGCACGAAG 
TTGCGATGAG 
TTCTGAAAGG 



GCTTAGCGGG 
TCCGAACGAA 
TGCACGTTCG 
CTCGCTGCCG 
GGGGGCAAGT 
TAACGACTAT 
CACGACTGCT 
GATCTACAAT 
TGATTGGACA 
TGACAATCAT 
TGAAGGCAAA 
ATACAACTGG 
GCTTTCGTAT 
GGCAGCCTTG 
TCTCGGAATG 
CGTTTCTCCC 
GAACGGTTGG 
TGTTTCGTTG 
CCACGCCTTG 
CAATCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1197 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 227 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 114 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

{11) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

tlv) ANT1 -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

(ix) FEATURE : 
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(A) NAME/ KEY : misc_faature 

(B) LOCATION 1 114 6 



(Xl> SEQUENCE DESCRIPTION: SEQ ID 110:227 

ATGAACAAAT CGCTATTATC AT TGGCATGC CTCATCCTGT GCGGTATGCC GGCCATCGCC 60 

CAACAGACAG GACCGGCCGA ACGCAGCGGC GAGCCTTCTC TCGCCGAACG TGTATTCGGT 120 

CTGGAGCAGA AGCAGAAAAA GCTGAAGGTG TACTTAGGCA TACAGTCGTT CTACGACCAG 180 

CCGCTTGTCC ATGACGAATC CCATATCGGA CACTTCAAGG TACAGGAGCT GCGGATGTCT 24 0 

GCTCATGGCG AACTGAACCG CCACCTCAGC TTCGACTGGC GACAACGTCT CAACCGTGCC 300 

GCCGACGGCA CTTCGTTTGC CGACAATCTC TCCAATGCCA TCGACATCGC AGGTGTGGAC 360 

TGGCACCCGA ACGACAAGGT GTCTTTCTTC TTCGGACGTC AGTACGCGCG TTTCGGAGGG 420 

ATAGAATACG ACATGAACCC CGTAGAGATC TACCAGTACA GCGACCTTGT GGATTACATG 4 80 

ACCTGCTATA CTTCGGGCGT GAACTTCGCA TGGAACTTCC ACCCCGAACA GCAGCTGCAG 54 0 

XO CTACAGGTAC TCAATGCTTA CAACAACCGC TTCGCCGACC GCTACCACGT GACACCCGAT 600 

GTCGCTACCG CCACGAGCTA CCCGCTCCTC TACTCGGCAC AGTGGAACGG TACCCTCCTC 660 

GGAGGAGCAC TGCATATGCG TTACGCCGTG TCGATGGCTC ATCAGGCCCA AGAGCGTAAT 720 

ATGTGGTACT TCACTGCGGG CAACCTGTTC AATCCGGGCA AACG GAT CAA CGGATACCTC 780 

GACCTCACCT ACT CG AT CG A GGGATTGGAC GACAAAGGCA TTATGACTGC TCGCTA^GGC 840 

AAGGGCAAGA CCCTCACGGA CGTCAAGTAC TATGCTCTGG TATCGAAGTG GAACTTCCGC POO 

ATTTTCGATC AGGTCAATCT CTTCCTCAAA GGCATGTACG AGAACGGCTA TGCGCCTGCC 960 

CAATACGGCG AGAGCAGCCA CACGCGCCAC TCCTACC3GCT ATATGGGAGG GGT GGAAT AT 1020 

TACCCTACGG AGACCAACTT CCGTCTGTTC GTCACCTACA TAGGACGGCA TTACCGGTAC 1080 

AGTGCGACCG AGACGGAAAG CACCAATGCT CTTCGCGCCG GTCTGATCTA TCAGATACCT 11 4 O 

AO TTCTTA , . 

1 J ** o 



(2) INFORMATION FOR SEQ ID HO: 228 

30 (1) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 666 base pairs 

(B> TYPE: nucleic acid 

(C) STRAW DEDNESS: double 
0 _ <D» TOPOLOGY: circular 

do 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

40 (Iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOHAS GINGIVALIS 

45 (ix) fcEATURE: 

(A) NAIIE/KEY: misc_feature 
IB) LOCATION 1...666 

50 Ui> SEQUEnCL ASCRIPTION: SEQ ID NO: 228 

ATGAAACATT TGTTTAAGTC GACATTAGTA CTTCTTTGTG CTCTTTCTTT TTCCGGTACC 60 

TATACCTTTG CACAAGAAAA TAATACAGAA AAGTCACGAT TTGATTTTTC TGTTAGGCTG 120 

GGACAGGGAT ATATTGCAGG TTCAACTACC AACCTGATGT ATGGGTATAC ATCTGCTAAC 180 

GATAGACTTT TGTCTGGTGC AATTTATCTG GGCTTGACAC CAAGTAAGAA AGAAAATGCA 240 

OO ACCGGCGTAG CATTTCGTTT CTTATCNCCC TCTCCGGGTT ATTATGTCGA TATATCCGGC 300 

AAAGAAAATA CCTTGAATTA TGCGTTTTAC GTTGTCGGAG CATATAATAG AATAGCCATT 360 

CCTATACGCC CTATCAAAAA TTTTAATTTC ATCTTCTCTA CAGAACTCGG AAT GGCTTGG 420 

ATGAGTCGTC ATGAGCAAAT TTACAATTCT ACTTCGCAGA CTTGGGATAA GCAGCGCAAG 4 80 

TCGAGGTCGG GACTGGATTT TGGTCTCGGG ATGCATCTGC AATNCCACAT TAATAAGACC 540 

GTTTACTTTA TGGCAGGAAC CGATCTTACG TCTTGCATGT TCGGAAAAAG GATCAATGAC 600 

TACCAGCAAA AGGATCGAAC CTTCATTGCA CTTATCGACA ACAGTATTGG CATAGGATTA 660 
AACCTC 



60 



65 (2) INFORMATION FOR SEQ ID NO: 22 9 



70 



li) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



iii) MOLECULE TYPE: DNA (genomic) 
75 (iii) HYPOTHETICAL: NO 



666 
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(iv) AHTI -SENSE: NO 



10 



40 



50 



55 



60 



65 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PHY ROMONAS GINGIVAL! S 

li*) FEATURE: 

(A) NAME/ KEY: miec_feature 

(B) LOCATION 1...1173 

Ixi) SEQUENCE DESCRIPTION: SCQ ID NO: 229 



ATGATTATCA AGAAAATGCT GAAAAATAAA TTGGCCCCCT TGGCCATACT GTTCCTTTTT 60 

GCTCCAAAGG CTATGAAGG C TCAGGAGCAA CTGAATGTGG TACACACCTC TGTGCCATCG 120 

IO CTGAATATCA GTCCGGATGC ACGTGCGGCC GGTATGGGGG ATATAGGTGT GGCAACGACG ISO 

CCGGATGCGT ATTCACAGTA TTGGAATCCG AGTAAATATG CTTTCATGGA TACGAAAGCC 24 0 

GGTATTAGCT TCTCATATAC ACCCTGGCTG TCCAAGCTGG TCAATGATAT TGCCCTGATG 300 

CAGATGACCG GTTTCTACAA ATTGGGAACA GACGAGAATC AGGCTATTAG TGCTTCTCTG 360 

CGTTATTTCA CATTAGGAAA GTTGGAGACT TTCGACGAAT TGGGCGAATC CATGGGAGAG 4 20 

GCCCATCCCA ATGAATTTGC TGTCGATTTG GGCTATAGCC GCCAGTTGTC GGAGAACTTC 4 80 

TCCATGGCTG TTGCACTGCG TTACATCCGC TCAGACCAAA GCACTCACAA CACCGGAGAG 54 0 

AATCAGGCCG GAAATGCCTT TGCGGCGGAT ATAGCCGGTT ATTTGCAGAA GTATGTGCTA 600 

CTGGGTAATG CGGAGAGCTT GTGGTCGTTG GGTTTCAACG TAAAGAATAT CGGAACGAAG 660 

ATCTCCTATC ACGGAGGTGT CACGAGTTTT TTCATCCCTA CTTCGTTGAA TCTCGGGACG 720 

ZD GGGCTGTTGT ATCCGATCGA TGACTATAAC AGCATCAATT TCAACCTTGA ACTTAGCAAG 7 80 

CTGCTTGTAC CCACTCCTCC TATCATGGAT CAAAACGATC AGGCCGGGTA TGAGGCTGCA 04 0 

CTCAAGAAAT ATCAGGAAAC TTCTTCGATC AGCGGTATAT TCTCTTCTTT CGGTGATGCG 900 

CCGGGAGGAC TCAAGGAAGA ATTCCGTGAG ATTACATGGG GACTTGGGGC TGAATATAGC 960 

TATGACGATA AATTTTTTGT TCGTGCCGGA TATTCATACC TGCACCCCAC CAAAGGCAAT 1020 

JU TTGCAGTACT TCACGGCCGG TGCCGGCTTC AAAATGAACA TATTCCGTAT CGATGCTTCC 10S0 

TACCTGTTGT CTACGATCCA GAGTAATCCG TTGGATCAGA CTCTGCGGTT TACGCTTGCT 1140 

TTCGATATGG ATGGATTGCG CAATTTGTTC CAC - 1173 

35 (2) INFORMATION FOR SEQ ID NO: 230 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 
(B J TYPE: nucl-ie acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



<ii> 1IOLECULE TYPE: DNA (genomic) 
45 (ill) HYPOTHETICAL: NO 

tiv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

<ix) FEATURE: 

(A) NAME/ KEY: rolscjeature 
<B) LOCATION 1...1338 

Ui> SEQUENCE DESCRIPTION: SEQ ID NO: 230 



ATGAAAACAA CAGTTCAACA AATTATTCTG TGCCTGGCTT TAATGATGTC AGGTGTATTG 60 

C5GCGGAAACG CACAGAGCTT TTGGGAAGAA ATAGCTCCTC CTTTTATCAG TAATGAGCCT 120 

AACGTCAAGT ATATAATTCC CAATATGGGG ATTGATTCAA AGGGAACAAT CTATGTAACC 100 

GTCACAAAAA GGATTCAGCA GGGAGCAAAT TATACTTCTG AGCAATTGGG TAT G TACT AT 24 0 

CGACCATTAG GTGATAATGA ACAGTGGTGG AAACATGATC CGTATTTTGA TGACAAGATA 300 

GTTGCGGATA TTCAGACAGA TGCATATGGC AGAGTTTATn TATGTACGAC TTCTTCTCGA 360 

GATCAAGAGT ATCAACTTTA TATAAACGAG CAGAACGAAT GGAGGTGTAT ATTCAAAACT 4 20 

TCTGTGTCTA CAT AT GAG CA TGGTATGGCT GTTTTTCGCT CTTCGACAGG GGTGACTTAT 480 

ATAGGTACCA GGCAT CACAT CTTCGCATCA GGTGTAAATG ATTTCGAGTT CAACACTATC 540 

TATGAAGACT CTACACCTAT GAGCTGTCGC TTTGCAGAOG CTACGAATAG TGG CAC CATC 600 

TATCT GGCAT TG AT GCATG A AACCACAATG TCTACGACTA TCCTTACTTA TCAAAACGGT 660 

GAGTTCGTCG ATATCTCGGA AAGTGAATTG AGTAACTCGA TTATTGCATC CATGTGCTCT 720 

/U AATAAAGAAG GTGATATAAT AGCTCTTGTT ACTTCATATA CAGGATTTAT GAGTGGAACC 7 80 

CTTGCGATCA GAAAAGCAGA TGAAGGCAAA TCGCAACTTG TTGGCGGAGA TATACAGAAT 84 0 

GCGATCGTTC AAAATATATG CATGATGGAC GACAACAAGA TTGCTTGTGA AGTCTTCGGG «00 

ACTCCTAACG GAGTAGATGG TCGGACAAGG GTTTGTGTTT CTGACGCATC TGTTTTGAT 960 

7q TTTGAGTGGT ATGAAGATGA AATATACGGA GGCCTGATAT TTGACACTTT CTTCTATAGC 1020 

/3 CCTTGGGACA AACTTCTTTA TGCGAAATTT GGTGGGATTA TGCTCAGGAG TAAAGAGTCT 1000 
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TTTATAACCT CTTTCATTTC TCCGACAGTT GTACAAGGAG TGGATGTCTA TACTTTGGCC 11 4 0 

GGGAAGATAA GGATCGAAAG TGAAACTCCG GTGTCTGAGG TGTTGCTTTT CGACCTGGCT 1200 

GGCAGGATGG TACTTCGGCA AACCATTGAT AATAAAATCT ATTCGGACAT AGATACTAAC 1260 

GGACTAAAGC GAAGCGGT AT TTACGTAGTC TCGGTGCGGC TCTCTTCCGG ACAGGTATTC 1320 

5 AGTCATAAGG TGCAGGTA 1338 



(2] INFORMATION FOR SEQ ID NO: 231 

10 II) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

15 

til) MOLECULE TYPE: DNA (genomic) 

(Jt1) HYPOTHETICAL! NO 

20 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS G1HGIVALIS 

25 (i.-:) EEATURE: 

(A) MAI IE/ Kb! i : nuse_f eature 
( D ) LOCATIOM 1...92 4 



30 



50 



60 



65 



ixi) SEQUENCE DESCRIPTION: SF.Q 10 NO: 231 



ATGATAATCC GGTGTCTTAT CCGTCGTCCG AGAACCGTCC TGTTCGGGTT GATATTCGTG SO 

GTAGGTCTTT TCTCTGCGAT GGCGCAAGAG AAAAAGGATA GTCTCTCTAC GGTTCAGCCA 120 

GTGCCGAATA GCAGCATGGT GGAGCAGACC CCTCTTCTCT CCATTGATCA CCCCGTCCTG 180 

CCCGCTTCTT TTCAGAATAC CCGTACACTG AAAAGGTTTA GAGACAAACA TCTTTCCGAT 24 0 

35 GCTTTGCTCA ATGGATTGAA GCCTCATCGC TCAT CTTTGC AATTGAATGA GGAACTCAAC 300 

TTCGCGGCAG AGCGTCGGGA TTTCGTTTCT CCCCTCTTGC AAACTCGCCA CGCTGCCGGT 360 

GTCCTTTCAT GGCGACCGAC CGATAGGATG CATTTTTATA CAT CGGGCAA TATCGGTCTT 420 

GGCCATGATT TATTGACCGG TGTGCGCAAG GACTTCGGAT GGAATGCTGG TGCCGACTTC 4 80 

TTGCTGACTC AAAATCTTAC GGCACATGTC CAAGGCGGTT GGCAGCAGAA TTTCGGCTTT 540 

40 ATACCTATGA CGGCTGTCAA TGGCCAACTG CGTTGGCAAG CCACCGAGAG ATTGAGTTTT 600 

'ACCACCGGTA TCGATTATCG ACAGGTACAG TGGAATGCTT TCGATAATAG AACGTTCTCG 66 Q 

CTTAAAGGAA GTGCTCGATA CGAAGTGATG GACAATGTCT TTGTCAATGG ATTTGGCAGC 720 

TATCCTCTCT ACAGCAGTAC GCGCTCAGGA CTCAATATGG CTGTTCCGAT GCATGGATTC 7 80 

GGCCCTCAGT ACGGTGGATC GCTTGAGCTG AAAGTCTCCG AGCGATTCGG CTTTGCCGTC 840 

45 GGTATGGAGC GCGAATACAA TATCTGGACT CGT CGGTGGG AAACGCATTA CTTTGCTTAT 900 

CCTGTATTCT ATGGCGATAA GAAG 924 



(2) I M FORMAT I ON FOR SEQ ID NO: 232 



()) SEQUENCE CHARACTERISTICS: 

<AJ LENGTH: 94 5 base pairs 
(BJ TYPE: nucleic acid 
(C) STRAI IDE DNESS : double 
55 (U) TOPOLOGY: circular 



(11) MOLECULE TYPE: Dt.'A ( genomic) 

<iii) HYPOTHETICAL: NO 

(iv) AUTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIUGIVALIS 



(ix) FEATURE: 

(A) NAME/ KEY : mi3C_f eature 
(0) LOCATION 1. . .94 5 

70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

AT GAAAACGA ATAGACGATA CGCATTTGTT TTGCCGCTTC TGCTACTCAC CGGATTGTTG 60 

GCATGGGGGC AGGATTCTTC CCACGGTAGC AATACAGCGT TTGCAACTGA TTCTTCGAGT 120 

AGAGAGTTGC CCACGGAGCA GTCCGCCTAC CGCATTCATT CTGCCTATAT GGTCGGTGGT 180 

75 GGCGGAAGCA TAACGCGCGA CACCTATTTG TCACCCCTTC GTTATGGACG ATGGACACTG 240 
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AATTTGTTGG GA'JAGAAGAC GTTCCCTCTC AAAGCCTCCG ATTCCCGTTG GATGATCCGT 300 

ACCGGGCATG AGCTGGATTT TGCCCTGATG GACAATCCGG CCAATAATGC TCATTTCTAT 360 

TCCCTGCTGT ATAACGGTTC CGCTGCGGCT CTTTACCGCC TTGGCGCTAA GCATCTGCGA 4 20 

GCCGCGTGGA TGGACAATCT GCGCTTGGCA TTCGGCCCGG GCTTGGAAAT CGGGCTTGGA 4 80 

GGAATTTATA GTACACGCAA CGGCAATAAT CCTGCGACAT TGAAGCTCTA CACCAATGCC 54 0 

ATCGCCCAAG CCTCGATAGG ATACTACGTC CCCTCCGAAA CTTTTCCCCT GTATTTTCGG 600 

TTGCTCTCCC AGATCAATCT CTTCGGTATA GCCTATGGAA ATGGTTTTGG TGAGAGCTAT 660 

TACGAGAATT TTTTGCTCAA TAACGGCATT GCAGGCTCCC TGCATTTCAC TTATCCGGGC 7Z0 

AAGTTTACTC GGTTCACGAC ACTCATAACG GCGGATATTC CCATTCGGAA CTTCTGTACG 780 

CTTCGTGTCG GTT ATCGCT A TTCCCATTTG GGCTCTTCGC TTAACGCATT GGATACTCGA 840 

ATCCACAGTC ATACGGCTTT TAT CGGTTTC GTCACGGAGT TTTACCGATT CCGTGGGCGC 900 

AAA GC CAT G A ATACCGGTCG GAGAACCAGT CTTTACTATC ATGAT 945 



15 (2) INFORMATION FOR SEQ ID 110:233 

(1) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 855 base pairs 
tB) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DMA (genomic) 

25 (1111 HYPOTHETICAL: NO 



(ivl ANTI-SENSE: WO 



30 



35 



<vi) ORIGINAL SOURCE: 

(A| ORGANISM: PORYPHYROMOUAS GINGIVALIS 

f ix ) FEATURE : 

1A> NAME/ KEY: miacjeature 
IB) LOCATION 1. . ,855 

Ui) SEQUENCE DESCRTPTION: SEQ ID NO: 233 



60 



ATGGTAGTGA CGCTGCTCGT AATTGTCGGT ATTGTGGTAG TAGTGCGGTA CTCCTTGCGC du 

GTCCACGTAC ATAAGACCGG CACTGTTGTA TCGGCGGCCA TATTTGGCTT CATCCTCCTT 120 

*U GGAAAGACCG TTCCATGCGA TACCCGTAAC TTCTTTTCCT CCGAAAGTGA TGAACCTGAG 1B0 

AGCCGTGTTG CTACCGAAAT AGCCCACCTG TGCGAAATAG GATTTCAGAT CCACGCTTCC 24 0 

TCTATCCACC TAGCCGTCCG AACCGATTTT GGACAGGCGG GCATCCACTG CCCAATGGCG 300 

ACCGATGCGT CCGCTACCGA GTTTGACCGA TCGGCGGAAT GTGCCGAACG AACCTCCGCT 360 

CAAATCGACA CGGCCATAAG GAGCCAGTCC CAAATTATCC GTACGCATAT TGACACTTGC 4 20 

CCCAAAAGCT CCGGCACCAT TGGTGGAAGT ACCCACACCT CGCTGCACCT GAAGGTCTTC 4 80 

GATGGAAGAG CCGAAGTCGG G CAT AT TCAC CCAAAAGACG GACTGAGATT CGGAGT CGTT 540 

GAGGGGTACT CCATTGGTAG TTATGTTGAT GCGATTGGCA TCGGTGCCAC GCACGCGAAA 600 

GCCGGAATAT CCGATACCCG TACCGGCATC GCTGGTGGCT ACCACGGAGG GAGTCAGCAT 660 

CAGCAGATAG GGGATGTCAC GACCATAATT GGACTTGGAA AGTTCGGCCT TGCGAACGTT 720 

t>U GGTGTAAGCG ACAGGGGTTT TCGCCGTGGC GCGAGTAGCT ACGACCTGTA CGGTCTGGAG 780 

CTGCACATTG CTAAGACTAT CTATCTCGCT GTTGGAGACG GGTGCTTGTG CCGTCAGGCA 840 
GAAAGGCAGG ACGGC 



(2) INFORMATION FOR SEQ ID HO: 234 

(1> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

<li> MOLECULE TYPE: DNA (genomic) 
liii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPI1YROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY; misc_f eature 
<B) LOCATION 1...17 97 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



""" ilZb «JL iCf! U, iMd; iLL, CTinj:, ^ 




WO 99/29870 PCT/AU98/01023 

160/ 490 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



10 



15 



20 



25 



30 



35 (2) INFORMATION FOR SEQ ID NO: 235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16SO base pairs 
4n (B) TYPE: nucleic acid 

** U (C) STRANDEDNESS: double 

iD) TOPOLOGY: circular 

<ii) MOLECULE TYPE: DNA (genomic) 
45 (iii> HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
- n (vi) ORIGINAL SOURCE: 

50 * A > ORGANISM: PORYPHYROMONAS GINGIVALIS 



55 



i GTTTGAGGCC 


60 


^ CAAAACGCTC 


120 


l CAAGCGTGAA 


180 


i ATTCCGCTAC 


240 


i GGAAGATAAG 


300 


i GGAGCGCGTG 


360 


1 TACCAAGGGT 


420 


> GATCGACGTG 


480 


. GATTGTGAAA 


540 


AGCAGAGCTC 


600 


CGAAGGTATC 


660 


TGGTCTTATC 


720 


ACAGCTGGAT 


780 


CGCTCTCGGA 


840 


GGTAGGCGAT 


900 


GATTGCACAG 


260 


GCGTTCTGCT 


1020 


CGACCGCGAA 


1080 


TGATATCGAA 


1140 


CAATTTCGGT 


1200 


CCTTTCTTGG 


1260 


CGAAGTTCAG 


1320 


GTTGGAAGAG 


1380 


AGGAACGGTA 


1440 


AGGTTTTGCC 


1S0O 


GAAGTTACCT 


1560 


TAGCCGTGTA 


1620 


GGCTGAAGCC 


isao 


TGTAGAGAAA 


1740 


AGAAAAC 


1797 



<lx) FEATURE: 

(A) NAME/ KEY: miscjeature 
tB) LOCATION 1...1650 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

Z^^™ CAACAAGCGT GAAGTGGTTA TCAATGTAGG GTACAAATCG 60 

60 S^SI^SS TACCTGCAAC AGAATTCCGC TACAATCCCG AACTCAAAGT GGGAGACGAA 120 

GTGGAAGTTT ATATCGAGAA TCAGGAAGAT AAGAAGGGCC AGCTCGTCTT GT^CACCGC 180 

GTAsnrrr^^ ^CGCTCGCTC TTGGGAGCGC GTGAACGAGG CTCTCGAAAA AGACGAAATC 240 

ST™™" ATGTG AAGTG TCGTACCAAG GGTGGTATGA TCGTCGATGT AT TCGGTAT C 300 

TCCCGC CATC ACAGATCGAC GTGCGCCCCA TTCGCGACTA CGATgSttC 360 

65 AGGt"™? ^ TTGTG *^ T <™ AAGAATATAA G^G^St 420 

*l^r~£r rGCTCAT CGAAGCAGAG CTCGAACAAC AGAAGAAAGA AATCATCGGC 4 an 

AAGGGCAGGT ACTCGAAGGT ATCGTCAAGA ATATTACTTC TTACGGAGTA 640 

gTgStC ^TGGTCTT ATCCATATCA CTGACCTT^C ATGGGGTCGT ' 600 

TTTGAT G^AP S^S252[ CGTACAGCTG GATCAGAAGA TCAATGTCGT TATCCTCGAC 660 

70 ZTl^L^ 0 ATCGCAAGCG TATCGCTCTC GGACTCAAAC AGCTGATGCC TCATCCTTGG 720 

G^gSScG i^ GTAGGC ^AAGGTGA AGGGTAAAGT "lo 

GAAAtS£S ESS^ST IS AGATTGc:a CAGGGCGTTG AGGGTCTTAT CCACGTAAGC 84 0 

GT^Iar^ ^ ACACAGCA CTTGCGTTCT GCTCAGGACT TCCTGCATGT AGGCGACGAA 900 

TGATCCTGAC GCTCGACCGC GAAGAACGCA AAATGTCGCT CGGTCTGAAG 960 

75 C^S™ ?? CTGATATC GAAACTCGTT TCCCTGTAG3 CtSS^C 1020 

CATGCTCGTG TTCGCAACTT CACCAATTTC GGTGTATTCG TTG AG AT CG A AGAGGGCGTA 1080 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



161/490 



GATGGCCTTA TCCATATTTC CGACCTTTCT TGGACGAAGA A6ATCAAACA CCCCAGCGAG 114 0 

TTTACGGAAG TAGGTGCTGA TATCGAAGTT CAGGTAATCG AGATCGACAA GGAAAACCGT 1200 

CGTCTCAGCT TGGGTCACAA ACAGTTGGAA GAGAATCCTT GGGATGTATT CGAGACGGTA 1260 

TTCACTGTAG GAT CT AT CCA CGAAGGAACG GTAATCGAAG TGATGGACAA GGGTGCTGTC 1320 

5 GTTTCTCTGC CTTACGGTGT GGAAGGTTTT GCCACTCCGA AGCACATGGT GAAGGAAGAT 1330 

GGCTCACAGG CTGTACTCGA AGAGAAGTTA CCTTTCAAGG TTATTGAGTT CAATAAGGAT 14 4 0 

GCCAAGCGAA TCATTGTATC TCATAGCCGT GTATTCGAAG ATGAGCAGAA AATGGCTCAG 1S00 

CGTGAAGCCA ATGCAGAGCG TAAGGCTGAA GCCAAAGCGG CTCAGAAAGA AGCTGCTGCC 1560 

GAAGCTGCCA ATCCTGCACA GGCTGTAGACi AAAG CC ACT C TCGGAGACCT CGGCGAGCTG 1620 

10 GCCGCTTTGA AAGAAAAGCT TTCAGAAAAC 1650 



(2) INFORMATION FOR SEQ ID NO: 236 

15 (1) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: UNA (genomic) 
(iii) HYPOTHETICAL: NO 
25 tiv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: POR Y PHYROMOl IAS GINGTVALIS 

30 (ir.) FEATURE: 

(AJ NAME/KEY: miscjeature 

(B) LOCATION 1...137 4 



(xi} SEQUENCE DESCRIPTION: SI 

35 

ATGATCGTCG ATGTATTCGG TATCGAGGCT 
CCCATTCGCG ACTACGATGC ATTCGTTGAG 
AATCAAGAAT ATAAGAATGT AGTTGTTTCC 
CAACAGAAGA AAGAAATCAT CGGCAAGCTC 

40 AAGAATATTA CTTCTTACGG AGTATTTATC 
AT CACT G AC C TTTCATGGGG TCGTGTGGCT 
AAGATCAATG TCGTTATCCT CGACTTTGAT 
AAACAGCTGA TGCCTCATCC TTGGGATGCT 
GTGAAGGGTA AAGTTGTGGT GATGGCAGAT 

45 GTTGAGGGTC TTATCCACGT AAGCGAAATG 
GACTTCCTGC ATGTAGGCGA CGAAGTGGAA 
CGCAAAATGT CGCTCGGTCT GAAGCAACTC 
CGTTTCCCTG TAGGCTCTCG T CACCATGCT 
TTCGTTGAGA TCGAAGAGGG CGTAGATGGC 

50 AAGAAGATCA AACACCCCAG CGAGTTTACG 
ATCGAGATCG ACAAGGAAAA CCGTCGTCTC 
CCTTGGGATG TAT T CG AG AC GGT ATT CACT 
GAAGTGATGG ACAAGGGTGC TGTCGTTTCT 
CCGAAGCACA TGGTGAAGGA AGATGGCTCA 

55 AAGGTTATTG AGTTCAATAA GGATGCCAAG 
GAAGATGAGC AGAAAATGGC TCAGCGTGAA 
GCGGCTCAGA AAGAAGCTGC TGCCGAAGCT 
ACTCTCGGAG ACCTCGGCGA GCTGGCCGCT 

60 

(2) INFORMATION FOR SEQ ID NO: 23* 



IQ ID NO: 236 

TTCCTCCCGG GATCACAGAT CGACGTGCGC 60 

AAGACGATGG AGTT CAAGAT TGTGAAAATC 120 

CACAAGGTGC TCATCGAAGC AGAGCTCGAA 180 

GAAAAAGGGC AGGTACTCGA AGGTATCGTC 24 0 

GACCTCGGTG GAGTGGATGG TCTTATCCAT 300 

CATCCGGAAG AAATCGTACA GCTGGATCAG 360 

GAAGATCGCA AGCGTATCGC TCTCGGACTC 420 

CTCGACAGCG AGCTTAAGGT AGGCGATAAG 4 80 

TACGGTGCTT TCGTTGAGAT TGCACAGGGC 54 0 

TCATGGACAC AGCACTTGCG TTCTGCTCAG 600 

GCCGTGATCC TGACGCTCGA CCGCGAAGAA 660 

AAGCCGGATC CTTGGGCTGA TATCGAAACT 720 

CGTGTTCGCA ACTTCACCAA TTTCGGTGTA 7 80 

CTTATCCATA TTTCCGACCT TTCTTGGACG 84 0 

GAAGTAGGTG CTGATATCGA AGTTCAGGTA 900 

AGCTTGGGTC ACAAACAGTT GGAAGAGAAT 960 

GTAGGATCTA TCCACGAAGG AACGGTAATC 1020 

CTGCCTTACG GTGTGGAAGG TTTTGCCACT 1080 

CAGGCT GTAC TCGAAGAGAA GTTACCTTTC 1140 

CGAATCATTG TATCTCATAG CCGTGTATTC 1200 

GCCAATGCAG AGCGTAAGGC TGAAGCCAAA 1260 

GCCAATCCTG CACAGGCTGT AGAGAAAGCC 1320 

TTGAAAGAAA AGCTTTCAGA AAAC 137 4 



i\) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

( D) TOPOLOGY: circular 

til) MOLECULE TYPEi DNA (genomic) 

70 

(iil) HYPOTHETICAL: NO 

tiv) ANTI -SENSE : NO 

75 (vi) ORIGINAL SOURCE : 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



10 



20 



*LJi f ; ]Z"u 43 « 3 



Ixl) SEQUENCE DESCRI PTIOH : SEQ ID NO:237 



WO 99/29870 PCT/AU98/O1023 , . ,. 

162/490 

(A) ORGANISM: PORYPHYROMOIIAS GINGIVALIS 
(ixl FEATURE: 

(A) NAME/KEY: ml5C_featur- . t : - ■ - * T Tr . ^ 

(B) LOCATION 1...1278 W 

A I GGAGTTCA AGATTGTGAA AATCAATCAA GAATATAAGA ATGTAGTTGT TTCCCACAAG €0 ■ >> f - 

SJSST™ 06 AAGCAGACCT CGAACAACAG AAGAAAGAAA TCATCGGCAA GCTCGAAAAA 120 '* * * '\ - ' ■ 

^?i^ T ^ TCGAAGGTAT CGTCAAGAAT ATTACTTCTT ACGGAGTATT TATCGACCTC 180 . . , , . 

GGTGGAGTGG AT GGTCTTAT CCATATOACT GACCTTTCAT GGGGTCGTGT GGCTCATC^G 24 0 

GAAGAAATCG TACAGCTGGA TCAGAAGATC AATGTCGTTA TCCTCGACTT 1*GATGAAGAT 300 < 

15 TCGCTCTCGG ACTCAAACAG CTGATGCCTC ATCCTTGGGA TGCTCTCGAC 360 

™^52Z A AGGTAGGCGA TAAGGTGAAG GGTAAAGTTG TGGTGATGGC AGATTACGGT 42 0 

GCTTTCGTTG AGATTGCACA GGGCGTTGAG GGTCTTATCC ACGTAAGCGA AATGT CAT GG 480 

ACACAGCACT TGCGTTCTGC TCAGGACTTC CTGCATGTAG GCGACGAAGT GGAAGCCGTG 540 

^ISS^SSSS" IS GACCGCCA AGAACGCAAA ATGTCGCTCG GTCTGAAGCA ACTCAAGCCG 600 

^n^S? CTGATATCGA AACTCGTTT C CCTGTAGGCT CTCGTCACCA TGCTCGTGTT €60 , - 

CGCAACT1CA CCAATTTCGG TGTATTCGTT GAGATCGAAG AGGGCGTAGA TGGCCTTATC 720 

CATATTTCCG ACCTITCTTG GACGAAGAAG ATCAAACACC CCAGCGAGTT TACGGAAGTA 780 

GGTGCTCATA TCGAAGTTCA GGTAATCGAG ATCGACAAGG AAAACCGTCG TCTCAGCTTG 840 

GGTCACAAAC AGTTGGAAGA GAATCCTTGG GATGTATTCG AGACGGTATT CACTGTAGGA 900 

TCTATCCACG AAGGAACGGT AATCGAAGTG ATGGACAAGG GTGCTGTCGT TTCTCTGC^T 960 

I??22 TGTGG AAGGTTTTGC CACTCCCAAG CACATGGTGA AGGAAGATGG CTCACAGGCT 1020 

S^S—* AGAAGTTACC TTTCAAGGTT ATTGAGTTCA ATAAGGATGC CAAGCGAATC 1080 

A JISI^ CTC ATAGGCGTGT ATTCGAAGAT GAG CAGAAAA TGGCTCAGCG TGAAGCCAAT 114 0 

GGAGAG J GTA AGGCTGAAGC OAAAGCGGCT CAGAAAGAAG CTGCTGCCGA AGCTGCCAAT 1200 ' 

30 SgAAA^ AGCCACTCTC GGAGACCTCG GCGAGCTGGC CGCTTTGAAA 1260 



12) INFORMATION FOR SEQ ID NO: 238 

35 " (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 

4Q (D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA (genomic) 

Uli> HYPOTHETICAL: NO 

45 (iv) ANT I -SENSE: HO 

IrL) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

50 (ix) FEATURE: 

(A) NAME/ KEY: misu_f eature 

(B) LOCATION 1...720 

55 



60 
120 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

A 'f GAAAAAAG CTATTCTTTC CGGAGCGGCC TTGCTCCTCG GCCTATGTGC CAACGCACAA 

£SS?2SI TGCACTACGA TTTCGGT CAT TCCATCTACG ACGAACTAGA TGGACGTCCC 120 

£2*.™™ CCACAGTGGA AAACTTCACA CCCGACAAAT GGGGAAGCAC CTTCTTCTTC 180 

60 AT ' i,ACACGGG CAAGGGTATC CAGTCGGCCT ATTGGGAGAT TTCGCGCGAA 24 0 

b0 S^HTI GGCAAGCTCC CGTTTCCATT CATTTGGAGT ACAACGGAGG CCTCTCCACA 300 

tSSoJ^ SSSS^* rGCTC «W GGTGCCACCT ACACCTACAA CAACCCCTCC 360 

™^^ GrG GATTTACGAT CACGCCCATG TACAAGCATC TGGGTGCGCA CGACTTCCAC 4 20 

A SI A ^ JSSSS^ TTGGTACATG CACTTTCTGG ACGGTCTGCT TACCTTCAAC 480 

Ctr °?rTTSSI ATCTTTGGGG TTTCCCCCAA GAGAACCCAA TCGGGGGCCC TGTGCTCAAA 54 0 

^ AA ? 3GGATA AGTTCGTATT CCTGTCCGAA CCGCAGTTCT GGATCAACCT CAATCGCATC 600 

TTCG^TCGCA TGGArftAATT CAATCTCAGC ATAGGGACAG AGATGGAAAT CAGCAGGAAC 660 

TTCGCTCGCA TGGACAAATT CTCCTGCATC CCTACTCTTG CGGTCAAATG GACTTTCAAC 72 0 



70 (2) INFORMATION FOR SEQ ID NO: 239 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 
-c IB) TYPE: nucleic acid 

/D <C| STRANDEDNESS: double 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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PCT/AU98/01023 



(D) TOPOLOGY; circular 

Ui) MOLECULE TYPE: DNA (genomic) 

5 till) HYPOTHETICAL: MO 

(iv) WITT -SENSE: MO 

(vi> ORIGINAL SOURCE: 
10 (A) ORGANISM: PORYPHYROHONAS GINGIVAL1S 

(ix) FEATURE: 

(A) NAME/KEY: nd.3c_f eature 

(B) LOCATION 1...1302 



15 



45 



55 



60 



65 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 239 



ATGTATAAAG ACTATAAGGG TTTGTATGCG TCGCTTCGGT GGTATGCCCT GATCATTGGG 60 

TTGCTATTTG CAGCAGACGG TATACAGGCT CAGAACAACA ACTTTACCGA GTCGCCTTAC 120 

ZU ACTCGCTTCG GCCTTGGCCG TCT CGGAGAA CGGACGACTA TTAGTGGGCA TTCCATGGGA 180 

GGACTCGGCG TCGGTCTGCG TCAGGGCACA TACGTCAATG CCGTCAATCC TGCTTCATAC 24 0 

TCGGCTGTGG ATTCGATGAC GTTTATCTTC GATTTCGGTG CATCTACCGG AATTACGTGG 300 

TATGCCGAGA ACGGGAAAAA GGACAATAGG AAAATGGCAA ACATTGAGTA TTTCGCCATG 360 

CTTTTTCCTA TTTCCAAATC CATTGCTATG AGTGCGGGAG TGCTTCCTTA CTCCGCATCC 420 

GGGTACCAGT TCGGATCCGT TGATCAAGTG GAAGGAGGCA GCGTCCAGTA CACCCGTAAA 4 80 

TACTTGGGGA CAGGCAATCT GAACGATCTC TATGTCG3TA TAGGTGCAAC CCCGTTCAAA £4 0 

AACTTCTCAA TAGGAGCCAA TGCTTCATCC CTTTTTGGGC GATTCACACA CAGCAGGCAG 600 

GTAATCTTCT CCACGGAGGC TCCTTACAAT CCCGTACATC TCTCGACGCT GTACTTGAAG 660 

GCTGCCAAGT TCGACTTCGG TATGCAGTAT CACCTTCTTC TCAAATCAGA TCGTTCGCTC 720 

GTTATCGGTG CCGTCTATTC TCCGCGGGTG AAGATGCATA GCGAGCTGAC TCAGATAAAG 7 80 

AATCAGGTTC AGAACGGTGT AGTAGTGGAG AGCCAAACCC AAGAATATAT CAAGGGAATG 84 0 

GACTATTATA CCCTGCCTCA TACATTGGGG ATAGGTTTTT CTTATGAAAA GAAAGATAAA 900 

CTTCTCTTAG GAGCAGACGT CCAATA'i'AGT AAATGGAAAG GCGAGAAATT TT AT A AATCC 960 

GATTGCAAAT TCCAGGACAG AATACGGGTA TCTCTCGGCG GAGAGATCAT ACCGGATATA 1020 

Jt> AATGCCGTTG GGATGTGGCC TAAAGTTCGC TATCGCTTCG GTTTACATGG TGAAAATTCT 1080 

TACCTGAAAG TGCCGACTAA AGGCGGTGTA TATCAAGGAT ACCATAT CGT AGGT GCTGT A 114 0 

TTCGGTATAG GAATCCCGCT CAATGACAGA CGTTCGTTCG TAAATGTCTC TCTTGAATAT 1200 

GACCGATTGA TCCCGAAGGA GGGTATGATC AAAGAAAATG CTCT GAAATT GACCTTCGGC 1260 

^ CTCACGTTCA ACGAGTCATG GTTTAAAAAG CTGAAACTGA AC 1302 

(2) INFORMATION FOR SEQ ID NO: 24 0 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 7 3 bass pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( ix > FEATURE : 

(A) NAME/ KEY : iuisc_feature 

(B) LOCATION 1...277 8 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:240 



ATGCGATCGA TTTATCAATT ACTGTTGTCA ATACTCCTTG CTTCTCTTGG TTTCGTCGGG 60 

CTGGAAGCCC AACAAGCCGG AGTAGCAGGT AGAGTATTGG ACGAAGAAGG CAACCCCATG 120 

ATTCAAGCCA ACGTACAGCT TGTACAGAGT ACCGGCCAAG TAGCCGTTGC CGCAGGTGCC 180 

ACTAATGAAA AAGGGTTGTT CAGCCTGAAA ACGTCACAGG AGGGT G ACTA CATTCTGCGC 240 

GTTTCATATG TAGGTTACAC TACCCACGAC GAAAAAATAT CTCTTAGAAA CGGGCAAACC 300 

ATTACGCTCA AAGATATATC CATGAACGAA GATGCCCGTC TTCTACAGAG TGTGACGGTG 360 

CAGGCTAAAG CGGCAGAGGT CGTGGTACGC AACGATACGC TCGAATTCAA TGCCGGATCC 420 

TATACCGTAG CACAGGGAGC TTCTATCGAG GAACTGATCA AGAAGCTACC CGGAGCAGAG 480 

ATCGGATCCG ATGGGAAGAT CACCATCAAC GGCAAGGACA TTAGCAAGAT CCTTGTCGAT 540 

GGCAAAGAGT TTTTCTCCAA AGATCCACAG GTGGCAATAA AGAATCTTCC GGCCGATATG 600 

GTCAATAAAC TACAGGTACT GAACAAACTG AGCGAGCTGT CGCGGATGAG CGGTTTCGAT 660 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



C'b jf 1* m h^.i> Oi iiZ2 



it J * 



WO 99/29870 



PCT/AU98/01023 



164/ 490 



10 



15 



20 



25 



30 



35 



GATGGAGAAG 
TTCGGAACGC 
AATCGGTTCG 
ATGGGCTTTA 
GGTGGTCGAC 
AACTTCAGTG 
AACGACAAGG 
TATATGGACG 
ATGCAATGGA 
AAGATCGATG 
AACAAAGGTT 
GATATCAGTC 
CTGACCGACG 
CAAAAGCAAT 
CCGTTGGGTA 
TCGGATCGTC 
TACGGACTCT 
ATI' GCCAAAA 
TATCGGAGCG 
CCGATGCTCC 
GGACGCACGA 
CTATTCGTGA 
ATGTTCTCGG 
TATACATTCG 
ACTCGTTACG 
CC ACT CAAG A 
CAAAGCTTCA 
ACGCTGACCT 
ATGGCGAATA 
TATCAAGTTG 
ACGAACTCCG 
TCATACAGCT 
GGTCAGCGGT 
AATACGATCG 
- GGTGGATCTC 
GGTGGCAGAC 



AGGAGACCGT 
TTCAGGCCGG 
ATGGAAATAA 
GCGAGATGGA 
GCGGCTTCGG 
TCGAATTCTC 
CCATAGAGAC 
AAAATATATT 
AACCGTCCGA 
GGTTCTTTAA 
CTATCCACCA 
ACAAGCTCAA 
AAGACGGAGA 
TCAACGACAA 
AAAACi'ACTT 
AGGTGTACCG 
CCTACAGTAA 
CGTGGGACTA 
TAGCCGGAGT 
GAATCAACTA 
CACAACCATC 
CGGAAGGCAA 
ACTTCGATGC 
ACGACATCGT 
AAAACGCCTC 
AC r\GG GC ATT 
TCAATGACGA 
ATCGCAACAA 
ATAGTCTGAG 
CCCTAACGCT 
GTTACAGCGG 
TCCTCCGTGA 
CAAGT AT CAG 
GACGCTACGT 
GCAGCGATCA 
GACCGTCC 



AATCAACCTG 
CTACGGTACC 
GCAATGGACA 
CAGCGAGATG 
CAATAGTGCA 
CTCTGCCCTT 
GACCAAACGC 
GGAACGCTCT 
ACGTACCGAA 
CGACACATAC 
AACTACACAA 
CGACGAAGGC 
TGGCATATAT 
CTCCAACCTG 
CGCACAAGCG 
ACTGGGCGAT 
CGAGTTCACC 
CACCGTAGGA 
AGAGCAGGAC 
CAAACCGAGC 
CAT CAAT CAG 
TCCCGGTCTG 
CAAAAGTCAG 
CCCCAATACG 
CGGTACGTGG 
TTCTTTCAGG 
TAAGAACAAA 
TTGGATCGAT 
CGGACAGAAA 
TCCCTATGGA 
AGGATTCAGT 
CAAGGCCGGT 
CCGTTCTGCT 
GATGGTGGAC 
TCAGCGTGGC 



ACGGTGAAGC 
GACCAACGCT 
TTGATCGGTA 
GGATCCATGA 
GGTGTTACGT 
AATACAGGAG 
GTGGAAAATA 
TTCTCTCACA 
GTGGTATTCG 
GAGACGAAAG 
GGAAACAACT 
CGTACGATCA 
CAGGCTGTGC 
CAATATCGGC 
ATTCTGAACA 
GACGGGCAAT 

cagtatcgca 
ttcaatgtgg 
aaactggctt 

AGGACTACCA 
ATCGCTCCCG 
AAGCCGAGCT 
CGAGCTTTCA 
CACTACGATC 
CAAGCGAATC 
ATGTCCTTGT 
GCTCTCTCTT 
ACGAGTATCG 
GATTCTCGCA 
TTCCGTATCG 
CTGGACGAAT 
ACACTGCGTG 
TCGGCCATCA 
TTTATCTACC 
AATATGAATC 



CCGAAAAAAA 
ATATGGCCGG 
GTGCGAACAA 
CCTTCTTCTC 
CTTCGTCGAT 
GCGATGCACG 
TCCTCGCCGA 
ATGGTCAGGC 
AGCCGGATCT 
ATGCCACCGG 
TCAGACTGAA 
GTGCCTCCGT 
TCCAAAGCGT 
TTCGCCTCTC 
GACGTTTCTC 
ACTCCATATT 
TCGGACTCAA 
ATCCCAACAG 
TCAATCGTGT 
ACCTCCGAGT 
TTCAGGACAT 
ATTCCAACAA 
ACATTGTTTT 
CGTCTACAGG 
TTCATGGGAC 
TCAACAGGTT 
TCCGAACGAG 
GTGGCAATAT 
CATACGATTT 
ACAGCGATGT 
GGCTTTGGAA 
TCAATGGCTA 
ATATAGAAGA 
GATTCAACGC 
GTCCGGGCCC 



GAAAGGCCTC 
AGGGAACGTC 
TACGAACAAT 
TCCCCAAGGC 
GCTGGGOGGC 
CTACGGATAC 
AGGGAATACT 
GCGATTTAGG 
TTCGATATCC 
AATCTCTATC 
CGGAGAATTG 
CAGTGGCGGT 
GGAGACGAAT 
GTATGTGGAA 
CCGTCGCAAT 
AGACAGTCAG 
CCTCAAGAAG 
AACTGTCAGC 
CAATCTCTCC 
GGACTACCGA 
CACGAATCCG 
TGTGATGGCC 
CTTCGGCAAC 
GATCCGTACC 
ACTATCGCTT 
GGCCGAAGGA 
GGAACGCCTG 
CGGATTCT^T 
TGGCGGCA^T 
TGAATACAAT 
TGCTTCGCTT 
TGACATCCTC 
GAGCATGTCC 
CTTCAGTGGT 
ACCTTTCGGC 



770 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2500 
2640 
2700 
2' 60 
2778 



40 



45 



{2) INFORIiATION FOR SBQ ID NO: 241 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: circular 



50 



tii) MOLECULE TYPE: DUA (genomic) 
(ill J HYPOTHETICAL: HO 
(iv) ANTI -SENSE: NO 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIMGIVALIS 

(IX) FEATURE: 

(A) NAME /KEY : misc_f ea ture 
(Bl LOCATION 1...1200 

ixl) SEQUENCE DESCR1 PTION : SEQ ID MO: 241 



ATGGTTATGA 
CTTCCT GCGT 
GAAATACGTA 
GATACGGAAT 
CAGTCCTACG 
GTTCCCGGTC 
TCAGGTGCCG 
TACTATGATG 
CCTTATACGT 
TGGTTCTACT 
TGGAGCTACT 
CCTTATGGCT 
CACCACTACC 
GGCCG ACCGA 



AGCTGATTAA 
ACT CGCAG AA 
AACAAAACCA 
ATACAGTGGC 
ATGGGAAAAA 
GCTATAGTCG 
ACAAT GTATA 
ACGCGTCGTC 
CATGGTATCC 
ACGGTAGCCA 
ACTATGATCC 
GGGGCAGCTA 
CCAAGAAGAC 
ATCGTATCAA 



AAGAAGTTTG 
TGATGACATC 
AGTTAAAGAC 
TTCCGATCGG 
GTTGTCCAAA 
CCGCTTGGCT 
TGTAACTGAT 
TGTAAACATT 
AT CTTTCi'CC 
T AT CC GAT GG 
TTTCTACAAT 
TTACGGTTGG 
CTATTCCAAT 
AGGTGGAACG 



CTCCTGCTTG 
TTCGAAGATG 
TGGCAAAACC 
GACATTGACG 
GACAAGAAAA 
CGATTCTATA 
GATGGTGAGT 
TACATCAACA 
GGCTGGTACA 
GGCGGTTATT 
CCCTATGGAA 
GGTGGCTATC 
GGTCAGCATT 
TCCGGTGCCA 



GAGCGGTACT 
ACATCTATAC 
AAGAGGACGG 
CCTACAATCG 
GAGACTCCAC 
AGCCGAATAC 
ATTTCGTCTA 
GTCCTTCGTG 
ACTATACGTG 
ACCCCGGATA 
TCGGTATGGG 
CGGGAGTGAT 
CCGGAGCTTA 
AACTTGGGAC 



GCTGATTACG 
ATCGCGAAAA 
ATACGGCGAC 
TAGAGATGGC 
TCGTTCTTCT 
GATCGTCATT 
T GG AGACGAA 
CGATCCGTTC 
GAACTATCCA 
TAATTGGTAT 
TTGGGGATAT 
ACATCACTAC 
CTATTCTTAT 
AGGACGCTAC 



60 
120 
180 
210 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
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GATAGAATTC 
AATAATAATC 
ATAGAAACGG 
CAGTCCGGCA 
AACGATAGGA 
TC7TCGCTCTT 



AAAATT CGT C 
TGCAAAATGT 
TAACTCCAAA 
ATGACCGACC 
CATTTTCGAC 
CTTCCGGCTC 



TTCGCAAAAA 
CAAGTCGGGA 
CAACGGGCAA 
GACCGGACGG 
TCCTTCTCGT 
TATGAGCGGA 



AATAAGTTCG 
CGTACCGGCC 
AAGCAGAATC 
AATATCCGCA 
AGCAATAGTA 
GGT G GCGGAC 



GATTGCAGTC 
GAGCCAATAG 
GTCCCGTATT 
GCGAGAGACA 
ACGGTGGCTT 
GTAGTGGCCG 



GAACAAACCC 
AGACCGAAAT 
CCAGCAGAAT 
GCGCGAAAAT 
CTCCACGCCT 
GGGACGCAAT 



9O0 
960 
1020 
1O80 
1140 
1200 



10 



15 



20 



25 



30 



35 



40 



45 



50 



(2) I N FORI IAT I OH FOR SEQ ID NO: 242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 
(8) TYPF. : nucl-lc acid 
(C) STRANDEDNESS : double 
{0) TOPOLOGY: circular 

(ii) HOLECULE TYPE: DNA (genomic) 

(ixi) HYPOTHETICAL: HO 

(iv) ANT I -SENSE: JIO 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: PORYPHYROMONAS GIMGTVALIS 

(ix) FEATURE: 

tA) NAME/KEY: misc_f eatur- 

(B) LOCATIOtI 1 1194 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 24 2 



ATGAAGCTGA 
GCGTACTCGC 
CGTAAACAAA 
GAATATACAG 
TACGATGGGA 
GGTCGCTATA 
GCCGACAATG 
GATGACGCGT 
ACGTCATGCT 
TACTACGGTA 
TACTACTATG 
GGCTGGGGCA 
TACCCCAAGA 
CCGAATCGTA 
ATTCAAAATT 
AATCTGCAAA 
ACGGTAACTC 
GGCAATGACC 
AGGACATTTT 
TCTTCTTCCG 



TTAAAAGAAG 
AGAATGATGA 
ACCAAGTTAA 
TGGCTTCCGA 
AAAAGTTGTC 
GTCGCCGCTT 
TATATGTAAC 
CGTCTGTAAA 
ATCCATCTTT 
GCCATATCGG 
ATCCTTTCTA 
GCTATTACGG 
AGACCTATTC 
TCAAAGGTGG 
CGTCTTCGCA 
ATGTCAAGTC 
CAAACAACGG 
GACCGACCGG 
CGACTCCTTC 
GCTCTATGAG 



TTTGCTCCTG 
CATCTTCGAA 
AG ACT GGCAA 
TCGGGACATT 
CAAAGACAAG 
GGCTCGATTC 
TGATGATGGT 
CATTTACATC 
CTCCGGCTGG 
ATGGGGCGGT 
CAATCCCTAT 
TTGGGGTGGC 
CAATGGTCAG 
AACGTCCGGT 
AAAAAATAAG 
GGGACGTACC 
GCAAAAGCAG 
ACGGAATATC 
T03TAGCAAT 
CGGAGGTGGC 



CTTGGAGCGG 
GATGACATCT 
AACCAAGAGG 
GACGCCTACA 
AAAAGAGACT 
TATAAGCCGA 
GAGTATTTCG 
AACAGTCCTT 
TACAACTATA 
TATTACCCCG 
GGAATCGGTA 
TATCCGGGAG 
CATTCCGGAG 
GCCAAACTTG 
TTCGGATTGC 
GGCCGAGCCA 
AATCGTCCCG 
CGCAGCGAGA 
AGTAACGGTG 
GGACGTAGTG 



TACTGCTGAT 
ATACATCGCC 
ACGGATACGG 
ATCGTAGAGA 
CCACTCGTTC 
ATACGATCGT 
TCTATGGAGA 
GGTGCGATCC 
CGTGGAACTA 
GATATAATTG 
TGGGTTGGGG 
TGATACATCA 
CTT ACT ATT C 
GGACAGGACG 
AGTCGAACAA 
AT AG AG AC CG 
TATTCCAGCA 
GACAGGGGGA 
GCTTCTCCAC 
GCCGGGGACG 



TACGCTTCCT 
AAAAGAAATA 
CGACGATACG 
TGGCCAGT CC 
TTCTGTTCCC 
CATTTCAGGT 
CGAATACTAT 
GTTCCCTTAT 
TCCATGGTTC 
GTATTGGAGC 
ATATCCTTAT 
CTACCACCAC 
TTATGGCCGA 
CTACGATAGA 
ACCCAATAAT 
AAATATAGAA 
GAATCAGTCC 
AAATAACGAT 
GCCTTCTCGC 
CAAT 



60 
120 
180 
240 
300 
360 
420 
4B0 
54 0 
600 
660 
720 
780 
840 
900 
960 
102O 
1080 
1140 
1194 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1743 base pairs 
(B> TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(P) TOPOLOGY: circular 

(ix> MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(IV) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOI IAS GIHGIVALIS 

lixj FEATURE : 

(A) NAME /KEY : mlsc_f-ature 

(B) LOCATION l._ .1743 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



20 



25 



30 



AT G ATCCGCA AGTTGATCTT ACTGOfTGCT CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 60 

GTACCAACGG ACAGCACGGA ATCGAAAGAC AATCGTATCC TTACAAGCAT GCAATCCTCC l'Q 

TCTTTGAATA GGGATGATGC TCCGGATAAA TGGCAACCTA TGCATGCCAA TTTCAGTATT 180 

CAGAGCGATA TGCTGCTTTC TACTGCCCAA AAGTCCAAGA ACACCTGGTT CGGCAACTCC 240 

TATATGATGG GTATAATCAA GAACAATTAT CTGGAGTTTG GTGCCCGTTT CGAGGATCTC 300 

TATAAGCCCC TGCCCGGACA TGAACCCGAG ATGGGGCGTG GCGTTCCTCA CATGTATGTG 360 

AAGGGAAGCT ATCATTGGGC GGAGCTGACT ATGGGAGACT TCTACGATCA GTTCGGTAGC 4 20 

GGTATGGTAT TCCGCACCTA TGAAGAGCGC AACCTCGGTA TAGACAACGC GGTTCGCGGC 4 80 

GGACGTATAG TACTCACTCC TTTTGATGGA GTGCGTGTCA AGGGTATTGC AGCACAGCAG 5jn 

CGTAACTACT TCGACCGCAC GGGCAAGGTA TTCAATTCCG GCCGAGGCTA CCTACTGGGT 60n 

TCTGATCTGG AGCTGAATGT AGAGCGTTGG AGCAGTGCCA TGCGCGACAA TGACTATCAT 660 

TTGGCTATCG GGGGATCGTT CGTTTCCAAA CACGAAGCAG ACGAAGATAT ATTTGTGGGT 720 

GTAGGCGAAG ATCGCAAGCG ACTCAACCTG CCGCTCAATG TCCCGATTAT GGGCCTGCGC 780 

ACCAACTTTC AAAAAGGAGG TCTCGCCCTC TACGCAGAG1' ATGGATACAA ATACAACGAT 84 0 

CCCTCGGCAG ACAATGACTA TATCTACCAC GACGGACAGG CTGCACTCCT CTCTGCCTCA 90^ 

TACTCCAAAA AAGGGATGAG TATCCTGTTG CAGGCCAAAC CTTGTGAGAA CTTTGCTTTC 96" 

CGCAGCAAGC GAACTGCCCA GCTCACACCG CTTATGATCA ACTATATGCC GGCTTTTACC 1020 

CAAGCTCACA CTTATACGCT GGCGGCCATC TACCCCTATG CTACTCAGCC TCAGGGAGAA 1080 

TGGGCTTTCC AAGGTGAACT GCGTTACAAC TTTGCTCGCC GGACAGCTCT CGGTGGACGC 114^ 

TACGGTACCG GCTTGCGTAT CAACGTTTCG CATGTGCGTG GTCTGGACAA AAAGATGCTC 120" 

AAAGAGAATC CCGACGAACT GATCGGAACG GATGGCTACA CCGTTTCTTT CTTCGGCATG 1260 

GGCGACCTCT ATTATTCGGA TATAGATGTG GAGATTACTA AAAAGGTAAG CCCAGGATTC 13-0 

AACTTTACGC TCACCTACTT GAATCAGATC TACAATAACA AGGTACTGCA CGGTGCAGCC 138" 

GGAGAGAAGC CTGAGAAGAT CTATGCCAAT ATCTTCGTCT ATCATGGTAA GTATAAGCTG 14 4 n 

AGTAATAAGG TAGCCCTCCG TACCGAACTG CAATATTTGC ACACGAAGCA GGATCAGGGT 15 or 

GACTGGATCT ACG GCATGGC CGAGCTCTCT ATCCTGCCTT CTCTGATGCT TTCCCTCTC3 15er 

GAGCAGTATA ATATCGGAGA GACCAAGAAA CATTATGTCA TGGGGTCTGT CACTATACT 162^ 

CACGOAGCAC ATCGAGTAGC '1TTCTCTGCA GGCAAAACCC GTGCAGGGAT GAACTGCTCG 1680 

GGAGGTGTAT GTCGTGTGGT CCCTGAGACT CAGGGATTCT ACCTTTCTTA TAGCACCAAT 17 4 0 
CTG 



1743 



35 



40 



45 



50 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 244 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 717 base pairs 
<B> TYPE: nucleic acid 

(C) ST RAN DE DNE SS : double 

(D) TOPOLOGY: circular 

til) MOLECULE TYPE: DMA (genomic) 

tiii) HYPOTHETICAL: NO 

<iv) AHTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

<AJ ORGANISM: PORYPHYROMONAS GIHGIVALIS 

( ix ) FEATURE : 

<A) NAME/ KEY : misc_feature 

(B) LOCATION 1. . .717 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 4 

- ATGCGTTCTT TATTTTTGAG CGCGTTGCGC AGCTCCTCTC TCCATGGTTC AGAGCGACGC 60 

AGTCGGATAA GTTCTTCTGT AGTCATGTCA ATAAGGCAGA AAATAAGGTT ATTCCATCTC 1?0 

TCGGTATGCG CCCAAACGCA TGATCATCTC ATCGAAATCC ACTTGGTGTG CATCGAATTC 180 

DU GGGGCCATCG ACACAGACGA ATTTCGTCTG TCCTCCCACG CTTATACGAC AAGCCCCACA 240 

CATACCGGTG CCATCCACCA TAATTGTATT GACAGAAGCT ATGGTCGGTA TCTCGTAACG 3nrs 

TTTGGTCAGG AGAGAAACGA ACTTCATCAT CACAGCCCGC CCGATCGTAA CGCAGAGGTC 360 

TACCGTTTCC CGTTTGATAA CGCTTTCCAC TCCATCCGTT ACGAGGCCTT TCGTCCCATA 420 

AGACCCATCG TCTGTCATGA TGATCACTTC ATCGCTATTG GCTCGCATTT GTTCTTCAAG 4 8r> 

GATAACCAGA TCTTTAGTTC TGGCAGCCAA TACGACAATT ACACGGTTGC CTGCTTTGTG 54* 

GAAAGCCTCC ACGATCGGGA GCAAAGGAGC CACACCCACA CCGCCTCCGG CACAAACCAC 60* 

TGTGCCGACC TTTTCGATAT GCGTACTCTG TCCCAGCGGA CCTACCACAT CCGTGATATA 660 

GTCGCCGACT TCGAGTTCGG CCAATTTCTT GGAAGATTTG CCCACGGCCT GAACCAC 1' ~ 



(2) TNFORJtATIOM FOR SEQ ID HO: 24 5 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 PCT/AU98/01023 

167/ 490 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) , . 

(iiil HYPOTHETICAL: HO * / " / 

(i'v) AWT I - SKNSE : HO ^ * ■ - J . ^ 

10 (vi) ORIGINAL SOURCE: , . . ~ .. . 

(A) ORGANISM: PORY PHYROMONAS GIHGIVALIS ' 

(ix) FEATURE: " " * * " * ~ - ' ' ^ J 

(A» NAME/KEY: misc_f eature 
15 (B) LOCATION 1...633 

Ixi] SEQUENCE DESCRIPTION: SEQ ID NO: 24 5 

ATGTCAATAA GGCAGAAAAT AAGGTTATTC CATCTCTCGG TATGCGCCCA AACGCATGAT 60 

ZU CATCTCATCG AAATCCACTT GGTGTGCATC GAATTCGGGG CCATCGACAC AGACGAATTT 120 

CGTCTGTCCT CCCACGCTTA TACGACAAGC CCCACACATA CCGGTGCCAT CCACCATAAT 160 

TGT ATT GAGA GAAGCTATGG TCGGTATCTC GTAACGTTTG GTCAGGAGAG AAA CG AACTT 24 0 

CAT CAT CAC A GCCGGCCCGA TCGTAACGCA GAGGTCTACC GTTTCCCGTT TGATAACGCT 30" 

TTCCACTCCA TCCGTTACGA GGCCTTTCGT CCCATAAGAC CCATCGTCTG TCATGATCAT 360 

CACTTCATCG CTATTGGCTC GCATTTGTTC TTCAAGGATA ACCAGATCTT TAGTTCT GGC 4?Q 

AGCCAATACG ACAATTACAC GGTTGCCTGC TTTGTGGAAA GCCTCCACGA TCGGGAGCAA 480 

AGGAGCCACA CCCACACCGC CTCCGGCACA AACCACTGTG CCGACCTTTT CGATATGCGT S4 0 

ACTCTGTCCC AGCGGACCTA CCACATCCGT GATATAGTCG CCGACTTCGA GTTCGGCCAA 60C 
TTTCTTGGAA GAT TTGCCC A CGGCCT GAAC CAC 



50 



60 



70 



30 

(2) INFORMATION FOR SEQ ID NO: 24 6 

U) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 2343 base pairs 

{B> TYPE: nucl-ic acid 
(C) STRANDEDNESS: double 
(D( TOPOLOGY: circular 

40 (11) MOT.F.CULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

tiv) ANTI-SENSE: NO 

45 



633 



(Vi) ORIGINAL SOURCE: 

(A) ORGAHIS1I: PORY PHYROMONAS GINGIVALIS 

tlx) FEATURE: 

(A) NAME/ KEY: mlgcjeature 

(B) LOCATION 1...2343 

<xi> SEQUENCE DESCRIPTION: SEQ ID HO: 24 6 

55 ATGCATCGTC CTAAGCCTTC AT AT AT TGT T CGAATAGCAG CCATTCTCTG CTTGTTTGTC 60 

GGCA GG 5 P rT TGTTTGCGCA GAGCTATGTG GACTACGTCG ATCCGCTGAT CGGGACGCTA 120 

AGTTCTTTTG AGCTGAGTGC GGGCAATACC TAT CCGGT GA TCGGTTTACC GTGGGGAATG 180 

AATAGCTGGA CACCGATGAC CGGTGTACCC GGTGACGGCT GGCAATATAC CTACTCGGCA 240 

CACAAGATTC GCGGATTCAA ACAGACCCAC CAACCCAGTC CTTGGATCAA CGACTACGGC 300 

CAATTCTCCC TTCTTCCCCT TACGGCACCG CAGAAGCCAT CATCGAACGA CTCCATAGCT 360 

CTGACTAAAT GGTGCAAGCA ACTCTTTTCG GACGAACAGA CCTCGTGGTT CTCGCACAAA 420 

GCGGAGACGG CGACGCCATA CTATTATAGT GTCTATTTGG CCGATTACGA CACACGCGTG 4 80 

GAGATGGCTC CGACCGAGCG TGCAGCTATC TTTCGCATAC GTTATTCCGG CAATACCGAA 54 0 

AGTGGCTCCG GTCGATGGCT TCGTCTTGAT GCCTTTACCG GTGGTTCGGA GATTAGCATC 600 

GTGGATCCTC ACACCGTAGT GGGCATATCT CGCAAGAATA GCGGAGGTGT GCCGGCTAAC 660 
TTCGCCTGTT ATTTCATCCT GCAGTCCGAT ACTCCTATGG CCGATGTCCT GCTTGAGACA - 720 

GATACCGGCA AGTCAGACGA AGGCACAAGG GCATGGGCAG CCTGTCGCTT CGATTCGCAA 780 

GAAGTTACCG TCCGGGTGGC ATCTTCTTTT ATCAGTGTCG AGCAGGCCGA AAGAAATCTT 84 0 

GCGGAAGTCA AAGGGCAGAG TTTCGACCGG ATCAGACTTG CCGGT CGCGA AGCTTGGAAT *00 

AAGGTGCTCG GACGCATACA TGTGGAAGGA GGAACGAAGG AT GAG CG CAC TACATTCTAT 960 

TCCGCACTCT ATCGCTGTCT GCTTTTTCCC CGTCGCl'TCT ATGAGGAGGA TGCTTCCGGC 1020 

AATTTTGTGC ATTACAGCCC CTACAAT GGA GAGGTACTTC CCGGTTATCT CTATACCGAT 1080 

ACCGGATTTT GGGACACTTT TCGAGCCCTT TTCCCCCTGC TCAATCTGCT GTATCCCGAT 1140 

GAAAACATTA AAATTCAGGA AGGTCTGCTG AATGTATATC GCGAGAGTGG CTTTTTCCC™ 1200 

/O GAATGGGCCA GTCCGGGCCA TCGGGATTGT ATCATAGGCA ACAACTCTGC TTCTGTTCTG 1260 
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GCGGATGCCT 
TTGCATGCTA 
TGGTACAACT 
ACGCTCGAAT 
GATAGAGCTG 
CCGGAAACCA 
CCTTTCAAAT 
TTTCATGATG 
CTCGATTCGG 
CACGAAATCA 
ATACAGCATA 
CTACGCGAAG 
GACAACGGAC 
CCCGCTACGG 
CCCGACGGAC 
CGCTCGATCA 
CGCTCTTCTG 
AAGGAAAGTG 
AAT 



ACCTCAAGGG 
CGAAAGCCGT 
CCTTAGGTTA 
ATGCTTATAA 
CATTGGACAC 
AACTCATGCG 
GGGGAGATGT 
TGCAGGGGCT 
TATTCAATAC 
GAGAGATGCA 
TG AT AT AT CT 
TGATGGGGCG 
AGACTTCGGC 
ATCAGTATGT 
ACAAAACGGT 
GCGTAGAAGG 
CATCCATTCA 
ACAGACCTTA 



TGTTCGGGTA 
C CAT C CG AAA 
TGTTCCGGCT 
CGATTGGTGC 
GTTGGCTCAT 
CGGTAGAAAT 
ATTCACGGAG 
TATCGACCTG 
TCCTCCTATG 
AATAGCGGAT 
GTATAATCAT 
GCTCTATCGT 
TTGGTACGTT 
GCTCGGTTCG 
GTTGCATGCT 
AAAAGAATGG 
ATGGATGATG 
TTCCTTCTCC 



GAAGATACCC 
ATCTCCTCCA 
GATGCAGGCA 
ATCCTCCGAC 
CGTTCGATGA 
CAGGATGGTA 
GGCAATGCCT 
ATGGGAGGAG 
TTCGATGAGA 
ATGGGCAATT 
GCCGGTCATC 
CCTACTCCGG 
TTCTCTGCTT 
CCGATTTTTT 
CCGGCCAACA 
AGCTGCAATT 
G AC ACG AAA C 
ACGGAGCAAC 



GTACACTGAT 
CGGGTCGCAA 
TCGACGAAAG 
TGGGGCGCAC 
ACTATCGTCA 
GTTTCCGGAC 
GGCACTACAC 
ATCGCCCGTT 
GCTATTACGG 
ATGCTCATGG 
CATGGAAAGC 
ATGGGTATTG 
TAGGCTTCTA 
CCAAGGTAAT 
GTGCCGATAC 
ACCTGACTCA 
CCAATTATAA 
AGCGTCGCGC 



GAACGGACTC 
AGGTTGGGAG 
TGCTGCCCGT 
ATTGGGTTGG 
TCTGTTCGAT 
ACCTTTTTCC 
TTGGTCGGTC 
CGTGTCTATG 
ATTTGTCATC 
CAATCAACCC 
TCAGGAGAGA 
CGGCGATGAA 
TCCTGTTACA 
ACT CTCTTTT 
GCCTTACATC 
CGAACAGCTT 
TCGTGGTATG 
TAATCACAGT 



(2) INFORMATION FOR SEQ ID HO: 247 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEKGTH: 813 base pairs 

(B) TYPE: nucleic acid 

( C 1 STRANDEDNESS : doubl e 
(D) TOPOLOGY : circular 

(11) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL; NO 

(iv> ANT T- SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 
(BJ LOCATION 1. . .813 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 



ATGATGAAAT 
TT GGGCTGT A 
CTCGAACGCT 
AGCTACGTCT 
GGCGACAGCC 
CTGGGAGAAG 
CTTTTGGACT 
TCCACGCGCA 
TTTGCCAACA 
GACTATGCGG 
ACCACACCTG 
GGCAATTTTA 
GCTTATGCCA 
AGGGATGATT 



CCATGCGCAG 
GCAATAACAA 
ACATCCCTCT 
ATCCTTCGGG 
TGATGGATTC 
ACTATCGCTC 
ATATCTACAA 
TCAATACATA 
TCCTTCGCAC 
AAAGGCTGTC 
CCGAATTGGA 
TGATCGATGA 
GAGGTGCTGT 
CCCCACTAAA 



CGTGCTGTTG 
AGCTGCCGAA 
GCGGGCAGAC 
AGATGATATG 
CTCTTCGCCG 
TAACAATGCC 
GCAGGAAAAT 
TACTTACGAA 
CACCGGCAAG 
CGCACTCATC 
TGCAATAGGT 
CAAAGGTCTC 
CTATGTCCGT 
GCGTTACTTG 



CT ACT CTTTC 
TCGAAGTCTG 
ATAGATACGC 
CTCACAGAGA 
GAGAATGCCA 
GAAGCCAATC 
ACCATCGCTT 
GGCGGTGCAC 
GTGCTCGAAG 
ATAGGAOAAT 
TTCTTCAACG 
ACATACTGTT 
CTCGGATATG 
CCG 



CATTGTCTTT 
TCTCTTTCGA 
CATCGCTGCA 
TTTTCAACGG 
TGGAAGGCTA 
TGCAAGGGCT 
ATT GCGAT AC 
ATACGGAGAA 
AGCGAGATAT 
TGGTGCACGA 
CAGAAGAAAT 
TCAATGAGTA 
ACGTATTGGC 



GATCACTGCT 
TTCGGCCTAT 
TGTGATGATC 
TTTGCTCTTC 
TGCACAGATG 
TCCTTCTGAC 
GGGATTGATC 
TACAGTCCGG 
ATTCAAGATC 
TTTCGGCAAG 
ACAGCCCAAT 
TCAGATAGCT 
TCCTTTGCTA 



(2) I N FORMAT I ON FOR SEQ ID NO: 243 

(1) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAN DEDNE SS : double 

(D) TOPOLOGY: circular 

tii) MOLECULE TYPE: DNA (genomic) 
(lii) HYPOTHETICAL: NO 
(lv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 



1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
23 60 
2220 
2280 
2340 
2343 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
. 660 
720 
780 
813 
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(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(Xx) FEATURE: 

(A) MAI-IE/ KEY: misc_f eature 
5 (D> LOCATION 1...810 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 24 8 

m ^S}^T CCA TGCGCAGCGT GCTGTTGCTA CTCTTTCCAT TGTCTTTGAT CACTGCTTTG 60 

TGCCGAATCG AAGTCTGTCT CTTTCGATTC GGCCTATCTC 120 

180 
240 
300 



GAACGCTACA TCCCTCTGCG GGCAGACATA GATACGCCAT CGCTGCATGT GATGATCAGC 
TACGTCTATC CTTCGGGAGA TGATATGCTC ACAGAGATTT TCAACGGTTT GCTCTTCGGC 
™^nn^^ TGGATTCCTC TTCGCCGGAG AATGCCATGG AAGGCTATGC ACAGATGCTG 3Q0 
1( - ATCGCTCTAA CAATGOCGAA GCCAATCTGC AAGGGCTTCC TTCTGACCTT 360 

15 TTGGACTATA TCTACAAGCA GGAAAATACC ATCGCTTATT GCGATACGGG ATTGATCTCC 420 
ACGCGCATCA ATACATATAC TTACGAAGGC GGTGCACATA CGGAGAATAC AGTCCGGTTT 
GCCAACATCC TTCGCACCAC CGGCAAGGTG CTCGAAGAGC GAGATATATT CAAGATCGAC 
TATGCGGAAA GGCTGTCCGC ACTCATCATA GGACAATTGG TGCACGATTT CGGCAAGACC 



420 
480 
54 0 

ACACCTGCCG AATTGCATGC AATAGGTTTC TTCAACGCAG AAGAAATACA GCCCAATGGC 660 
AATTTTATGA TCGATGACAA AGGTCTCACA TACTGTTTCA ATGAGTATCA GATAGCTGCT 720 
JESSES* CTGCTG *CTA TGTCCGTCTC GGATATGACG TATTGGCTCC TTTGCTAAGG 780 
GATGATTCCC CACTAAAGCG TTACTTGCCG gj|J 

25 (2) INFORMATION "FOR SEQ ID NO: 24 9 

UJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 
Qn (B) TYPE : nuclaic acid 

dU (C) STRANDEDME3S: double 

(D) TOPOLOGY: circular 

<ii) MOLECULE TYPE: DMA (genomic) 

35 Uil> HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 
40 (A> ORGANISM: PORYPHYROMONAS GINGIVALIS 



45 



fix) FEATURE: 

(A) NAME /KEY: misc_feature 
IB) LOCATION 1-..801 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9 

^CCGCAGCG TGCTGTTGCT ACTCTTTCCA TTGTCTTTGA TCACTGCTTT GGGCTGTAGC 
SO ^ISiS^ G CTGCCGAATC GAAGTCTGTC TCTTTCGATT CGGCCTATCT CGAACGCTAC 
&U ATCCCTCTGC GGGCAGACAT AGATACGCCA TCGCTGCATG TGATGATCAG CTACGTCTAT 
CCTTCGGGAG ATGATATGCT CACAGAGATT TTCAACG GTT TGCTCTTCGG CGACAGCCTG 
SI^^ZSS 7 C™»«=a5GIV GAATGCCATG GAAGGCTATG CACAGATGCT GGGAGAAGAC 
ACAATGCCGA AGCCAATCTG CAAGGGCTTC CTTCTGACCT TTTGGACTAT 
AG t GAAAA TAC CATCGCTTAT TGCGATACGG GATTGATCTC CACGCGCATC 
DO AATACATATA CTTACGAAGG CGGTGCACAT ACGGAGAATA CAGTCCGGTT TGCCAACATC 
^~S£ CGA CCGGCAAGGT GCTCGAAGAG CGAGATATAT TCAAGATCGA CTATGCGGAA 
£*n^™.^ CACTCATCAT AGGACAATTG GTGCACGATT TCGGCAAGAC CACACCTGCC 
GAATTGGATG CAATAGGTTT CTTCAACGCA GAAGAAATAC AGCCCAATGG CAATTTTATG 
fin AAGGTCTCAC ATACTGTTTC AATGAGTATC AGATAGCTGC TTATGCCAGA 7-0 

gt?act?g^ ™ atgac gtattggctc CTTTGCTAAG GGATGATTCC ' 



780 
801 



(2) INFORMATION FOR SEQ ID NO: 250 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(t) 



70 
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II v) ANTI— SENSE z NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

tlx) FEATURE: 

(A) NAHE/KEY: nasc feature 

(B) LOCATION 1...777 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO:250 



ATGAAAAAAA 
GCCAATGCCC 
GCCACGGCCG 
AGGGTACACC 
AGAGCACACT 
ATTGGTATTG 
GCTCTCGGCC 
TGGGGATGGA 
TATGACACCT 
GTGAGCAATG 
ATCAAGAGCA 
CTATATCCGT 
CGCATGAACC 



CTACTTTGAC 
AATCGAAGTT 
GAAATGGTTT 
ACAGCATTCT 
GGGAAGAGGC 
TTGAAAAGGT 
GCTTCCT CAT 
CCAATGACGC 
ACACAAGAGC 
GCTTTTCACC 
AATATGATTT 
ATCGTAATTT 
ACATCGGCCA 



AGGATCGATA 
AAAGATCAAG 
TGGTGGCAAT 
GCCCGAAGGG 
CGGAAGTCCG 
AATAGTCTTC 
CAGTTTGGGG 
CCATATTCTT 
TGGCAGTGAC 
GGGGATCGGA 
TCGCCTCGGT 
CAAGCTGGAT 
TGTGGGCTTC 



TGTGCTTTAC 
AGCATTGAGG 
ATCTTCGGCA 
TTGGATTTTT 
AAGCTCATGT 
GAAGACGCAG 
ATATCCTATA 
TTCTCACCGA 
CTTGTACTTC 
CTCAAAAGTT 
TTCAGCCTGG 
GGAAATAAGC 
AACTTTACCG 



TCCTGTTTTT 
CAGCTACCAC 
TGGACATGAG 
CGGTAGGAAT 
ATACGAATGT 
AAGACTTTTT 
CCAAGCATCT 
TACCCAAGAG 
AGTCCGAAGA 
CTATTTGGTG 
CCTATGAGTA 
CGCTTTCAGC 
TGGGTCTTTG 



GGGTCTCTCG 
TTTCAGTTCG 
CATACGGATG 
ACATGAAAGA 
CCCAAGTATC 
TGACAAAAAA 
GGGAGCGTAT 
CAAGGTCCAC 
TGTTGCCACA 
GAAAATGCCC 
TCTGAACCTG 
ACT AT CTCCT 
GACTAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
i20 
777 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 251 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 98 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE; DMA (genomic) 

( i i i 1 HY POTHF.T ICAL : NO 

(1v) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1...37 08 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:2Sl 



ATGGGCAAGT 
TATACCTTTG 
CAAGCCATTT 
GAAGGGATAA 
GAGCTTTGCT 
CTAAAAAAGC 
ACTTCGTTAA 
AGTCTCACCT 
CTGGAACGTC 
GAGGGTCTGG 
AAGCTGGAGG 
ATCCGTAAGC 
AACCAAATCA 
TCGGGTAACC 
GAACTGTCGG 
AAGCTTCGTC 
CTAACAAAAC 
ACCTCGTTAG 
CGTCTCACCT 
CTGGATAGTC 
GAGGGCCTGG 
AAGCTGGAGG 
ATCAGTAAGC 
AATGATATTC 
GAAAAACTGA 
TATGATAATC 



ATAAAAGAGC 
AGGGAGTTAC 
TAGACTTAGA 
GCTGGTCGGT 
TGCGAGAATG 
TTGATCTATC 
CAAAACTTCG 
CGCTAACAAA 
TCACCTCGTT 
AACGTCTCAC 
GTCTGGAACG 
TAGAGGGTCT 
GTAAGCTA'jA 
AAATCCGTAA 
GTAACCAAAT 
TAAGAAGTAA 
TTTCTCTCTC 
CGGAGCTTTA 
C6TTAACAAA 
TCACCTCGCT 
AACGTCTCAC 
GTCTTGATGG 
TGGAAGGACT 
AATCTATTGA 
GAATCCATGA 
ATTTGCCGGA 



TAAGTACCGC 
TTTTTTATGC 
GAAGGCCTAT 
AAATAGATAT 
TCAGATAGAA 
GTATAACCAA 
TCTAAGAAGT 
ACTTTCTCTC 
AGCGGAGCTT 
GTCCTTAGCA 
TCTCACGTCC 
GGAACGTCTC 
GGGTCTGGAA 
GCTGGAGGGT 
CAGTAAGCTA 
CCAGATCAGT 
CGATAACCAA 
TCTTTTGGAT 
GCTTCGTCTA 
AACAAAACTT 
GTCCTTAGCG 
TCTTGCTTCC 
AGACAGACTA 
TGATATTAAG 
CAATCCATTT 
GATTAAAGCT 



TATTGGCTTT 
GCATCTGACG 
AACATTGAAA 
TTCAAGCAAG 
AGCATGACTT 
ATCAGTAAGC 
AACCAAATCC 
TCCGATAACC 
TATCTTTTGG 
ACGCTTGAAC 
TTAGCAACGC 
ACTTCGTTAA 
CGTCTCACGT 
CTGGAACGTC 
GAGGGTCTGG 
AAACTAGAGG 
ATCAGTAAGC 
AACCAAATCC 
AGAAGTAACC 
TCTCTCTCCG 
GAGCTTTATC 
TTAACAAGGC 
AAGGTTTTGA 
CTATTGGCTC 
GTTGCAT CAT 
CTTCTTGAAA 



TTCCTTTCTG 
ATATGACAAC 
TTCCTGATCT 
ATTCCTCCGG 
GGCTTATTGA 
TAGAGGGTCT 
GTAAACTAGA 
AAATCAGTAA 
ATAACCAAAT 
TATCGGGTAA 
TTGAACTATC 
CAAAGCTTCG 
CCTTAGCAAC 
TCACGTCCTT 
AACGTCTCTC 
GCCTGGAACG 
TAGAGGGTCT 
GTAAGCTGGA 
AAATCAGTAA 
ATAACCAAAT 
TTTTGGATAA 
TTAGTCTAAG 
GAAAACTTGA 
CGATTCTGGA 
CAGGCTTGAT 
AAGAAAAAGA 



TTCGGATTAT 
CAAGAAACCC 
CTCCT CACAA 
TGCAGTCGTT 
TTTTCCTGCT 
AGAACGTCTT 
GGGCCTGGAT 
GCTAGAGGGT 
CAGTAAACTA 
CCAAATCCGT 
GGGTAACCAA 
TCTAAGAAGT 
GCTTGAACTA 
AGCAACGCTT 
TTCGTTAACA 
TCTCACCTCG 
GGAACGTCTC 
GGGCCTGGAA 
ACTAGAGGGC 
CAGTAAACTA 
CCAAATCCGT 
GCGCAACCAA 
TGTTTCGGGC 
GCAAACTTTA 
ACT CT CTCCT 
AAAACAGAAA 



60 
120 
180 
240 
300 
360 
420 
, 480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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50 



AAGACTTCAG 

GGTAAAACAA 

GTGTTGTCGA 

GACTATTATC 

TGGGATGCTA 

AATTTCAATC 

GTTGGAGGAA 

CAGACTCATG 

GGAGTATTGG 

CTCAACTATC 

ATCACAGAAA 

CACATOCCTA 

GATCTTTACA 

GTGCTTTACT 

TTTGTCCAAA 

CCTAAAGACA 

GATGGCCAAA 

GATAAGGACT 

AAATGGCTTA 

CCCTTTGGCC 

CGGTATTGGC 

GAGCAAGAAG 

ATCAAGCl'CG 

TCAGCTAAGG 

TGGAACAATA 

ACTATTCGTG 

CAAGCGGATA 

AGCAAGACTA 

CGGGTGCGAG 

ACTGCAAAAC 

CTGCAATTTT 

ACTAAGTTTG 

ATAATCGCTT 

CCTGTATTTC 

GAAGACGATG 

GGAAAAGAGG 

GAAAATTGGG 

GAGGTAAATA 



TTGAATATCA 

CATTTCTTAG 

TACATCGAAG 

ATGGGATTTA 

AGAAGGATCG 

GCCCCTATTG 

ATCCTGATGG 

CCGATGAAAC 

AAGAAATCTA 

TGAATGAGCG 

AAGATAAGGG 

TCTCTCTCGA 

CCATAGAGTA 

ATCGTGAGAA 

TGATTCATGG 

TTTTTGAATG 

ATGGTAATAT 

GCTATGTGAT 

CTTTGGGATT 

TGATCAACCA 

GAGATCAGGT 

AAGAGAAAGA 

ACTTTACCGA 

ATATGCAGCG 

TCCCTCCGAG 

AAAOAAACAG 

AAGATTTGAC 

CGGCGAGGAT 

AACTATCGAC 

AGATCTTTAT 

TCAAACCCTT 

AAACACCTAT 

TGATCAGCCA 

GGGAGTATAA 

AGTTCCTTCG 

GAAAAACCAT 

TGGCAGTAGT 

CAGATGAA 



CCCATTTTGC 

TCAATACGAT 

CAATAACCCT 

CCAAGCCTTT 

AAACTTTGTG 

GTTAGGACAG 

CAAGGACACA 

GGGCGCTAAG 

TGTATCCTTA 

GGTGCGAGAA 

ATTGTACGAA 

AGCTCTTGCG 

TCTACAGACC 

TGAGAAGCTG 

AGAAATCCTC 

CAAACTGCAT 

GATCTTGCAG 

ACCGGGCTAT 

CGAGAGGCCC 

GATTATAGCC 

CATCTTCACA 

GGGTTTGCCC 

CTTGGCGATA 

GAAAGAGGCT 

GGAGCAAATA 

AAAGAAGAGA 

GGAGTCTCAT 

TGCAGCCTAT 

TCGTCCCTAC 

TTCCTATTCC 

GGAGAAGAAT 

TCACCCTGAA 

ACGCTATCTG 

CAAGACCATA 

GGAGAAATAT 

TAAAGCTTAT 

CAGAGAGTTC 



AAAGTAATGC 

ACAAATTATA 

AATGCGATCT 

TTTACCACCC 

AGCGTAGATG 

ATAGCCTATG 

CCACAGAGCA 

CAGCAAACCT 

GAGCCCAAGG 

GTTGTCGCAA 

GCTCTTCCCA 

GCTCAATTGA 

GAATT gaacc 

AACAATTATG 

CAAAAAGACA 

AATCTAAGTT 

CTATTATTGG 

CTCCCTTTGC 

AATTTTGTCC 

TACTATGGCC 

GCAGGCCGTG 

AAGACGAATG 

TCCGTATTCA 

ACTATCCTCA 

GGAGATAAGG 

CCCATCCAGG 

TATATCCATT 

CCGTTGAAGA 

AAACATCTTT 

AAAGAGGATC 

GGTCAG AT CG 

ATAAGAAAGC 

GCCACGGATT 

GTGCCGATAT 

TTTGCTCAGA 

GATAGTATTA 

AAAGAGAAGA 



TATTGGGAAA 

CGTATCAGAA 

TTTACGACTT 

AAT CGTTAT A 

ATAAAGAATA 

CCTGCAATCG 

CAGACGATAC 

TAGGCTGTGC 

CGAATAGTGC 

GCAGGAGTAA 

CAATCGCCGG 

ATAAGGGAAG 

AGCTTAGTCT 

TCTGGTTAGA 

ACATCAATAG 

CCGGAAGTAT 

AAGAGCTGAT 

ATTCCGATGA 

TCAAATTCGA 

GGGAAGAAGG 

AGATGGATAG 

CCGAGGATTA 

TCAAAGAGCA 

GTGATATGTT 

ATACGGACCA 

ATCTCTACCT 

7GGGCACGCT 

ACGGCGTTAT 

CCGTCAATAA 

AGACTGAACT 

AGATCTACTA 

GTATTGTCGA 

ACATCCTGGA 

TGATCAAGCC 

AAGCTCAAAT 

CGGCATCAGC 

TATTAAGAAT 



TCATTCTTCG 

AAATACACAT 

TGGGGGACAG 

CCTTCTCTTT 

TCAGACTCTT 

TTGTATGTCC 

AATTATCATT 

AGCCGAGAAT 

CGTACATGCC 

ATCAATTCAG 

TGATAATAAA 

AGCTGAAAAT 

GCGAGGGGAG 

TCCGGCAGCT 

AGGAACAGTT 

ATTTGAAGAA 

CGTATATGAA 

CGAAGCCTAT 

ACGTTTTATC 

TGCTCTAAAG 

GCAAACGCTT 

TCAGATCTGG 

GAGAAAGACA 

GGATATGTAT 

AACGAGAAGC 

CTCCTGTGCC 

GGACGATGAA 

CGATAAAGAG 

AAATCTGGCT 

GGAGACCTGT 

TGATAAGTTG 

AGCCGACTGT 

TCATGAGTTG 

TTGTACATTC 

AATCAATCTT 

CCATCGTGAT 

AACAAAACAG 



(2) INFORMATION FOR SEQ ID NO: 252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DMA (genomic) 
tiii) HYPOTHETICAL: HO 



1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2200 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

37 98 



55 



60 



<iv) ANTI- SENSE : NO 

(vi| ORIGINAL SOURCE: 

tA) ORGAN I St I: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1...3696 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 



65 



70 



75 



ATGACAACCA 
CCTGATCTCT 
TCCTCCGGTG 
CTTATTGATT 
GAGGGTCTAG 
AAACTAGAGG 
ATCAGTAAGC 
AACCAAAT C A 
TCGGGTAACC 
GAACTATCGG 
AAGCTTCGTC 
TTAGCAACJC 



AGAAACCCCA 
CCTCACAAGA 
CAGTCGTTGA 
TTCCTGCTCT 
AACGTCTTAC 
GCCTGGATAG 
TAGAGGGTCT 
GTAAACTAGA 
AAATCCGTAA 
GTAACCAAAT 
TAAGAAGTAA 
TTGAACTATC 



AGCCATTTTA 
AGGGATAAGC 
GCTTTGCTTG 
AAAAAAGCTT 
TTCGTTAACA 
TCTCACCTCG 
GGAACGTCTC 
GGGTCTGGAA 
GCTGGAGGGT 
CCGTAAGCTA 
CCAAATCAGT 
GGGTAACCAA 



GACTTAGAGA 
TGGTCGGTAA 
CGAGAATGTC 
GATCTATCGT 
AAACTTCGTC 
CTAACAAAAC 
ACCTCGTTAG 
CGTCTCACGT 
CTGGAACGTC 
GAGGGTCTGG 
AAGCTAGAGG 
ATCCGTAAGC 



AGGCCTATAA 
ATAGATATTT 
AGATAGAAAG 
ATAACCAAAT 
TAAGAAGTAA 
TTTCTCTCTC 
CGGAGCTTTA 
CCTTAGCAAC 
TCACGTCCTT 
AACGTCTCAC 
GTCT GGAACG 
TGGAGGGTCT 



CATTGAAATT 
CAAGCAAGAT 
CATGACTTGG 
CAGTAAGCTA 
CCAAATCCGT 
CGATAACCAA 
TCTTTTGGAT 
GCTTGAACTA 
AGCAACGCTT 
TTCGTTAACA 
TCTCACGTCC 
GGAACGTCTC 



60 
120 
180 
240 
300 
360 
420 
4U0 
540 
600 
6 60 
720 
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ACGTCCTTAG 
CGTCTCTCTT 
CTGGAACGTC 
GAGGGTCTGG 
AAGCTGGAGG 
ATCAGTAAAC 
AACCAAATCA 
TTGGATAACC 
AGTCTAAGGC 
AAACTTGATG 
ATTCTGGAGC 
GGCTTGATAC 
GAAAAAGAAA 
TTGGGAAATC 
TATCAGAAAA 
TACGACTTTG 
TCGTTATACC 
AAAGAATATC 
TGCAATCGTT 
GACGATACAA 
GGCTGTGCAG 
AATAGTGCCG 
ACGAGTAAAT 
ATCGCCGGTG 
AAGGGAAGAG 
CTTAGTCTGC 
TGGTTAGATC 
ATCAATAGAG 
GGAAGTATAT 
GAGCTGATCG 
TCCGATGACG 
AAATTCGAAC 
GAAGAAGGTG 
ATGGATAGGC 
GAGGATTATC 
AAAGAGCAGA 
GATATGTTGG 
ACGGAGCAAA 
CTCTACCTCT 
GGCACGCTGG 
GGCGTTATCG 
GTCAATAAAA 
ACTGAACTGG 
ATCTACTATG 
ATTGTCGAAG 
ATCCTGGATC 
ATCAAGCCTT 
GCTCAAATAA 
GCATCAGCCC 
TTAAGAATAA 



CAACGCTTGA 
CGTTAACAAA 
TCACCTCGCT 
AACGTCTCAC 
GCCTGGAACG 
TAGAGGGCCT 
GTAAACTAGA 
AAATCCGTAA 
GC AACCAAAT 
TTTCGGGCAA 
AAACTTTAGA 
TCTCTCCTTA 
AACAGAAAAA 
ATT CTTCGGG 
ATACACATGT 
GGGGACAGGA 
TTCTCTTTTG 
AGACTCTTAA 
GTATGTCCGT 
TTATCATTCA 
CCGAGAATGG 
TACATGCGCT 
CAATTCAGAT 
ATAATAAACA 
CTGAAAATGA 
GAGGGGAGGT 
CGGCAGCTTT 
GAACAGTTCC 
TTGAAGAAGA 
TATATGAAGA 
AAGCCTATAA 
GTTTTATCCC 
CTCTAAAGCG 
AAACGCTTGA 
AGATCTGGAT 
GAAAGACATC 
ATATGTATTG 
CGAGAAGCAC 
CCTGTGCCCA 
ACGATGAAAG 
ATAAAGAGCG 
ATCTGGCTAC 
AGACCTGTCT 
ATAAGTTGAC 
CCGACTGTAT 
ATGAGTTGCC 
GTACATTCGA 
TCAATCTTGG 
ATCGTGATGA 
CAAAACAGGA 



ACTGTCGGGT 
GCTTCGTCTA 
AACAAAACTT 
CTCGTTAGCG 
TCTCACCTCG 
GGATAGTCTC 
GGGCCTGGAA 
GCTGGAGGGT 
CAGTAAGCTG 
TG AT ATT CAA 
AAAACTGAGA 
TGATAATCAT 
GACTTCAGTT 
TAAAACAACA 
GTTGTCGATA 
CTATTATCAT 
GGATGCTAAG 
TTTCAATCGC 
TGGAGGAAAT 
GACTCATGCC 
AGTATTGGAA 
CAACTATCTG 
CACAGAAAAA 
CATCCCTATC 
TCTTTACACC 
GCTTTACTAT 
TGTCCAAATG 
TAAAGACATT 
TCGCCAAAAT 
TAAGGACTGC 
ATGGCTTACT 
CTTTGGCCTG 
GTATTGGCGA 
GCAAGAAGAA 
CAAGCTCGAC 
AGCTAAGGAT 
GAACAATATC 
TATTCGTGAA 

agcggataaa 
caagactacg 
ggtgcgagaa 
tgcaaaacag 
gcaatttttc 
taagtttgaa 
aatcgctttg 
tgtatttcgg 
agacgatgag 
aaaagaggga 
aaattgggtg 
ggtaaataca 



AACCAAATCA 
AGAAGTAACC 
TCTCTCTCCG 
GAGCTTTATC 
TTAACAAAGC 
ACCTCGCTAA 
CGTCTCACGT 
CTTGATGGTC 
GAAGGACTAG 
TCTATTGATG 
ATCCATGACA 
TTGCCGGAGA 
GAATATCACC 
TTTCTTAGTC 
CATCGAAGCA 
GGGATTTACC 
AAGGATCGAA 
CCCTATTGGT 
CCTGATGGCA 
GAT GAAACGG 
GAAATCTATG 
AATGAGCGGG 
GATAAGGGAT 
TCTCT CGAAG 
ATAGAGTATC 
CGTGAGAATG 
ATTCATGGAG 
TTTGAATGCA 
GGTAATATGA 
TATGTGATAC 
TTGGGATTCG 
ATCAACCAGA 
GATCAGGTCA 
GAGAAAGAGG 
TTTACCGACT 
ATGCAGCGGA 
CCTCCGAGGG 
ACAAACAGAA 
GATTTGACGG 
GCGAGGATTG 
GTATCGACTC 
ATCTTTATTT 
AAACCCTTGG 
ACACCTATTC 
ATCAGCCAAC 
GAGTATAACA 
TTCCTTCGGG 
AAAACCATTA 
GCAGTAGTCA 
GATGAA 



GTAAGCTAGA 
AGATCAGTAA 
AT AACCAAAT 
TTTTGGATAA 
TTCGTCTAAG 
CAAAACTTTC 
CCTTAGCGGA 
TTGCTTCCTT 
ACAGACTAAA 
ATATTAAGCT 
ATCCATTTGT 
TTAAAGCTCT 
CATTTTGCAA 
AATACGATAC 
ATAACCCTAA 
AAGCCTTTTT 
ACTTTGTGAG 
TAGGACAGAT 
AGGACACACC 
GCGCTAAGCA 
TATCCTTACA 
TGCGAGAAGT 
TGTACGAAGC 
CTCTTGCGGC 
TACAGACCGA 
AGAAGCTGAA 
AAATCCTCCA 
AACTGCATAA 
TCTTGCAGCT 
CGGGCTATCT 
AGAGGCCCAA 
TTATAGCCTA 
TCTTCACAGC 
GTTTGCCCAA 
TGGCCATATC 
AAGAGGCTAC 
AGCAAATAGG 
AGAAGAGACC 
AGTCTCATTA 
CAGCCTATCC 
GTCCCTACAA 
CCTAT TCCAA 
AGAAGAATGG 
ACCCTGAAAT 
GCTATCTGGC 
AGACCATAGT 
AGAAATATTT 
AAGCTTATGA 
GAGAGTTCAA 



GGGTCTGGAA 
ACTAGAGGGC 
CAGTAAGCTA 
CCAAATCCGT 
AAGTAACCAA 
TCTCTCCGAT 
GCTTTATCTT 
AACAAGGCTT 
GGTTTTGAGA 
ATTGGCTCCG 
TGCATCATCA 
TCTTGAAAAA 
AGTAATGCTA 
AAATTATACG 
TGCGATCTTT 
TACCACCCAA 
CGTAGATGAT 
AGCCTATGCC 
ACAGACCACA 
GCAAACCTTA 
GCCCAAGGCG 
TGTCGCAAGC 
TCTTCCCACA 
TCAATTGAAT 
ATT GAACCAG 
CAATTATGTC 
AAAAGACAAC 
TCTAAGTT CC 
ATTATTGGAA 
CCCTTTGCAT 
TTTTGTCCTC 
CTATG GCCGG 
AGGCCGTGAG 
GACCAATGCC 
CGT ATT CATC 
TATCCTCAGT 
AGATAAGGAT 
CATCCAGGAT 
TATCCATTTG 
GTTGAAGAAC 
ACATCTTTCC 
AGAGGATCAG 
TCAGATCGAG 
AAGAAAGCGT 
CACGGATTAC 
GCCGATATTG 
TGCTCAGAAA 
TAGTATTACG 
AGAGAAGATA 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1Z60 
1320 
1390 
1440 
1S00 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1900 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
25Z0 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
32 4 0 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3696 



55 



60 



65 



(2> INFORMATION FOR SEQ ID NO: 253 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3525 base pairs 
ID) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DMA (genomic) 

Uii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROt ION AS GINGTVAI.IS 

M x > FEATURE : 

(A) NAME /KEY : mj.^e_featur- 
t B) LOCATION 1. . .3525 

(>:H SEQUENCE DESCRIPTION: SEQ ID NO: 253 



SUBSTITUTE SHEET (RuJe 26) (RO/AU) 



sL3 Q iCii 1L4HL-H 



WO 99/29870 



PCT/AU98/01023 



173/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATGACTTGGC 
AGTAAGCTAG 
CAAATCCGTA 
GATAACCAAA 
CTTTTGGATA 
CTTGAACTAT 
GCAACGCTTG 
TCGTTAACAA 
CTCACGTCCT 
GAACGTCTCA 
GGTCTGGAAC 
CTAGACGGCC 
AGTAAGCTAG 
CAAATCCGTA 
AGTAACCAAA 
CTCTCCGATA 
CTTTATCTTT 
ACAAGGCTTA 
GTTTTGAGAA 
TTGGCTCCGA 
GCATCATCAC 
CTTGAAAAAG 
GTAATGCTAT 
AATTATACGT 
GCGATCTTTT 
ACCACCCAAT 
GTAGATGATA 
GCCTATGCCT 
CAGACCACAG 
CAAACCTTAG 
CCCAAGGCGA 
GTCGCAAGCA 
CTTCCCACAA 
CAATTGAATA 
TTGAACCAGC 
AATTATGTCT 
AAAGACAACA 
CTAAGTTCCG 
TTATTGGAAG 
CCTTTGCATT 
TTTGTCCTCA 
TATGGCCGGG 
GGCCGTGAGA 
ACGAATGCCG 
GTATTCATCA 
ATCCTCAGT C, 
GATAAGGATA 
ATCCAGGATC 
ATCCATTTGG 
TTGAAGAACG 
CATCTTTCCG 
GAGGATCAGA 
CAGATCGAGA 
AGAAAGCGTA 
ACGGATTACA 
CCGATATTGA 
GCTCAGAAAG 
AGTATTACGG 
GAGAAGATAT 



TT ATT GATTT 
AGGGTCTAGA 
AACTAGAGGG 
TCAGTAAGCT 
ACCAAAT CAG 
CGGGTAACCA 
AACTATCGGG 
AGCTTCGTCT 
TAGCAACGCT 
CGTCCTTAGC 
GTCTCTCTTC 
TGGAACGTCT 
AGGGTCTGCA 
AGCTGGAGGG 
TCAGTAAACT 
ACCAAATCAG 
TCGATAACCA 
GTCTAAGGCG 
AACTTGATGT 
TTCTGGAGCA 
CCTTGATACT 
AAAAAGAAAA 
TGGGAAATCA 
ATCAGAAAAA 
ACGACTTTGG 
CGTTATACCT 
AAGAAT AT C A 
GCAATCGTTG 
ACGATACAAT 
GCTGTGCAGC 
ATAGTGCCGT 
GGAGTAAATC 
TCGCCGGTGA 
AGGGAAGAGC 
TTAGTCTGCG 
GGTTAGATCC 
TCAATAGAGG 
CAAGTATATT 
AGCTGATCGT 
CCGATGACGA 
AATTCGAACG 
AAGAAGGTGC 
TGGATAGGCA 
AGGATTATCA 
AAGAGCAGAG 
ATATGTTGGA 
CGGAGCAAAC 
TCTACCTCTC 
GCACGCTGGA 
GCGTTATCGA 
TCAATAAAAA 
CTGAACTGGA 
TCTACTATGA 
TTGTCGAAGC 
TCCTGGATCA 
TCAAGCCTTG 
CTCAAATAAT 
CATCAGCCCA 
TAAGAATAAC 



TCCTGCTCTA 
ACGTCTTACT 
CCTGGATAGT 
AGAGGGTCTG 
TAAACTAGAG 
AATCCGTAAG 
TAACCAAATC 
AAGAAGTAAC 
TGAACTATCG 
AACGCTTGAA 
GTTAACAAAG 
CACCTCGCTA 
ACGTCTCACC 
CCTGGAACGT 
AGAGGGCCTG 
TAAACTAGAG 
AATCCGTAAG 
CAACCAAATC 
TTCGGGCAAT 
AACTTTAGAA 
CTCTCCTTAT 
ACAGAAAAAG 
TTCTTCGGGT 
TACACATGTG 
GGGACAGGAC 
TCTCTTTTGG 
GACTCTTAAT 
TATGTCCGTT 
TAT CATTCAG 
CGAGAATGGA 
ACATGCGCTC 
AATTCAGATC 
TAATAAACAC 
TGAAAATGAT 
AGGGGAGGTG 
GGCAGCTTTT 
AACAGTTCCT 
TGAAGAAGAT 
ATATGAAGAT 
AGCCTATAAA 
TTTTATCCCC 
T CTAAAG CGG 
AACGCTTGAG 
GATCTGGATC 
AAAGACATCA 
TATGTATTGG 
GAGAAGCACT 
CTGTGCCCAA 
CGATGAAAGC 
TAAAGAGCGG 
TCTGGCTACT 
GACCTGTCTG 
TAAGTTGACT 
CGACTGTATA 
TGAGTTGCCT 
TACATTCGAA 
CAATCTTGGA 
TCGTGATGAA 
AAAACAGGAG 



AAAAAGCTTG 
TCGTTAACAA 
CTCACCTCGC 
GAACGTCTCA 
GGTCTGGAAC 
CTGGAGGGTC 
CGTAAGCTAG 
CAAATCAGTA 
GGTAACCAAA 
CTGTCGGGTA 
CTTCGTCTAA 
ACAAAACTTT 
TCGTTAGCGG 
CTCACCTCGT 
GATAGTCTCA 
GGCCTGGAAC 
CTGGAGGGTC 
AGTAAGCTGG 
GATATTCAAT 
AAACTGAGAA 
GATAATCATT 
ACTTCAGTTG 
AAAACAACAT 
TTGTCGATAC 
TATTATCATG 
GATGCTAAGA 
TTCAATCG'JC 
GGACGAAATC 
ACTCATGCCG 
GTATTGGAAG 
AACTAT CTG A 
ACAGAAAAAG 
ATCCCTATCT 
CTTTACACCA 
CTTTACTATC 
GTCCAAATGA 
AAAGACATTT 
GGCCAAAATG 
AAGGACTGCT 
TGGCTTACTT 
TTTGGCCTGA 
TATTGGCGAG 
CAAGAAGAAG 
AAGCTCGACT 
GCTAAGGATA 
AACAATATCC 
ATTCGTGAAA 
GCCGATAAAG 
AAGACTACGG 
GTGCGAGAAG 
GCAAAACAGA 
CAATTTTTCA 
AAGTTTGAAA 
ATCGCTTTGA 
GTATTTCGGG 
GACGATGAGT 
AAAGAGGGAA 
AATTGGGTGG 
GTAAATACAG 



ATCTATCGTA 
AACTTCGTCT 
TAACAAAACT 
CCTCGTTAGC 
GTCTCACGTC 
TGGAACGTCT 
AGGGTCTGGA 
AGCTAGAGGG 
TCCGTAAGCT 
ACCAAATCAG 
GAAGTAACCA 
CTCTCTCCGA 
A6CTTTATCT 
TAACAAAGCT 
CCTCGCTAAC 
GTCTCACGTC 
TTGATGGTCT 
AAGGACTAGA 
CTATTGATGA 
TCCATGACAA 
TGCCGGAGAT 
AATATCACCC 
TTCTTAGTCA 
ATCGAAGCAA 
GGATTTACCA 
AGGATCGAAA 
CCTATTGGTT 
CTGATGGCAA 
ATGAAACGGG 
AAATCTATGT 
ATGAGCGGGT 
ATAAGGGATT 
CTCTCGAAGC 
TAGAGTATCT 
GTGAGAATGA 
TTCATGGAGA 
TTGAATGCAA 
GTAATATGAT 
ATGTGATACC 
TGGGATTCGA 
TCAAC CAG AT 
ATCAGGTCAT 
AGAAAGAGGG 
TTACCGACTT 
TGCAGCGGAA 
CTCCGAGGGA 
CAAACAGAAA 
ATTTGACGGA 
CGAGGATTGC 
TATCGACTCG 
TCTTTATTTC 
AACCCTTGGA 
CACCTATTCA 
TCAGCCAACG 
AGTATAACAA 
TCCTTCGGGA 
AAACCATTAA 
CAGTAGT CAG 
ATGAA 



TAACCAAATC 
AAGAAGTAAC 
TTCTCTCTCC 
GGAGCTTTAT 
CTTAGCAACG 
CACGTCCTTA 
ACCTCTCACT 
TCTGGAACGT 
GGAGGGTCTG 
TAAGCTAGAG 
GATCAGTAAA 
TAACCAAATC 
TTTGGATAAC 
TCGTCTAAGA 
AAAACTTTCT 
CTTAGCGGAG 
TGCTTCCTTA 
CAGACTAAAG 
TATTAAGCTA 
TCCATTTGTT 
TAAAGCTCTT 
ATTTTGCAAA 
ATACGATACA 
TAACCCTAAT 
AGCCTTTTTT 
CTTTGTGAGC 

aggacagata 
ggacacacca 
ccctaagcag 
atccttagag 
gcgagaagtt 
gtacgaagct 
tcttgcggct 
acagaccgaa 
gaagctgaac 
aatcctccaa 
actgcataat 
cttgcagcta 
gggctatctc 
gaggcccaat 
tatagcctac 
cttcacagca 
tttgcccaag 
ggccatatcc 
agaggctact 
gcaaatagga 
gaagagaccc 
gtctcattat 
agcctatccg 
tccctacaaa 
ctattccaaa 
gaagaatggt 
ccctgaaata 
ctatctgccc 
gaccatagtg 
gaaatatttt 
agcttatgat 
agagttcaaa 



60 
120 
160 
240 
300 
360 
420 
480 
540 
6O0 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3525 



12) INFORMATION FOR SEQ ID NO: 25*1 



65 



70 



75 



(11 SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRA1 JDEDNESS : double 
[D> TOPOLOGY: circular 

<li> MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



30 



35 



65 



70 



174/ 490 

[vi) ORIGINAL SOURCE: 

(AJ ORGANISM: PORYPHYROJJONAS GINGIVALIS 

(ix> FEATURE: 

(A) NAIIE/KEY: misc feature 

(B) LOCATION 1...687 



<xi> SEQUENCE DESCRIPTION: SEQ ID HO:2S4 

10 ATGATGAAAA AAGCATTTGT TTTCGTACTA CTGGTTTGCC TATTCTCCTC GTTCAGCA^T 

™^ CAACGACGAA CAGTAGCCGG AGTTATTTTA SSSSSE CgIg^G 12 

TAGGGGTCCC CCCCGTAAGC ACAGAGGTTT GGGGAATGAC CCATGATG^G 
^S^l?^ ^TTTCGAAAT ACCTATCTCT TTCAGTCGTT TCAACAGCCA GGGAGATATA 24C 
15 ^cc£££ SSE^ GAATAGCGAG GCAACTTTGA ATGAATGGTG CGACTATGCA 300 
GCATCGTGAG GGTAGAAGGT CGTTTTTGGA AAATG ACT ^ A CAACATA^CA 
SSSSS? 8 CA,3TCTGCA ^ CCG G ATT ACA TTCGAAAATC AAGAAATAGA AGGAACGATC «S 
£2^2? ^ AAGCCCAA AGTCTCGCTG CCTCATGTGT CGGAATCGGT GCCTTGCAT C , lo 

- 0 ?StgcSa g ssssss ess™ 2522255 So 0 



25 t2> Il,FORiIATION FOR SF *Q II> "0: 255 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 
(3) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

( D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE t NO 



(vi) ORIGINAL SOURCE: 
4Q < A > ORGAN I SH : POR Y P HYROMOHAS GINGTVALIS 

(ix) FEATURE; 

(A) NAME/ KEY : nlec feature 
(□) LOCATION 1. . .684 

4 ^ (xi > SEQUENCE DESCRIPTION: SEQ ID NO; 255 

JSSSJJS*? CATTTGTTTT CGTACTACTG GTTTGCCTAT TCTCCTCGTT CAGCAGTTCC 
2^***^* CGACGAACAG TAGCCGGAGT TATTTTACAG GACGAATCGA GAAGGTGAGT 
50 ™™ ^I^ CCCC roTA *GCACA GAGGTTTGGG GAATGACCCA S S 
JSESSS ^ CTCTTTC AGTCGTTTCA ACAGCCAGGG AGATATAGCT 

CC^ZrJl ^£52S£ TAGCGAGGCA ACTTTGAATG AATGGTGCGA CTATGCACAC 
CCGGGCGGCA TCGTGAGGGT AGAAGGTCGT TTTTGGAAAA TGACTTACAA CATArrAfirr- 

AGCCCAAAGT 2™**™ GAAA *^ AA^GATCGTC J£ 

I^ A T A ^ C . AGCCCAAAGT CTCGCTGCCr CATGTGTCGG AATCGGTGCC TTGCATCCGA 4 80 

JESSSS S GAGGGAATT TATCCTTTGC GAAGAAGACG ACACCTTTGT GTCTCACGAT 540 

^ GATAGG CGGTAAACCT TTCTTGCTCA ATACCAACGT AAAGATTGTG 600 

GCCCAAACAG TAT^ACAACA EE""* 0 GCAGATTTGT 660 



60 

12) INFORMATION FOR SEQ ID NO: 256 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 
(B> TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

tii) HOLECULE TYPE: DNA (genomic) 

(111) HYPOTHETICAL: NO 

llv) AHTI-SEiJSE: NO 

75 (vi) ORIGINAL SOURCE: 



60 
120 
lbO 
240 
300 
360 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/D1Q23 



175/490 



10 



15 



20 



25 



30 



35 



{A) ORGAN I SH: PORY PHYROMON AS GTNGTVALIS 

fix) FEATURE: 

(A) NAME/ KEY : ml5c_featurs 

(B) LOCATION 1...1620 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 256 



ATGAAGACAA 
CCTCTCCCCC 
GAAAAACTGG 
CACAGTGGAT 
GCCACATACC 
GATATCGTGG 
AATAAGCCCG 
AATGCGAAAC 
GCCTCCAACT 
TTCGCTTACA 
CTGAATGGAG 
ATTGACAAGG 
CCGCTAATGG 
TTGTCGAACT 
GAAAGCGACA 
ATCAATGGGG 
TCGGATTGGG 
ACTCCGACTA 
AACTCGGGGC 
GAAGAGAACG 
AAAGATT TGT 
CAAGTAGACA 
GGCAAACGGA 
GTAATGCAAG 
AGCCTGCCGA 
GCTGAGGCTT 
ACGTACATAC 



AAGTTTTACG 
AAGCGCAGAC 
TACAAACGAG 
ACGACACCGG 
AAAAGTTGAG 
TAACGGGTAA 
GAGGATATAA 
TCGTGTATAA 
ATCGTTCGCC 
CCGGCTTCAA 
GGCAAAATTT 
TGGATCTCTC 
GTGTGGTATT 
TTGTCGACAA 
TCTCGTTCAG 
AGAGTTGCCA 
ACATTCGGTA 
TGGATGATCT 
TGGGGTATGA 
GTACGAACAC 
GGAGCGACAC 
TCAATCCGAC 
TCGTTTGGTC 
AAGGCAGCAT 
CGGCAGCAAA 
CTTTCTCCGG 
TCAAGGTCGT 



CAAATTCGTG 
GATGGGAGGA 
GATGAGTGTG 
CAATAGCAAT 
GGATTGGGAT 
GAATGAATCC 
GAGTAGAGTT 
GGAAGACTTC 
TTCTTCTCTT 
CAATACGCAC 
CAATAAAAAC 
ATTGGGTAGC 
CGAAATGAAT 
TGATCCCGAA 
CCCCAAAATC 
CAACTTCATG 
TGTATATCCC 
GGTGGAAGCT 
CAAGAACGCC 
GCTGAAATAC 
ATTTACGTAT 
CAAGGGTCTC 
TGATACGCAG 
GAAGCTCTAC 
CTGCAAGGCT 
CAACGAATAC 
ATCCGATACG 



GTGGCGGCTT 
GATGATGTCA 
GCGGACAACG 
GTGAAGATCT 
CCATCGGATG 
GACATCAAGA 
GCGGTCTTCA 
TCCAATGTGC 
AACAATGGTG 
AAAATAAGTT 
TTACTCTTCA 
ACCTCTGAAT 
AAACAAGGGG 
TTTCAGTGGT 
CAAATGTTGC 
ATTACGTACA 
AAGAAATCGT 
TTCCTTACAG 
AATCACTACC 
CGCTGGGCCA 
ACATCATCTG 
GTGTGCTGGT 
TGGACCCATG 
CCGAATCCGG 
GTTGTTTACG 
AGGCTGAACG 
GAGCGTTTCG 



TCGCCGTCGC 
AGGTGGTCCA 
GATGGATCTA 
TCCGCTCCAA 
ATTATCAGTT 
TTTGGTCGGT 
GTCGCGATGC 
AGTTGTACGA 
GCAACCCTTT 
TTGTGGACTA 
GTCAAGATGG 
CCATGGGTCA 
GAAAAAGCGA 
CAGGCCCTAT 
TGGACGAGGA 
GCGATTATGA 
TCAAGTATGA 
CTTCGTACCA 
TGATTACATA 
ATTATGACAA 
CCAATGCTCT 
CATGGGTGGA 
CCAACGGTGT 
CTCAAGAATA 
ATATGCAGGG 
TGCAGCACTT 
TAGAGAAGCT 



AACCCTCTGT 
GTACAATCAG 
TGTAATGACC 
AGACCAAGGT 
TCAAGACTTC 
AGAGCTCATG 
CAACGCGCAG 
TGTGGATATA 
TGCTTTGGCT 
TGTGTTCTCT 
AGAGAAGAAA 
CAATGCCTGG 
TATCGGTTTC 
AAAAGTGAGT 
TAACAATACG 
TTCTGAATAT 
AAAAGGAAAA 
GAGTGAGACU 
TGCCAAAAAA 
GATTCATAAC 
CTACACACCT 
ATATCTGCCG 
AGAAGACATC 
TGCTGTGATT 
CAGAGTAGTC 
GGCTAAGGGT 
CATCGTGGAA 



60 
120 
180 
240 
300 
360 
420 
480 
S4 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



40 



45 



50 



{?) INFORMATION FOR SEQ ID NO: 2 57 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2313 base pairs 
(8) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA \ genomic) 

(ill) HYPOTHETICAL: NO 

liv) ANTI -SENSE: NO 



55 



60 



65 



70 



75 



tvl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROJ40NAS GIHGIVALIS 

(ix) FEATURE: 

(A> NAME/ KEY : misc_f eature 

(B) LOCATION 1...2313 



txij SEQUENCE DBSCRI PTIOH : SEQ ID 110:257 



ATGCTGACGA 
GATGCACAAA 
GGAACGGCCA 
GTTACCCCTT 
TGGAGTGCGC 
AGTGGAGTAG 
GATCCTGATT 
TACAAGGCTG 
GCTTTGACCG 
GCCCTAAGCA 
AGCAGGCTGA 
CTTCAGGTGA 
GTGGAAGCCC 
GAG AT GAG CG 
ATCTATGTAC 



TCCGAAACTT 
GCTCT GTCTC 
ATTACGGTAC 
TCAATGTCAG 
CTTATTCGGC 
GCTGTCCCGA 
ACTGCTGCTA 
TATTGGACAA 
AATTTGCTTT 
ATGAATCGGG 
TGGGGACGTT 
GTCGGCGACC 
AATGGGATTC 
GTGATGACAT 
GATCGGCCGT 



CCTCCTCTTT 
TTCGGGTAGA 
CACGAATCCG 
CGGATCGACA 
CGACAATAGT 
ACTGAGTGCA 
TGGCTCTTCG 
ATACGGTATA 
TCCCGAAGGA 
AGCCTCTGTT 
CTGCTACAAT 
GATCTCTGCC 
GACTGCAGGG 
CGGT GTCCG A 
TTCATT CGT C 



TGTTGTCTGT 
OGACTGACAG 
GGAGCAGTAT 
GAGAATCGCT 
TACTGCATCG 
ATACTGCTGA 
CTCAGTCGAC 
GATGCAGCCG 
GAAGGCCATA 
CGATTCTTAA 
CCGCAAGCAG 
GGCTATTGGA 
AAATATAAGC 
TTCTCGTTCA 
AGCGAAGCCA 



CGCTGATAGC 
AATATGTCAA 
TGCCCAATGG 
TCGACAAAGA 
GTTTCAGCCA 
TGGCCACTTC 
AATAT GCGCG 
TGACCGTAAC 
TCCTGCTGAA 
ACGACTCCAC 
TTTTTCGTCA 
AGAAGCAGCC 
AGTACGACGG 
ACTGCGATCA 
ATGCGCTCTA 



GTTTGCTGCC 
TCCCTTTATC 
GTTGATGAGC 
TTCGCGTTGG 
TGTGAATCTG 
CGGCACATTC 
CCCGGGAGAA 
CGAGCGGACT 
CCTGGGACAO 
AGTCGTCGGC 
GTATTTCGTA 
TCCTATGACA 
CTACAAGCGT 
GGGGGAAAAG 
TAATCTGHAA 



60 
120 

ieo 

240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ii^ ji *' w,H . ii^ k-"b 



ij -113 Ij=^. iUs 5 i^ii i 



WO 99/29870 PCT/AU98/01023 

176/490 



GCGGAGCAAG AAGAGGTGTT CAAAAGTGTC 
ATACGCTCTC GCGCTATAGA GCGTTGGGAU 
GGCACACCGG ATGAAAAGAC GATATTCTAT 
AATATCCTAC AAGATGCCAA TGGAGAATAT 
5 ACGGCTCACG ACCGCTACAC CGTGTTCTCT 
CTGCTCTGCC TCCTCTATCC GGAGAAGCAG 
TACCGAGAGA GCGGGTGGCT GCCGAGATGG 
GAGGGCGACC CCTCGCTTAT CGTCATCAAT 
GATACGGCAA CGGCCTATGA AGCCATGAAA 

10 CCGATCCGTC CTGACAACGA CGACTATCTC 
TACGACAATT CCGTATCGCA TGCGCTGGAA 
TTTGCCCACG CACTTGGGCA TAAAGAAGAC 
TACAGACACT ATTATAATAA GGAGTATGGT 
TTCCTCACTC CTTTCCATCC CAAACAGGGT 

15 GAGGGCAGTG CTTATAACTA TGCCTTTTTC 
CTGATGGGAG GAGCAAAGGT TTTTTCGGAA 
TATGATCXGA CCAACGAGCC GGACATCGCC 
GAAGCATGGC GAACGCAGAA ATTGACCCGG 
CCTAACGGCT TGCCCGGTAA TGACGATGCC 

20 ATGCTGGGAT TCTACCCTGA CTGTCCGGGC 
TTCCCCCGAG TTAGGATTCG GCTCAATCCG 
ACGACCAATA CAGAGAATCA ACCGACAGAT 
AATAAAACAC TTCCGCAT GG AACAAGGCAT 
CACCTCCGTT ACGAACTAAG CAATCGTCCT 

25 



GGAGGGAATC CGGCCAAGGC TTTCTCCGCT 960 

GAAGCCCTCG GTACGGTGGA AGTGGAAGGA 1020 

ACCGCACTCT ATCACCTGCT GATACATCCG 1080 

CCTATGATGG GCAGTGGCAA AACGGGTAAT 114 0 

CTTTGGGACA CGTACCGCAA TGTACACCCG 1200 

TTGGATATGG TACGGACACT GATCGACATG 1260 

GAGCTGTACG GACAGGAGAC CCTGACGATG 1320 

GACACTTGGC AAAGGGGCCT TCGTGCTTTC 1380 
AAAAATGCTT CTTCGGCAGG AGCGACCCAT J 144 0 

ACCCTCGGCT TCGTACCGCT TCGCGAACAG 1S00 

TACTATCTGG CCGACTGGAA TCTGTCCCGG 1560 

GCAGCTCTAT TCGGAAAACG CTCGTTGGGC 1620 

ATGCTGTGTC CATTGCTGCC GGATGGATCA 1680 

GAAAACTTCG AGCCTAATCC CGGTTTCCAC 174 0 

STTCCCCAC3 ATATACAAGG GCTTGCCCGG 1800 

AGGTTGCAGA AAGTCTTCGA TGAAGGATAT 1860 

TATCCTTACC TCTTCTCCTA TTTCCCCAAG 1920 

GAGTTGATAG ACAAACATTT TTGCAATGCT 19C0 

GGTACGATGA GTGCTTGGCT TGTCTATTCC 204 0 

AGCCCCACCT ATACACTGAC CTCGCCGGTA 2100 

CAG TAT TAT C CTCAGGGGGA GTTGATCATT 2160 

TCCATTTACA TCCATACGGT TTCTCTTGGC 2220 

ATCAGCCATG CCGATTTGGT GCGCTGOGGT 2280 

CGA 2313 



(21 IHFORHATIOM FOR SEQ ID HO; 258 



<l) SEQUENCE CHARACTERISTICS: 
30 (A) LEHGTIl: 2328 base paics 

(BJ TYPE : nucleic acid 
{Ci STRANGENESS: double 
(D) TOPOLOGY: circular 

35 (ii) MOLECULE TYPE: DNA (genomic! 



<iii> HYPOTHETICAL: NO 



(iv> AMTI-SEHSE: HO 

40 : 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOllAS GIUGIVAMS 



tix} FEATURE: 
45 (A) NAME/ KEY: misc_f sature 

(D ) LOCATION 1...2328 



(>:i) SEQUENCE DESCRIPTION: SE 

50 ATGTGCAAAA TAAGATTCAG CCTCTTGCAG 
TTTTCTCTCC AAGCTCAAGA GGAAGGTATT 
GAAAAAGCCG TAGAAACGCC CAAGAAAGTC 

' GTGGGCAAAG AAGCTCCCCA TGAGGCAAAG 

- - ACATCGGTAA GCAGCATAGC CTACTCCGAG 

55 TCAGGCAATA TCGACATCTT GGACGAAGCA 
GACAATATCG ATCTGATAGA CAAAACGCTC 
TATTTGGCAG GAGGATTCGG CCTCTCCGTT 
ACCTACGCCA AGGGAACTAA GGTGACCGAT 
ATGCTGAAAG AAGGGCAGCT CTTCAT CGGA 

60 GCATGGACAG CCTTGTCTTT GAATTTGCCG 
GGGGAAGACA TCTGTTTCCT GCTCGCCGAT 
TTTGAGCCGG AGCTATTGCT CTCTTCCTCC 
GGTCTGTTCA TCTGTGCCGA GAATCGAATT 
CAATTTCCTA TAGCCGACGT CCTTGGTGTC 

65 ATAGCATTGG GAGAAGAAGG TTTGGCTTCA 
GCCATGCCTG TAGCATTCGA CGGACCGGGG 
CACGGACGTC TGTATGCAGC CAGCGGACTC 
ATGGTGAAGC TATACGACGG CAACCGATGG 
CAGTTGGGCG GCGGATTCAG TTTCAATGAT 

70 CCCGATCACT TTTTTGTCGG TACATGGGGA 
GC CAT AG CT C GCTAT TCGGG AAACGAAACT 
CGTGTGAAAG CGATTGCCTT TGACAATAAG 
GGCAAGAACA TCTTCATGTA CGATC CGCAG 
GATGTAGCCA ATCTGGCCTC CTTCGGCAAT 

75 GTAAATATCC TTCACCGTAG TGGCGGATCC 



:Q ID NO: 258 

GCTCTTGTCG TCTGCTTATT GTTCACCTCT 60 

TGGAATACCC TCTTGGCTAT CCACAAGACG 120 

TTTGCCGTAG ccaacggagt actttactcg 100 

ATCTTCGACC GTATCAGCGG ACTCAGCGAT 24 0 

CAACTAAAAT CCTTGGTCAT ATACTATGCA 300 

GGCCGTGTGA CCAACGTACC TGCATTGAAA 360 
AATCGCCTTT TGATCGTAGG CAACAGGGCT . 420 

CTGGATGTCG CCGAAGCTCG CATACCGGCT 4 60 

GTGGCTAAGT TGGACAATGA TCGCTTGCTG 54 0 

AAAGAGACCG ATAACCTGCA AG AT CCGGCC 600 

ATGGGCTCGG TCACCGGTCT GGGCATTGTC 660 

GGCCGTGTAT ATGTCGCTGC AAACCAATCG 720 

GCCGATTCAC GACTGTATGT GACGGATCGT 780 

TATTTCATAG AAAAAGGTCG CAAAACGACA 84 0 

GGTGCCATGA ACGAAAGCAA TACGGCATAC 900 

CTTCTTCTCG CAGAGGGAAG TACGGCCGAA 960 

GACAATGATT TCTACGAGAT GCGGTTTAGT 1020 

TGGGGAACAA ACCTGATGGG ACATGCCGGT 1060 

ACTAACTTCG ACAAGAAGAC CGTACAGGAA 114 0 

GCTATCGATA TAGCTGTTTC CAACGGAGAC 1200 

AACGGTCTGT TCGAATTCAA GGATGGCAAA 1260 

GCTATCGCAG AATGTAATCC CGGAGATGCC 1320 

GGCAACCTtT GGGGGACGCT CGGTGCCGTA 1380 

AGTAGCACAT GGCATTCTTT CAGCTATCCG 14 40 

ATGATTAT-C TACCCAACGG AGACAAAT GG 1500 

ACGCGCAAAG GTGTCTTGAT CTTCAACGAT 1560 
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CGGGGTACAC 
CGCCTCGGGG 
TCTGTCTGGA 
TCCTCGACTT 
TATGTGCTGG 
GTTGCCACCC 
CAATTTACCG 
GACGATAACG 
ACGGGGAGTG 
CCGGAATATC 
ACCGATACCA 
AATGCTCGAG 
GATCCGGTAT 



CGGAAACGAC 
CAGCCATAGG 
TGGGATCGGA 
CTACCCCTAT 
ACAAGGTGAC 
AAGGGACAGG 
TAGAAAACAG 
GACTGCTGTA 
GATCAGCTTC 
CCGATGGCGT 
CCGGCAGACT 
GTGCCGATGG 
CGAAAAAGTC 



TTCGGACGAC 
ACATAAGACT 
TATAGGCATT 
CGCTGTTCGG 
GGTGACAGAC 
ACTCTATCTC 
CCCTTTGCTT 
CATCGGTACG 
CGAACTGGAC 
CACCATTGCC 
GCTATACCAG 
CAATAGGGTA 
CAAACTAATT 



AGCCATCTTT 
ATCTATGCAA 
TTCGGCGTCT 
CCGGTCGGAG 
ATCGTCGTGG 
CTTTCGGAAG 
TCTAACAACA 
GCGGACGGAC 
GGCGTCTATG 
GG ACT GCAAG 
ACTGAGAGCG 
GCTTCGGGCG 
CGCTTCGCAG 



acutcgagca 
tggccgtcga 
acaatgcagc 
gagaagaacc 
acaaactcaa 
attgcagtaa 
tactatccct 
tgatgaogtt 
tataccccaa 

CCGGCTGTAG 
TAACCACCGA 
TATATGCCGT 
TGATTCGC 



GTTTGTCAAT 
TCATAACGGC 
CGGAGTATTG 
CAATTTGTAC 
TCACAAATGG 
GATCCTCGCG 
GGCCTTAAAT 
CCAAACGGGT 
TCCGCTAAGG 
TGTCAAAATC 
AGTCAAATGG 
TGCAGTGTAC 



(2) III FORI 1ATION FOR SEQ ID HO: 259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3474 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS: double 
ID) TOPOLOGY: circular 

(il) MOLECULE TYPE: DMA t genomic J 

liiil HYPOTHETICAL: NO 

(1v> AIITI— SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROl lONAS GINCIVALIS 

t ix ) FEATURE : 

(A] NAME/ KEY: mi?c_fsature 

(B) LOCATION 1...347 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 25 9 



ATGAAACGAA 
GCGAAAGCGC 
TGGGTCAAGA 
GCCAATGGCT 
CCCGAAGATC 
GGCAATGCGC 
ACCACCATGG 
GCTGCCGGAG 
TTGATCGACT 
TCGGGACGAG 
CCTTTGAGGG 
AAGGCCAGAT 
TTCAGCGATC 
AAGCGTCGTC 
GCGAACTATA 
AACGACCTCC 
TTGGGGGGCG 
CAGGCCAATT 
ATTGAGTTCG 
GTGGACTTGT 
CATGGAGAGG 
GATCG ACT GG 
GAACAGGT GT 
GCCAAAATGT 
CAAATGCTTC 
AAACCGTATC 
GTAAACAGTT 
AATATCGGTT 
GAAGCTCGCA 
TGGCGTATTC 
CGTTTGATCG 
GACGTATATC 
ATGCTGGAAA 
GCCGGATGGT 
CATATGCCCA 
TCGGCAGGGG 
ACGCGTGTCG 



TACTTCCAAT 
AACGAGCTAT 
TACGTGTCGA 
TCTCCGATCC 
TGAGCCGGAT 
TGTATTTCTA 
AGCATACGGT 
CACCTTTGCA 
ACTACGATGA 
ATCTGTATGG 
GCAACACCCG 
CGG CCGGTGG 
CTTTTTCCAT 
TCTATCACAG 
GCATGACAGG 
GGTACGATGG 
AGTCCTGCCG 
CTTCCCTGAC 
TGGCTCCGCC 
CACAGGCTTC 
AAATCCCTGA 
CCACCTATCG 
TCAACGAGTT 
TCTACGACAG 
TCTTCGGTGA 
TCCAACAAAC 
AT GT G AC GG A 
GGCGCAATTA 
TCGCAGTGGA 
GTGCCTGTTT 
ATACCGTCAA 
CGCATGTCAT 
CCCTTCAGTC 
CGGACGAGCA 
TTTGGATTAC 
AGGAGGTTTT 
T IT ACAAT AC 



AGTCGCATTC 
GGGGAAGACG 
TGCAAGTGGA 
GTCCAAGGTA 
CACGACAGAC 
TGCCGTGGGC 
GAATACATAC 
GATGTCCCAA 
GCTGATGCTC 
CGAGTCTTTC 
CTCGTCTGGC 
CGGCCGTGAG 
GACATCGAAT 
TACGCCGATG 
AGATGCGGTC 
CGCACCCATG 
GTTCGTTATC 
AGCATCGCTT 
GAAGGGTCAG 
TGCTCCGGAG 
TCTGATCATT 
TAGAGAGAAA 
TTCGGGTGGA 
ATGGAAGGCA 
TGGGGCTCA'f 
GGAGTTCTTG 
TGATTACTTC 
TAATATGGCT 
CAAGACCATC 
TGCGGCAGAC 
GCGTTATGCT 
CGAGAACGGG 
GGGTATTATC 
TTTGCTGACG 
TGCCACGTGC 
CCTCCATGAG 
GCAGAATGAG 



CTTTCTCTCT 
GCCGACCGTT 
GTGTATCGCC 
GGTGTGTTCG 
GATTTGCCTC 
CCGGTGACAT 
AGTACGCATG 
TATACGGGTG 
CATGAACAGG 
AGTGCAGTCA 
GAACTCGGTA 
ATGTCGCTCT 
GAAGTCTCCA 
AACAGCTTGG 
AATCTGGATT 
CATATCAGGC 
AGTGAGGTGC 
GTTCCCGTTA 
GATCGTAGGA 
ATCCTCGGAG 
GTCTCTACTC 
AACGGGCTGA 
ACTCCCGATG 
AATGCACCTG 
GACAACAGGA 
CTGACATTCC 
GGCTTGCTCG 
GTAGGGCGAT 
CGATATGAGG 
AACGGGGACA 
CCTGCCATCA 
TTGCACAGCA 
CTGCTTAATT 
CTCAACGATA 
GACTTTGCCA 
AAGAGTGGCA 
AAGATCAATG 



TCCTTGCCCT 
CGCTAATGGC 
TTACGGACGA 
GTTATGGTGG 
CGGTACCGGT 
GGTTCTACAA 
GCTACTACTT 
GAGGTGCGTC 
AATTGTATTC 
ATACGCGTAC 
CCGTATTCTC 
CGGCGAATGG 
ATTCCTATTT 
TCAATGAGTT 
TCATAGAGGT 
GGTTTTCCAA 
CGGAGTCTCT 
AGACTGTCGG 
CTATCAATAC 
CGGTACCCAA 
AGGCGCTCCT 
AGGTTTTGGT 
CTACAGCATA 
TGGGAGAGAC 
AGGTCTCCGT 
AAGCCGTCAA 
ATGATCAGCC 
TCCCCGTACG 
AGGATCGAGA 
AGCACGCAAC 
TGCCGGTACG 
TTCCGGGTGC 
ATGCTGGTCA 
TACACAAATT 
ACTATGACAG 
CTCCGATCAT 
GTTTTATGCT 



TGCTTTGCCT 
TTCGGGACAT 
ACAGCTCCGT 
AGGGGTGCTT 
ACTCCGTCAG 
TCCGGCCAAA 
CCTGTCGGAT 
GGCCGAGGCT 
GCCCAAAGAA 
GGTCAAGTTC 
ATACATAGCC 
CATTCTGATC 
GGCCGGCAAG 
GCGCTTGGAC 
GGCTACACAG 
TTTGCCCGTT 
GGTGGTTTTG 
GGATAAGACC 
GTTTTATGCC 
TCAAAACCTG 
CCTTGAGGCT 
CGTGTT GCAG 
CCGCCTCTTT 
CTTCCCGATG 
AGCTTGGCAG 
TTCGACGAAC 
GGCCTCGGTC 
TACTCCGGCC 
GAGTGGTGCC 
CGAGACTTCC 
CGCCTTTCAG 
AAAGAAAAAG 
TGGCGGTCCT 
CAATTATAAG 
TCAGACGACC 
GTTCTCGACT 
TCGGCGTATG 



1620 
1660 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2328 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
90O 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
18C0 
1860 
1920 
1980 
204 0 
2100 
2160 
2220 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



178/490 



10 



15 



20 



TTCGAGAAAG 
CAGGGGATGC 
GGTGATCCGT 
GGGCAGGATC 
CTGAAGGGTA 
TTCCTGACCG 
CTCTCTCTTG 
GACGGACTCT 
GGCCGGATCA 
TTCTCCATCA 
GAAATCATAA 
ACTCCTCTGT 
GTAGGGCATG 
GCATATTTCA 
GCTTTGGCCG 
GTCCATCATG 
ATTCTATTCC 
GGAAGCGATT 
TTGCCAGTCA 
ACCTCCAAAT 
TCTCCCGGAG 



CTAAGGATGG 
TCAGTACTGT 
CCGTGCGTAT 
CCGAAGGGCA 
AGGTAACCGA 
TCTTCGATGG 
TATATTATGA 
TCGAAACTTC 
ATCTTTATGC 
GAGTCCAACC 
GCTGCTTCCT 
TTATGGCCGA 
ATATTACGCT 
CAAGTTCGGC 
AAGGAGATCA 
ACTTTTCATT 
CGAATCCGGT 
TGAACGTGGC 
AGACCTATTC 
ACGGAGTGAA 
GACAGACGGC 



GCGTTATCGT 
TTTCCCCGAT 
GAATCTTCCT 
GTATGGAACT 
TGAAAAGGGG 
CAGAAAGAAA 
CTATCCTAAC 
GTTTATCGTA 
TTATAACGAG 
GGGTATTCCT 
CAATGACAGT 
AGTATTCGAC 
TTGTATCGAT 
TACGGATGCA 
TACTGCCCGA 
CAGAGTGGTA 
ACGCGAGAGT 
CCTGGAGATC 
GTCTTCCTAC 
GATCGGAAAC 
CTCCATGGCC 



ACGATGGGCG 
TCGATCAACC 
ACCCACAAAG 
ATTATGCTCA 
ACATTCGACG 
ATGACAGCTT 
GTGATGTATG 
CCCAAGGATG 
AGCACAAAGG 
GATGAGGTAA 
ACATTCCGAT 
TTGAATGGAA 
GGCCGTGCCG 
GGTGTGGGCA 
CTGACGGTTT 
GATGGCATTG 
GCTACGTTCC 
TATGACTTGA 
GGAGAACCl'A 
GGATTCTACC 
AAGAAAATGA 



AGATTATCCG 
AGTTGAGTTT 
TGCAATTGAC 
AGTCTTTGGA 
AGACATTCAG 
TGGAAGAGGA 
CCGGTATTGC 
TGAACTATTC 
CGGAAGCCAT 
CGGAAGATAA 
CGGGAGATGA 
TCAATATCAC 
ACCTGACCTA 
CTATTCTCTT 
GGGACATTTT 
CTCCGGATGT 
GAATCTTCCA 
CCGGTCGTCT 
TAGAGATCAA 
TCTATCGTTG 
TCGTGGTAGG 



ATCGGCCAAA 
CTTTCTGATG 
CGCAATCAAC 
ACGGGTAGCT 
TGGCAAGGTT 
GGGAAACGAT 
CGAGGTGAAA 
CGAGCACGAA 
GGGGGTAGAC 
TACACCGCCT 
GGTTAATCCT 
GGGTAGCGGA 
CAACCTCAAT 
CATGATACCG 
CAATAATGCC 
GGCTGATGTG 
CAATCGCCCC 
TGTGAACA6T 
GTGGGATCTG 
TGTGGTGAAC 
ACAA 



228D 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2620 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34 7 4 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 2 60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 883 base pairs 
(8 } TYPE: nucleic acid 
<C> STRAHDEDWESS: double 
(D) TOPOLOGY; circular 

(il> MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: HO 

(lv) ANTI-SENSE; NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

tlx) FEATURE: 

(A) NAME/KEY: misc_feature 
IB) LOCATION 1...2883 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



ATGAAGAAAC 

AAAGAAAAAC 

TACGAAAAAG 

AAGAACCTTG 

TCCCAATACA 

AGGGACACCA 

GGTGCTTTGG 

AGCGGAAACC 

AACATCAATC 

CAAGGTGAGA 

CGTATCTATC 

GAGCAGAACA 

GAAGTAGACG 

AAA CT GAT GG 

CGTATCAATC 

GGGACTGCGA 

GAGAATCTCG 

ACGTTGAGCG 

AACTACGAAC 

AAACAAGAGA 

CTTTTGATTT 

AAAGAGACGG 

CCTATCACCG 

TCGGACAGAA 

GTGAACCAGC 

AATGGCGACA 

GCTCAAGACA 

CCCTCCTACT 

GCCGGAGGGA 



TTTTCCCATT 
ACTCTGTAAC 
AACTCCCCAC 
TGGGACAAAT 
CAGATGCGAT 
TCGTAGCCGC 
CAGAAGCTTC 
AGAATAGGCC 
TTGAGTTAGG 
AAGAACTCGA 
GACAACGCAA 
ATATGGCAGA 
AAGAAAAAGG 
AACAGTTGTC 
TCAAACAAGG 
AAGAAATTAA 
AGCGTAAAAA 
ACAGCATGTC 
GAAACAAGTC 
AATCGAAGTT 
CTGTTCTGAC 
ATAAACTTCG 
TCATACAAGG 
CCGAGCTGCA 
TGTTGGATAT 
TCGTCTCCTT 
TAACCTTGGA 
TGCAAAAAAT 
GAGTGGTCAT 



ATTATTACTC 

TGAAATCGCC 

TGATTCTGTT 

GTTATTCGCG 

GCTATATCAC 

ACAAGCATGG 

GGATTATCAT 

TGCGATCAAA 

ATACCATGAT 

CAGTCCTCTC 

AGAATACGAC 

GAATCTGATG 

GGATTATCAA 

CGATCGATGG 

TAACGAAAGG 

TTCGACTTCA 

AGAATACAAA 

CATTGCGCAC 

CCAAAAAGAG 

TATCCTCTTG 

AT AT GCATAC 

CTCCGGCTTT 

TTTGAATGAG 

CAAGATAATA 

TTGCAAGATC 

CGTACAGATT 

GCTACAACCC 

CATATCCAAT 

ATCTCTGGCA 



ATATTGTCGA 
CGAGAGAAAA 
AAACAGCTTT 
ATCGAGGTCG 
CAAAAAGGGT 
AATCATCTCG 
TACAAGGCTC 
GCCAGATCGG 
GAGGCCGAAA 
GGGCAAGCTA 
AAGGCTCGTA 
GGTATCGGAC 
AAGGCTTTGC 
CACTGGCTGA 
CTCTACCAGC 
CAT CT GAT AG 
CAAGCCCTCG 
AAGGTCAGCA 
CTTGAAGAAA 
AGCACTCTTT 
CGTCAGGGCA 
TTCACCGGTA 
AAAATGAGTT 
GACAGACACA 
AGAAGCGGAG 
CTCATCGATT 
GAGAGCAAAC 
CTTTTGT CCA 
AAAACCAAGA 



TTTTGGTCGG 
AGCGTATTAC 
ACGAGAACAG 
GCAAACGGAT 
TGAACGCTGC 
GAACGGATAG 
TTTCCTTGAT 
CGGCCCTGAA 
AGAATTTCCT 
TCAACTATGC 

ccTAcrrccr 

TCTGTAGCAT 
AAGAGTATGC 
ATTCCTGTAT 
ATTTCATTTC 
AAATATACAA 
AAGCATTCTG 
GCATACAAGA 
TACAGCAAGT 
TTGCCCTTTT 
AGAAGCATAA 
TTACACACGA 
CAAGTCCTGA 
GTAGCCATAT 
TATCCACGCC 
CGTTTGCACC 
CTATTGTCGT 
ATGCCAT CAA 
ATGAAAAAAA 



ATGTGGAAAA 
TGCATTGCTG 
CGTACAGAAC 
CCCTAATATG 
ATTGAATCTA 
CCGTCGTATC 
AGAATCTTTT 
CGGCATCGGC 
GAAAGCACTG 
GAACCTCGGA 
CTTGTCTCTG 
CAATCTCGGA 
CACGGCATAC 
CCCGATGGCA 
TTTGGCCGAA 
TCTTCAATAC 
TCTGAGCAAG 
AACGCGATTC 
AAGCAAGGCA 
CATCTCGATT 
CAAGCTGATC 
ATTTC'jTACG 
TCTCCAAGCA 
GCTGAATTTG 
CGAATGGCGC 
ATACGCACAG 
GGACTTCGTC 
GTATTCTTTA 
TCTGATCATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
1 900 
960 
1020 
108O 
1140 
1200 
1260 
1320 
1380 
1440 
IbOO 
1560 
1620 
1680 
1740 
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CGCGTTGCAG ACAATGGCAT AGGAATAdAT 
TTCTATCGAG GACAGTCCGC TACCGAAAAG 
AATATACTGG TCGAAAACCT TCGAGGTACG 
AGTGCCTTCA CCATCAGTAT TCCTACACAA 
5 TGGCTACCCT CCTCCGATGA CATTGTCATG 
ACATCTCCGA TGGTAGCAGC TCTGAATCAT 
CTCGTCGAGG ACAATAAGGA TATCAACCTG 
AATGTCCTAT CCGCCGCAAA CGGAAAAGAG 
GACATTATCA TTACGGATAT TATGATGCCG 
10 ATGAAGCAAT CGCCTCTGCT CTGTCACATT 
GAACAGGACA GATTGGAAGG AATCAAAAGC 
TCTCCGGAGG AGCTTTTGAT GCGGATCGAG 
AAGTTCTATA TGCAAAAACT CATGCTGGAT 
GATGACAGCA GTATGCAGTT TCTCCTTGCT 
15 CAAAATCCGG ATTTTTCCGC TCAAGACTTG 
CTCAACAGAA AGCTCACGAG TGTCGTAGGT 
AAGATAAAAT TGGCCTGCAA GCTCCTTGCC 
ATTGAGGCAG GCTTTTCGGA TCCGGCTTAC 
TGCTCTCCCT CCCAATATCG GCAAAAACTC 
20 GTT 



AAAACTGATC AGGCTCATAT CTTCGACATC IflOO 

CATGGATCAG GCGTCGGACT CTCGTTTACC I860 

ATCAAAGTGG AAAGCCAGCC GGGGAAAGGA 1920 

AACCAGTCCT CTTCGGCAGA GATTCTTCCT 1 980 

CCTGTCCACA TCGCGCCCGA TGACTCACCG 204 0 

CGCTTCGAGG ACGAACGTCC GACCATACTG 2100 

CTCGTCAAAC TACTCCTTTG CGATCGCTAC 2160 

GGTATAGCCC TCGCTACCGA GCATATTCCC 2220 

ATAATGGATG GGATAGAAAT GACAATCCGG 2280 

CCCATTGTCG CTTTGACGGC CAAGAGTACC 2340 

GGTGTAGTCT CTTATCTATG CAAGCCATTC 2400 

CAGCTTCTGA AAGACGGTGA GTTGCTCAAG 24 60 

CGGAAGCCGG AGGAGGAGCC TCAACCGATA 2520 

GCCAAAGATG CAGTGTCCGG TGGAATCAAA 2580 

GCCGAAAAAA TGTGCATGAG TCCATCCCAA 2640 

TGCTCCACCA TCGGCTACAT ACAGCAGATC 2700 

GATGAGAGCA AAAACATCTC CGACATTAfiC 2760 

TTCTCTCGCA CCTTCAAACG CTACATGAAC 2820 

CTTGCCATGC CGGGGAGCGA CAAGGAGACA 2880 

2003 



t2J INFORMATION FOR SEQ ID WO: 2 61 

25 ( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LF.NGTH : 1668 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 



30 



35 



ill) HOLECULE TYPE: DMA (genomic) 
iiil) HYPOTHETICAL: NO 
liv) AWTI —SENSE : NO 



(vi) ORIGINAL SOURCE: 

<A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

40 (i X ) FEATURE: 

(A) NAME/ KEY : mioc_f eature 
(B> LOCATION 1...1668 

^ txi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

ATGAAAAAGC TTTTACAGGC TAAAGCCTTG AT TCTGGCAT TGGGACTCTT CCAACTGCCC 60 

GCAATCGCCC AAACGCAAAT GCAAGCAGAC CGAACAAACG GTCAATTTGC AACAGAAGAG 120 

ATGCAACGAG CATTC CAG G A AACGAATCCC CCTGCAGGTC CTGTGCGTGC TATCGCTGAG 180 

TACGAACGCT CTGCAGCCGT TTTGGTACGC TACCCGTTCG GTATCCCGAT GGAATTGATC 24 0 

OU AAAGAGCTGG CCAAGAACGA CAAGGTGATT ACCATTGTGG CGAGTGAAAG CCAAAAAAAC 300 

ACCGTTATAA CCCAGTACAC CCAAAGCGGT GTGAATCTCT CTAATTGCGA TTTCATCATT 360 

GCGAAAACTG ACTCTTACTG GACACGCGAC TATACCGGTT GGTTCGCAAT GTACGATACG 4 20 

AACAAAGTAG GTCTCGTGGA CTTTATTTAT AACCGCCCTC GTCCTAACGA TGATGAATTC 480 

CCCAAATACG AAG CACAAT A TCTGGGCAT C GAGATGTTCG GGATGAAGCT CAAGCAGACC 54 0 

OO GGTGGCAACT ACATGACGGA CGGATATGGA TCCGCTGTGC AGTCACATAT CGCATATACG 600 

GAGAACTCCT CTCTGTCTCA AGCTCAAGTA AATCAAAAGA TGAAAGACTA TCTCGGCATC 660 

ACACAT CATG ATGTGGTACA AGATCCGAAC GGCGAATATA T CAACCATGT GGACTGTTGG 720 

GGCAAGTATT TGGCACCGAA CAAAATCCTC ATCAGGAAAG TGCCTGACAA TCACCCTCAG 780 

CACCAAGCCC TGGAAGATAT GGCAGCCTAC TTCGCAGCAC AGACCTGCGC ATGGGGAACG 840 

AAGTACGAGG TATATCGCGC TTTGGCCACC AATGAACAAC CGTACACGAA CTCTCTGATT 900 

CTGAACAACA GGGTATTTGT TCCTGTCAAT GGCCCCGCCT CCGT GGACAA CGATGCTCl'G 960 

AACGTCTATA AGACGGCAAT GCCCGGTTAC GAAATTATAG GTGTCAAAGG GGCTTCAGGA 1020 

™^ TTCGT TA <^AACAGA TGCCCTGCAT TGTCGTACTC ACGAGGTAGC GGATAAGGGC 1080 

TATCTCTATA TCAAGCACTA CCCGATACTG GGCGAACAGG CAGGCCCTGA TTATAAGATC 1140 

CAAGCAGATG TCGTCTCATG CGCCAATGCT ACTATCTCGC CGGTACAATG TTACTATCGT 1200 

ATCAATGGTT CCGGTAGCTT TAAGGCTGCT GATATGACGA TGGAATCAAC AGGTCACTAT 1260 

ACTTATACCT TTACAGGTCT TAACAAGAAT GATAAGGTAG AATACTATAT CTCTGCCGCT 1320 

GACAATAGTG GTCGCAAAGA GACTTATCCC TTTATCGGCG AACCTGATCC TTTCAAGTTT 1380 

ACGTGTATGA ACGAAACCAA T ACAT GT ACT GTGACCGGAG CTGCCAAAGC TCTTCGTGC? 1440 

I^ TTCAACG CCGGTCGTTC AGAACTGGCT GTTTCGGTAA GTTTGAATAT TGCCGGCACA 1500 

^T^™** AGCTTTATAA CACCGCAGGA GAAGAAGTCG CTGCAATGAC CAAGGAATTA 1560 

GTAGCAGGGA CGAGTGTCTT CAGTATGGAT GTGTATTCTC AGGCTCCGGG CACATATGTT 1620 

CTGGTTGTTG AAGGAAATGG AATCCGTGAG ACAATGAAAA TTCTCAAA 1668 

75 



60 
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(2) INFORMATION FOR SEQ 10 HO: 262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12H4 base pairs ? 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

1Q <ii> MOLECULE TYPE: DMA 1 genomic] j : 

{lil) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

15 {vi) ORIGINAL SOURCE : 

(A) ORGANISM: PORY PHYROMONAS GINGIVAL IS 



20 



fix) FEATURE: 

<A) UAI IE/KEY: misc feature 
(B) LOCATION 1...1284 

(xi) SEQUENCE DE SCRIPT I Oil: SEQ ID HO: 262 



Orz 5I? A ^ A 5 TTT ^TCl'AAGAA AATCTTAGCA ATCATTCCAT TGCTGACGAT GGGACATGT fin 

fl^^ 0 AGTTTG,rTCC GGCTCCCACC ACAGGGATTC GCATGTCTCT C^SaaIS 1^0 

AAGGCCGTAG GCGAAAAAAT CGAATTGTTG GTTCATTCCA TAGAGAAGAA ACGTA^CTGG \»n 

SSSTT* AT GGGGATGC CACTTACCAA CAAGGAGAGG AAATAACCGT ATTCGATGAG 240 

^I^S G ^TACACGAT CGGGACGCAA ACCCTCACTA TCTATGGTAA TAcS^CCGA 300 

30 ™^^ GTC GATCTACCGG TGCAACGGCT GTCGATGTAA CGAAAAACCC TAATcJ^CC 360 
30 JATCTCGCAT GCCCGAAAAA TAATCTGAAA TCATTG GACT TGACGCAAAA CCCAAAG^TG 

S^^ TT ^^^C TAACGAAATA GAAAGTTTGG ACCTGAGTGG cSt^CT 4 B0 

T3^ CATCC TCGG( ^«'^A CAGGAATAAG CTGACTGAGC TGAAGACCGA TAACAAC^-C s!n 

~™™ CCT CTCTTTGGTG TTCTGATAAT AACCTGACGG AGTTGGAAC? SSE^SSJ 6Q0 

CCTCGTCTCA ATGATCTTTG GT GCTTCGGT AATCGGATCA CGAAACTCGA TrrrAfTTrr «2 

™^ TGGTAACACT TTGGTGCAGT GACAATGAGC T^ACCT^ GGA^^CC "S 

™^™2 GG AC ^ rG CTTA CCTTTGGTGT TCAT CGAACA AACTTACATC CTTGAATCTG 780 
" !5r^r G I GA A SS ACTGAG TGT TTTGGTT TGTCATTCCA AT CAGATCGC AGGTGAAGAA 

' A I^ A S GAAAG TGGTGAATGC TTTGCCCACA CTATCTCCCG GCGCAGGCGC TCAGAGCAAG 900 

40 - ^ AGACCTCAA GG ACACTGAT GAGAAGAATA TCTGTACCGT AAAGGATGTG 960 

2 AAAAAGCTA AAAGTAAGAA CTGGCGAGTA TTTGACTTCA ACGGTGATTC TGACAATATG 1020 

CTTCCATACG AAGGAAGTCC GACATCGAAC TTGGCAGTAG ATGCTCCCAC TStSSS"a lolo 

GGGTAGGAA ^ ATATGCGCTC GTCGAGATCC CCGAGTCTCT tt£S£ 

5??^^" TATACGATAT GAATGGGGTA AAAGTCTATA GTTTCGCGGT AGAGTCTCTT 12o2 

45 SSSSSS ^ CTTCCC ATTTCTTCCG TCTCGA^C ^60 



1284 



(?) INFORMATION FOR SEQ ID MO: 263 

5 ^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 base pairs 
IB J TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: circular 



60 



(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRQf IONAS GINGIVALTS 
65 ( ix ) FEATURE : 

(A) HAHE/KEY: mlsc_f eaturc 
<B) LOCATION 1. . .930 

7 q < Xi > SEQUENCE DESCRI PTION : SCQ ID NO: 263 

rinarnrr^ CAATAATTTT CTGCTTGTTG CTCGCCCTAT TTGCCTGTTC TTGGGCACAA 
larrTrnBA ATGAAAAAGT ATTCTCCGCA GGAACAAGTA TTTTTAGGGG CATCCTTGAA 
AAGGTGAAAG CACCGCTTAT GTATGGAGAT CGTGAGGTAT GGGGTATGG^ ^rTrrrZZ 
75 Sr^ST^ TTTTT ^ACT TCCCGTTACG GATgS CTCCCGTGCT? Itt^I^C 2*0 
75 CGTCTTACAA ACGAACCCTG CTTTGTGTCA GACCAAGGAA TAACTGAGTA TTT^Era 300 



60 
120 
180 
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GCTCAAGAAG GTGATTACAT TGAAGTCGAA GGAAGCTCTG TATTCATGGC GAATCTTTTC 360 

TACTATCGTT TTTTCCCGAC AAGAATTACC TCCTATAATG CTCCCATTGA AGGTGTTGTG 420 

AGCAAGACGG GAAATCCTGC TTTTACAATC CCGATGCTCC CGGGGGTTTC TGATTGCATA 4 80 

GAAATCTCAA ACAACCGCAA AGTCTTTCTG ACCAATCAAT TAGCCGTTGT AAACATCACT S4 0 

5 GACGGGATGG AACCTCCGAT TATTGCCGGA GTCTCTGCTT CCTATGGATC TTCCGTCCGG 600 

GTGTATGGTC ATGTCTCACA GCGGTGGGAC ATCATAGGCC ATTGCTATTT GGATATCTAC €60 

CCAACCAATT GCTATCCGCT CAGCACGAAA CCCGTTGCAG GAGACGATGA GGTTTTTGTC 720 

AAACAACAAG GCAGGCAAAT AGAGATCGAT AGCAACAGCC CCATAGTCCA AGTGGTCGTA 780 

TACGATCTTG AGGGGAAAAG TGTTTTTCGC AAAAGAATGA CCGAAAACGC TTATACCCTA 64 0 

10 TCCTTTAGAG CACCCATGCT CGGCTTTATG ACCATCATGA TCGAAACACA AAATTCGATT 900 

ATCAATAAAA AACTTAATGT TACACAGCTA 930 



15 



25 



30 



70 



(2) INFORMATION FOR SEQ ID NO: 264 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 base pairs 
(B» TYPE: nucleic acid 
(C) STRANDEDUESS: double 
20 (Dl TOPOLOGY: circular 



(ii) MOLECULE TYPE: DJ1A (genomic) 

(lii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(iX) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...1215 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 

ATGAAAAAAA CAACCATTAT TTCTTTGATT GTCTTCGGTG CTTTCTTTGC AGCCGTGGGC 60 

CAAACCAAGG ACAATTCTTC TTACAAACCT TTTTCGAAAG AAGATATTGC CGGAGGAGTT 120 

TACTCTCTCC CGACTCAAAA TCGTGCGCAG AAGGACAATG CCGAGTGGCT TCTTACAGCG 180 

40 ACCGTCTCCA CAAACCAGTC TGCAGATACT CACTTTATCT TCGATGAGAA CAACCGCTAT 24 0 

ATCGCTCGTG AC AT AAAAG C CAATGGGGTA AGAAAATCCA CGGACTCCAT TTACTACGAT 300 

GCCAACGGGC GAATATCGCA TGTGGATCTT TATATCTCGT TCAGTGGCGG AGAGCCTGCA 360 

CTCGACACCC GATTCAAGTA CACCTATGAT GACGAGGGAA AGATGACCGT GAGGGAAGTA 420 

TTCATGCTGG TAAT GGATCC GAATACACCT ATCTCACGCT TGSAATATCA TTATGATGCA 480 

45 CAGGGCAGAC TGACCCACTG GATTTCTTTT GCTTTCGGGG CAGAATCCCA AAAGAATACG 54 0 

TATCACTATA ATGAAAAAGG TCTGTTGGTC AGCGAAGTGC TGAGCAATGC AATGGGGACA 600 

ACCTATTCAG ACACCGGCAA AACGGAATAC AGCTATGACG ATGCAGATAA TATGGTGAAG 660 

GCCGAGTACT TCGTCGTCCA GCAAGGAAAG GCATGGCAAG TACT C AAAAG AGAGGAATAC 720 

ACCTATGAGG ACAATATCTG CATACAATAT TTGGCTATTA ACGGTACCGA CACAAAGGTG 700 

50 TACAAGCGAG ACATCGAGAG CGATAAGTCC ATCTCCGCAA AT GTCATTGA CATTCCGTCA 64 0 

ATGCCGGAAC AGACCTGGCC TAATATGTAC GGATTCAACG CAAAGCGACT GAAAGAGACT 900 

TATTCCTCCT ACGAAGGAGA TGTGGCTACT CCTATATTCG ACT AT AT CT A TACGTACAAG 960 

GCTCTTACCT CAATGCCAAC ACCTTCG AC A GAAGCTCAGG T AGCAGT CT A TCTCAATCCG 1020 

TCAACGGACC GGTTAGTGAT TCTGGCCAAC GGCATCACAC ATCTGAGCAT GTACGACTTG J 080 

55 CAGGGTAAGC TTATCCGTGA TTGTGCCTTG AGCGGCGATA AGGTGGAAAT GGGTGTCGGA 114 0 

TCTTTGACCA AAGGGACATA CCTGCTTAAA GTGAATACGG ATCAGGGAGC CTTTGTGAGA 1200 

AAAGTCGTGA TTCGA 1215 

60 (2) INFORMATION FOR SEQ ID NO: 265 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 454 amino acids 
(D) TYPE: amino acid 
65 (D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 
75 (A) NAME/ KEY: miscj' eature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(B) LOCATION 1. . .454 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 265 

Phe Cys Val Met Ala Lys Val He Lys Thr Lys Lys Gly Leu Ala Leu 

1 * 10 15 

Arn Leu Lys Cly Lys Pro L&u Pro Glu Met Leu Ala Glu Pro Ala Gin 

20 25 30 

Ser Pro Thr Tyr Ala Val Val Pro Asp Asp Phe Glu Gly Val He Pro 

35 40 45 

Lys Val Thr Ala Arg Pro Gly Asp Lys Val Arg Ala Gly Ser Ala Leu 

50 55 60 

Met His His Lys Ala Tyr Pro Glu Met Lys Phe Thr Ser Pro Val Ser 
65 70 75 80 

Gly Glu Val He Ala Val Asn Arg Gly Ala Lys Arg Lys Val Leu Ser 

B5 90 95 

He Glu Val Lys Pro Asp Gly Leu Asn Glu Tyr Glu Scr Phe Pro Val 

100 iob no 

Gly Asp Pro Ser Ala Leu Ser Ala Glu Gin He Lys Glu Leu Leu Leu 

US 120 125 

Ser Ser Gly Met Trp Gly Phe He Lyt' Gin Arg Pro Tyr Asp He Val 

130 135 140 

Ala Thr Pro Asp He Ala Pro Arg Asp He Tyr He Thr Ala Asn Phe 
14S 150 155 160 

Thr Ala Pro Leu Ala Pro Asp Phe Asp Phe He Val Arg Gly Glu Glu 

165 170 175 

Arg Ala Leu Gin Thr Ala He Asp Ala Leu Ala Lys Leu Thr Thr Gly 

180 185 190 

Lys Val Tyr Val Gly Leu Lys Pro Gly Ser Ser Leu Gly Leu His Asn 

195 200 205 

Ala Glu He Val Glu Val His Gly Pro His Pro Ala Gly Asn Val Gly 

210 215 220 

Val Leu He Asn His Thr Lys Pro He Asn Arg Gly Glu Thr Val Trp 
225 230 235 240 

Thr Leu Lys Ala Thr Asp Leu He Val He Gly Arg Phe Leu Leu Thr 

245 250 255 

Gly Lys Ala Asp Phe Thr Arg Met He Ala Met Thr Gly Ser Asp Ala 

260 265 270 

Ala Ala His Gly Tyr Val Arg He Met Pro Gly Cys Asn Val Phe Ala 

275 280 285 

Ser Phe Pro Gly Arg Leu Thr He Lys Glu Ser His Glu Arg Val He 

290 295 300 

Asp Gly Asn Val Leu Thr Gly Lys Lys Leu Cys Glu Lys Glu Pro Phe 
305 310 315 320 

Leu Ser Ala Arg Cys Asp Gin He Thr Val He Pro Glu Gly Asp Asp 

325 330 335 

Val Asp Glu Leu Phe Gly Trp Ala Ala Pro Arg Leu Asp Gin Tyr Ser 

340 345 350 

Met Ser Arg Ala Tyr Phe Ser Trp Leu Gin Gly Lys Asn Lys Glu Tyr 

355 360 365 

Val Leu Asp Ala Arg He Lys Gly Gly Glu Arg Ala Met He Met Ser 

370 375 360 

Asn Glu Tyr Asp Arg Val Phe Pro Met Asp He Tyr Pro elu Tyr Leu 
385 390 395 400 

Leu Lys Ala He He Ala Phe Asp He Asp Lys Met Glu Asp Leu Gly 

405 410 415 

He Tyr Glu Val Ala Pro Glu Asp Phe Ala Thr Cys Glu Phe Val Asp 

420 425 430 

Thr Ser Lys He Glu Leu Gin Arg He Val Arg Glu Gly Leu Asp Met 

435 440 445 

Leu Tyr Lys Glu Met Asn 
450 



(2) INFORMATION FOR SEQ ID NO: 266 

(1J SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
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10 



15 



20 



25 



30 



35 



55 



60 



65 



70 



75 



(A) ORGANISM: Porphyromonas gingival!* 

(ix) FEATURE : 

(A) NAME/KEY: roiscjeatuie 

(B) LOCATION 1 201 

(x±) SEQUENCE DESCRIPTION: SEQ ID NO: 266 



Glu 


Leu 


Ser 


Lys 


Cyn 


Tyr 


Met 


Asp 


Lys 


Val 


Ser 


Tyr 


Ala 


Leu 


Gly 


Leu 


1 








5 










10 










15 




Ser 


He 


Gly 


Asn 


Asn 


Phe 


Lys 


Ser 


Ser 


Gly 


He 


Asp 


Ser 


Val 


Val 


Met 








20 










25 










30 






Asp Asp 


Ph» 


Hot 


Gin 


Gly 


Leu 


Ser 


Asp 


Val 


Leu 


Glu 


Glu 


Lys 


Ala 


Pro 






35 










40 










4 5 






Gin 


Leu 


Ser 


Tyr 


Asp 


Glu 


Ala 


Lys 


Axg 


Glu 


He 


Glu 


Ala 


Tyr 


Phe 


Met 




50 










55 










60 








Asp 


Leu 


Gin 


Gin 


Lys 


Ala 


Val 


Lys 


Leu 


Asn 


Lys 


Glu 


Ala 


Gly Glu 


Glu 


65 










70 










75 










80 


Phe 


Leu 


Lys 


He 


Asn 


Ala 


His 


Lys 


Glu 


Gly 


Val 


Thr 


Thr 


Leu 


Pro 


Ser 










B5 










90 










95 




Gly 


If*iU 


Gin 


Tyr 


Clu 


Val 


He 


Lys 


Met 


Gly 


Glu 


Gly 


Pro 


Lys 


Pro 


Thr 








100 










105 










110 






Leu 


Ser 


Asp 


Thr 


Val 


Thr 


Cys 


His 


Tyr 


His 


Gly Thr Leu 


He 


Asn 


Gly 






115 










120 










125 






He 


Val 


Phe 


Asp 


Ser 


Ser 


Met 


Asp 


Arg 


Gly 


Glu 


Pro 


Ala 


Ser 


Phe 


Pro 




130 










135 










140 










Leu 


Arg 


Gly 


Val 


He 


Ala 


Gly 


Trp 


Thr 


Glu 


He 


Leu 


Gin 


Leu 


Met 


Pro 


145 










150 










155 










160 


Vat 


Gly 


Ser 


Lys 


Trp 


Lys 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Leu 


Ala 


Tyr Gly 










165 










170 










175 




Asp Arg 


Gly 


Ala 


Gly 


Glu 


His 


He 


Lys 


Pro 


Gly 


Ser 


Thr 


Leu 


He 


Phe 








180 










185 










190 






Tie 


He 


Glu 


Leu 


Leu 


Ser 


He 


Asn 


Lys 




















195 










200 

















{2) INFORMATION FOR SEQ ID HO: 267 



(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 
40 (B) TYPE: amino acid 

ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

IviJ ORIGINAL SOURCE: 

<A) ORGANISM: Porphyromonas gingival!*? 

50 (ix) FEATURE: 

(A) NAME/ KEY: miscf eature 

(B] LOCATION 1...279 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



Gin 


Lys 


Asn Lys 


Arg 


Lys 


Met 


Lys 


Lys 


Ala 


Leu 


Leu 


He Gly Ala Ala 


1 






5 










10 






15 


Leu 


Leu 


Gly Ala 


Val 


Ser 


Phe 


Ala 


Ser 


Ala 


Gin 


Ser 


Leu Ser Thr He 






20 










25 








30 


Lys 


Val 


Gin Asn 


Asn 


Ser 


Vol 


Gin 


Gin 


Pro 


Arg 


Glu 


Glu Ala Thr lie 






35 








40 








45 


Gin 


Val 


Cys Gly 


Glu 


Leu 


Ala 


Glu 


Gin 


Val 


Asp 


Cys 


He Gly Thr Gly 




50 








55 










60 


Atfn 


Ser 


Ala He 


He 


Ala 


Ala 


Ala 


Ala 


Lys 


Phe 


Glu 


Ser Asp Asp Leu 


65 








70 








75 




80 


Glu 


Ser 


Tyr Val 


Gly Trp 


Glu 


He 


Met 


Ser 


Val 


Asp 


Phe Phe Pro Gly 


Tyr 






85 










90 






95 


Lys 


Ala Cys 


Lys 


Tyr 


Thr 


Ser 


Ala 


Val 


Trp 


Ala 


Asp Asp Met Thr 






100 










105 






110 


He 


Leu 


Gly Gin 


Ser 


Glu 


Asp 


Ser 


Asp 


Pro 


Glu 


Met 


Gin Thr He Asn 






115 








120 










125 


Asn 


Leu 


Ala Leu 


Lys 


Thr 


Ser 


Val 


Lys 


He 


Glu 


Ala 


Gly Lys Asn Tyr 




130 








135 










140 


He 


Val 


Gly Tyr 


He 


Ala 


Asn 


Thr 


Ala 


r.ly 


Gly 


His 


Pro He Gly Cys 


145 








150 










155 




160 
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10 



15 



25 



30 



35 



40 



45 



50 



55 



60 



Atp 


Gin 


Gly Pro Ala 


Val 


Asp 






165 




Glu 


Asp 


Gly Gly Ala 


Thr 


Phe 






100 






Val 


Pro 


Thr Leu Asn 


Tyr 


Asn 






19b 






Gly 


Glu 


Gly Val Glu 


Ala 


Val 




210 






21b 


Val 


Gin 


A£ii Gly Val 


lie 


Tyr 


225 






230 


Ser 


Leu 


Phe Asp Met 


Asn 


Gly 






245 




Thr 


lie 


Ala Ala Pro 


Gin 


Lys 






260 




Lys 


Ser 


lie Lys Leu 


Ala 


He 



110 175 
Pro Pro Phe Glu Ser Leu His Gin Ala 

185 190 
He Tyr Val Val Val His Lsu Lys Ly* 
200 205 
Leu Thr Asn Asp Lys Ala hsn Ala Tyr 
220 

Val Ala Gly Ala Asn Gly Arg Gin Val 
235 240 
Lys Val Val Tyr Thr Gly Val Ser Glu 

2S0 255 
Gly Met Tyr He Leu Arg Val Gly Ala 
265 270 

i^ys Leu Ala He 

275 

(2) INFORMATION FOR SEQ ID NO: 268 

2® U> SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 157 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECUT.R TYPE: protein 

(ill) HYPOTHETICAL: YES 

( v I ) OR I G I U AL SOURCE : 

(A) ORGANISM: Porphyromonas gingivalis 

lix) FEATURE: 

(A) NAME/KEY: misc feature 
< B ) LOCATION 1 ... 1 57 

<xi) SEQUENCE DESCRIPTION: SEQ ID WO:26B 



Ara 


Unk 


Phe 


Leu 


Pro 


Glu 


Lys 


1 








5 




Thr 


Gin 


Glu 


Gly 


His 


Ala 


Val 








20 








Met 


Lys 


Gly 


Lys 


GJ y Asp 


Leu 






35 










Pro 


Met 


Pro 


Phe 


Val 


Pro 


Leu 




so 










55 


Ala 


Asn 


Phe 


Tyr 


He 


Tyr 


Ser 


65 










70 




Asp 


Val 


Leu 


Gin 


His 


Arg 


Met 










85 






Glu 


Leu 


His 


Val 


Glu 


Ala 


Leu 








100 








Val 


Tyr 


Asp 


Het 


Arg 


Gly 


Arg 






115 








Glu 


Ly* 


Leu 


Cys 


He 


Asp 


He 




130 








135 


Leu 


Arg 


He 


Gly 


Ser 


Tyr 


Ser 


145 










150 




(2) 


INFORMATION 


FOR 


SF.Q 


ID U 



10 15 
Gly Phe Gly Leu Asp Asp Gly Pro Ala 

25 30 
Val Gly Ser Tyr Leu Pro Gly Ala Ala 
40 45 
Ser Asp He Pro Ala Arg Ser Met Asp 
60 

Arq He Ser Leu Gly Ser Gly Thr Gin 

75 80 
Lys Val Tyr Pro Asn Pro Ala Thr Thr 

90 95 
Ser Ala Trp Val Gly Glu Gin Ala Ala 

105 no 
Arg Val Ser Ala Arg Thr Val Asp Ser 
120 125 
Ala Ser Leu Pro Val Gly Val Tyr Het 
140 

Ala Lys Phe Glu Lys Arg 
155 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 

(B) TYPE: amino acid 
gt* <D) TOPOLOGY: linear 

(it) MOLECULE TYPE: protein 

I ill* HYPOTHETICAL: YES 

70 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas gingival is 

( ix ) FEATURE : 
7I - (A) NAME/ KEY : misc feature 

/0 (S) LOCATION 1...562 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(xi) SEQUEIICE DESCRIPTION! SEQ ID NO: 269 



Thr Arg Asn Val 
X 

lie Ala Leu Ary 
20 

Pro Leu Ala Lys 
35 

Glu Thr Met Ser 
50 

Met Ala Val Ala 
65 

Arg lie Tyr Phe 

Ala Thr Tyr Gin 
100 

Phe Asp Val Ser 
115 

Asp lie Asn Val 
130 

Thr He Gly Asn 
145 

Asn Asn Thr Glu 

Met Gly Val Ala 
180 

Leu Gly Gly Asp 
195 

Ser Asp His Ser 
210 

His Phe tilu Gin 
225 

Asn Arg Val Asp 

Asn Thr Trp Pro 
260 

Asp Gly Phe Asp 
27b 

Arg Tyr Ala Trp 
290 

Thr Asp Phe Asn 
305 

Leu Asp Asp Asn 

Phe Leu He Thr 
310 

Asn Tyr Ser Pro 
35S 

His Cys He Gly 
370 

Tyr Leu Thr Thr 
385 

Lye Tyr Asp Asp 

Tyr Ala Lys Glu 
420 

Leu Asn Pro Thr 
435 

Ser Pro Tyr Asp 
450 

His Phe Leu Trp 
4 65 

Asp Val Leu Gin 

Lys Glu Tyr Val 
500 

Val Val Tyr Asp 
515 

Gly Lys Glu Tyr 
530 

Met Leu Lys Val 
545 

Val Glu 



Ser Leu He Lys 
5 

Leu Leu Leu Ala 

Ala Gin Met Asp 
40 

Thr Leu Sec Gly 

55 

Asd Asn Gly Trp 
70 

Asp Asp Val Arg 
85 

Lys Leu Gly Ser 

His Cys Asp Phe 
120 

Trp Thr Val Met 

135 

Gly Val Leu Leu 
150 

Cys Val Tyr Lys 
1S5 

Tift Ala Ser Asn 

Pro Phe Ala Luu 

200 

Phe T.eu Asp Tyr 
215 

Lys Arg He Tyr 
230 

Leu Ser I.eu Gly 
245 

Leu Met Gly Val 

He Gly Phe He 
280 

Ser Glu Pro He 
295 

Pro Leu Gly Ala 
310 

Ser Asp Asn Thr 
325 

Tyr Pro Gly His 

Gly His Thi Pro 
360 

Ho Pro Ala Leu 
375 

Phe Gin Asp His 
390 

He Asn Ser Phe 
405 

Ala Lys Asp Lys 

Asn Gly Lys Ala 
440 

Glu Thr Lyo Pro 
455 

Ser Asp Thr Glu 
470 

Lys Glu Gly Ser 
485 

Leu He Asn Leu 

Met Gin Gly Arg 
520 

Lys Leu Asn Val 
535 

Val Ala Asp Thr 
550 



Met Pro Arg He 
10 

Thr Phe Ala He 
25 

He Gly Gly Asp 

Tyr Ser Glu A3p 
60 

He Tyr Val Met 

75 

Leu Tyr Arg Ser 
90 

Leu Gly Ser Leu 
105 

He Vnl Thr Gly 

Thr Ala Phe Glu 
140 

Met His Arg His 
155 

Lys Asp Phe Pro 
170 

Tyr Arg Ala Pro 
185 

Ala Val Ala Val 

He Phe Ser Leu 
220 

Thr Arg Pro Gin 
235 

Ser Thr Ser Pro 
250 

Val Phe Glu Met 
265 

Ser Asn Phe Val 

He He Glu Glu 
300 

Leu Ser He Glu 

315 

Val Gly Gly Glu 

330 

Tyr Val Tyr Pro 
345 

Thr Lys Lys Asp 

Ala Tyr Asp Lys 
380 

Asn Leu Met Arg 
395 

Tyr Gly Trp Ser 
410 

Lys Arg Arg Arg 
425 

Cys Trp Val Trp 

His Pro Thr Pro 
460 

Trp Val His Ala 
475 

Met Lys Leu Tyr 
490 

Pro Lys Glu Gly 
505 

He Val Glu Lys 

Gin Tyr Leu Sar 
540 

Glu Tyr Phe Val 

555 



Met Lys Leu Lys 
15 

Val Leu Phe Ser 
30 

Asp Val I.eu Tl« 
45 

Phe Tyr Tyr Lys 

Leu Asp Phe Ser 
80 

Lys Asp Gly Gly 

95 

Val Pro Tyr Asp 
110 

Lys Asp Glu Asp 

125 

Tyr Val Gly Gly 

Asp Ala Asp He 
160 

Asn Asn Arg Leu 
175 

Ser Pro Tyr Gly 
190 

Ser Gly Ser Gly 
205 

Asp Gly Gly Val 

Lys Leu Thr He 
240 

Ser Leu Gly Phe 
255 

Asn Lys Asn Leu 
270 

Asp Tyr Asp Pro 
285 

Asp Cye Gly Trp 

lie Gin Met Met 
320 

Arg Ser His Asn 
335 

Lys Gin Ser Phe 

350 

Leu Val Phe Lys 
365 

Glu Gly Asp Arg 

Tyr Arg Trp He 
400 

Trp Pro Tyr Val 
415 

Pro Gin Val Ala 
430 

His Thr Arg Ly* 
445 

Val He He Lys 

Leu Asp Val Gly 
480 

Pro Asn Pro Ala 
4 95 

Gly His Glu Ala 
510 

Val Ser Phe Ser 
525 

Lys Gly Thr Tyr 

Glu Lys He He 
560 
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(2) INFORMATION FOR SEQ ID NO: 210 

it) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalls 

(ix) FEATURE: 

(A) NAME/ KEY: raise feature 
IB) LOCATION 1...3 91 

(Xl) SEQUENCE DESCRIPTION: SEQ ID HO: 270 

Gin Met Lyc Arg Leu Leu Pro Phe Leu Leu Leu Ala Gly Leu Val Ala 

. 5 10 IS 

Val Gly A-n Val Ser Ala Gin Ser Pro Arg He Pro Gin Val Asp Val 

20 ' 25 30 

His Thr Arg He Ala Acq Asn Ala Arg Tyr Arg Leu Asp Lys 71- S fr 

35 40 45 

Val Pro Asp Ser Arg Gin He Phe Asp Tyr Phe Tyr Lys GIu Glu Thr 



55 60 



lie Pro Thr Lys He Gin Thr Thr Thr Gly Gly Ala He Thr Ser lie 



65 70 



15 80 



Asp Ser Leu Phe Tyr Glu Asp Asp Arg Leu Val Gin Val Arg Tyr Ph- 

85 90 95 

Asp Asn Asn Leu Glu Leu Lys Gin Ala Glu Lys Tyr Val Tyr Asp Gly 

100 105 no 

Ser Lys Leu Val Leu Arg Glu He Arg Lys Ser Pro Thr Asp Glu Thr 

115 120 125 

Pro lie Lys Lys Val Ser Tyr His Tyr Leu Cys Gly Scr Asp Met Pro 

130 135 140 

Phe Glu He Thr Thr Glu Met Ser Asp Gly Tyr Phe Glu Ser His Thr 
3 4S „ 150 355 160 

Leu Asn Tyr Leu Asn Gly Lys He Ala Arg He Asp He Met Thr Gin 

165 1^0 175 

Gin Asn Pro Ser Ala Glu Leu He Glu Thr Gly Arg Met Val Tyr Glu 

180 185 190 

Phe Asp Ala Asn Asn Asp Ala Val Leu Leu Arg Asp Ser Val Ph° Leu 

195 200 20* 

Pro Leu Gin Asn Lys Trp Val Glu Met Phe Thr His Arg Tyr Thr Tyr 

210 215 220 

Asp Asn Lys His Asn Cys He Arg Trp Glu Gin Asp Glu Phe Gly Thr 

7 , 230 235 240 

Leu Thr Leu Ala Asn Asn Phe Glu Tyr Asp Thr Thr He Pro Leu Ser 

245 250 255 

Ser Val Leu Phe Pro Thr His Glu Glu Phe Phe Arg Pro Leu Leu Pro 

260 265 270 

Asn Phe Met Lys His Met Arg Thr Lys Gin Thr Tyr Phe Asn Asn Ser 
275 280 285 

y Gly LeU Ser G1U Val CVff T Y" r ** n *yr ph * T Yr Thr Asp 

2 f° 295 300 

Met Gin Gly Asn Ala Leu Thr Asp Val Ala Val Asn Glu Ser He Lys 

x? . 310 31* 320 

He Tyr Pro Arg Pro Ala Thr Asp Phe Leu Arg He Glu Gly Ser Gin 

32 " 330 335 

Leu Leu Arg Leu Ser Leu Phe Asp Met Asn Gly Lys Leu He Arg Ala 

340 345 350 

Thr Glu Leu Thr Gly Asp Leu Ala He He Gly Val Ala Ser Leu Pro 

355 360 3 65 

Arg Gly Thr Tyr He Ala Glu He Thr Ala Ala Asn Ser Lys Thr II- 

» 3 ?° 37S 3tJ o 

Arg Ala Lys Val Ser Leu Arq 
385 ago 

(2 J INFORMATION FOR SEQ ID NO: 271 

(1) SEQUENCE CHARACTERISTICS: 
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10 



15 



25 



30 



50 



187/ 490 

(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vl| ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroroonas gingivalis 

(ixt FEATURE: 

(A) NAME/ KEY : raisc_f eature 

(B) LOCATION 1...4Z8 

Ixi) SEQUENCE DESCRIPTION: SEQ ID HO: 271 



Lys Arg Asn Pro Leu Pro Leu Thr Aia Ser Asn Arg Lys He Phe li- 
on 1 5 10 * 5 

Lie Met Arg Gin His Leu Ser Leu Phe Pro Phe He Leu Phe Leu Leu 

20 25 30 

Leu Ala Phe Ser Tyr Val Gly Cys Arg Thr Val Arg Gin Thr Pro Lys 

35 40 45 

Gin Ser Glu Arg Tyr Val Val val L-u Ser Leu Asp Gly Phe Arg Pro 

bu 55 60 

Asp Tyr Thr Asp Arg Ala Arg Thr Pro Ala Leu Asp Arg Met Ala Gin. 
65 70 75 60 

Glu Gly Leu Ser Gly Ser Leu Gin Pro Cys Phe Pro Ser Leu Thr Phe 

85 go 95 

Pro Asn His Tyr Ser Met Aia Thr Gly Leu Tyr Pro Asp His His Gl^' 

100 105 HO 

He Val Ala Asn Glu Phe Val Asp Ser Leu Leu Gly He Phe Arg He 

H* 120 125 

Ser Asp Arg Lys Ala Val Glu Thr Pro Gly Phe Trp Gly Gly Glu Pro 
35 130 135 140 

Val Trp Asn Thr Ala A3 a Arg Gin Gly He Arg Thr Gly Val Tyr Phe 
145 150 155 160 

Trp Val Gly Ser Glu Thr Ala Val Asn Gly Asn Arg Pro Trp Arg Trp 

An 165 170 175 

W Lye Lys Phe Ser Ser Thr Val Pro Phe Arg Asp Arg Ala Asp Ser Val 
180 185 190 

He Ala Trp Leu Gly Leu Pro Glu Lys Glu Arg Pro Arg Leu Leu Mf»r 

195 200 205 

Trp Tyr He Glu Glu Pro Asp Met He Gly His Ser Gin Thr Pro Glu 
45 210 215 220 

Ser Pro Leu Thr Leu Ala MeL Val Glu Arg Leu Asp Scr Val Val Gly 
225 230 235 240 

Tyr Phe Arg Lys Arg Leu Asp Ser Leu Pro He Ala Ala Gin Thr Aso 

245 250 255 

Phe He He Val Ser Asp His Gly Met Ala Thr Tyr Glu Asn Glu Lys 

260 265 270 

Cys Val Asn Leu Ser His Tyr Leu Pro Ala Asp Ser Phe Leu Tyr H°t 

2*75 280 285 

Ala Thr Gly Ala Phe Thr His Leu Tyr Pro Lys Pro Ser Tyr Thr Glu 
55 290 295 300 

Arg Ala Tyr Glu He Leu Arg Ala He Pro His He Ser Val Tyr Arg 
305 310 315 320 

Lys Gly Glu Val Pro Lys Arg Leu Arg Cys Gly Thr Asn Pro Arg Leu 
cn 325 330 335 

t5U Gly Glu Leu Val Val He Pro Asp He Gly Ser Thr Val Phe Phe Ala 
340 345 350 

He Asn Glu Asp Val Arg Pro Gly Ala Ala His Gly Tyr Asp Asn Gin 

355 360 365 

Ala Pro Glu Met Arq Ala Leu Leu Arg Ala Val Gly Pro Asp Phe Arg 
t>5 370 37S 380 

Pro Gly Ser Arg Val Glu Asn Leu Pro Asn He Thr He Tyr Pro L-u 
3 3S 390 395 400 

He Cys Arg Leu Leu Gly He Glu Pro Ala Pro Asn Asp Ala Asp Glu 
7n 40S 410 415 

/U Thr Leu Leu Asn Gly Leu II <* Arg Asp Lys Arg Pro 
420 425 

(2) INFORMATION FOR SEQ ID NO: 272 

75 (ij SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

Uxi) HYPOTHETICAL: YES 



10 



15 



30 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

( ix i FEATURE : 

(A) NAME/ KEY: miscjeature 
f B ) LOCATION 1 . . .282 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:272 



Leu Ser Axg Gly Val Phe Pro Leu Met Tyr Gly Arg Arg Gly Ser li- 
on 5 10 15 
4V Arg Ala Ser Scr Gly His Arg Asp Lys He Phe Lys Asn Thr He He 
20 25 30 
Arg Phe He Thr Met Lys Val Gly Leu Phe He Pro Cys Tyr Val Asn 
33 40 45 

?R Ala In 1 ^ Pr ° G1U Val Gly Ile Ala Thr T V^ ^ Leu Leu Lys Ser 
^° 50 55 60 

Leu Asp He Asp Val Asp Tyr Pro Het Asp Gin Thr Cys Cys Gly Gin 

60 70 75 00 

Pro Met Ala Asn Ala Gly Phe Glu Gin Lys Ala Gin Lys Leu Ala L~u 

65 90 q 5 

Arg Phe Glu Glu Leu Phe Glu Ser Tyr Asp Val Val Val Gly Pro Ser 
fcl o 100 105 HO 

Ala Ser Cy* Val Ala Phe Val Lys Glu Asn Tyr- Asp His He Leu Arg 
115 120 125 

Pr ° ™ Gly HiS Val CyS Lys Ser Aia Ma Val Ar g Asp He cys 

00 130 135 140 

Glu Phe Leu His Asp Asp Leu Lys lie Thr Ser Leu Pro Ser Arg Phe 

14 150 155 160 

Ala His Lys Val Ser Leu His Aen Ser Cye His Gly Val Arg Glu Leu 

ATi 165 170 175 

*U His Leu Ser Thr Pro Scr Glu Val His Arg Pro Tyr His Asn Lys Val 

ieo xqs 190 

Arg Arg Leu Leu Glu Met Val Gin Gly Ile Glu Val Phe Glu Pro Lys 
195 200 205 

Arg III A3P G1U CyS Cys Gly Pne G1 y GJL Y Hst Tyr Ser Val Glu Glu 
^ D 210 215 220 

Pro Glu Val Ser Thr Cys Met Gly His Asp Lys Val Leu Asp His II- 
230 235 240 

Ser Thr Gly Ala Glu Tyr Ile Thr Gly Pro Asp Ser Ser Cy* Leu Met 

ftn 245 250 255 

OU His Met Gin Gly Val Ile Asp Arg Glu Lys Leu Pro Ile Lys Thr Ile 
260 265 270 

His Ala Val Glu He Leu Ala Ala Asn Leu 
275 280 

55 (2) II I FORI 1ATI ON FOR SEQ ID NO:273 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 
IB> TYPE: amino a=id 
ID) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

tvx) ORIGINAL SOURCE: 

I A) ORGANISM: Porphyromonas gingival is 

Ux) FEATURE: 

(A) NAME/ KEY : mlsc_featur* 

(B) LOCATION 1. . .251 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 273 
Pro Leu Lys Lys Arg Met Asp He Vai Ser Mst Ala Asp Lys Ala Leu 
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1 5 10 15 

Val Val Glu Met Arq Asp Val Thr Leu Cys Gin Glu Glu Asn Val lie 

20 25 30 

Phe Gin Asn Leu Asn Leu Thr Leu Ser Ala Gly Asp Phe Val Tyr Leu 

35 40 -45 

He Gly Scr Val Gly Ser Gly Lys Ser Thr Leu Leu Lys Ala Leu Tyr 

*° . 55 60 

Ala Glu Val Pro He Ser Ala Gly Tyr Ala Arq Val He Asp Tyr Asp 

f », * 70 75 80 

Leu Ala Lys Leu Lys Arg Lys Gin Leu Pro Tyr Leu Arg Arg Asn I.ou 

85 90 95 

Gly He Val Phe Gin Asp Phe Gin Leu Leu Asn Gly Arq Thr Val Ala 

100 105 no ' 

ulu Asn Leu Asp Phe Val Leu Arq Ala Thr Asp Trp Lye Asn Arg Ala 

115 120 125 

As-p Arg Glu Gin Arg He Glu Glu Val Leu Thr Arq Val Gly Met Ser 

130 las 140 

Aig Lys Ala Tyr Lys Arq Pro His Glu Leu Ser Gly Gly Glu Gin Gin 
145 ISO 155 160 

Arg Val Gly He Ala Arg Ala Leu Leu Ala Lys Pro Ala Leu He Leu 

165 170 175 

Ala Asp Glu Pro Thr Gly Asn Leu Asp Ser Val Thr Gly L«u Gin II- 

180 185 190 

Ala Ser Leu Leu Tyr Glu He Ser Lys Gin Gly Thr Ala Val Leu Met 
ZD 195 - 200 205 

Ser Thr Hie Asn Ser Ser Leu Leu Ser Hie Leu Pro Ala Arg Thr Leu 

210 215 220 

Ala Val Arg Lys Asn Gly Asp Ala Ser Ser Leu Val Glu Lau Ser Ala 
22 5 , 230 235 240 

Asp Ala Val Ser Arq Lys Asn Thr Glu He Asp 
24S 



10 



15 



20 



30 



(2) INFORIIATION FOR SEQ ID MO: 274 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 



40 



50 



55 



60 



65 



70 



75 



Hi) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
45 (A> ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) NAME/KEY: misc_feaLure 

(B) LOCATION 1...238 

(Xi» SEQUENCE DESCRIPTION: SEQ ID NO:274 



65 



115 
Leu Gly 
130 



Cys 


Pro 


Ala 


Cys 


Arg 


Ser 


Ser 


Ph- 


His Leu He Lys Thr 


Met 


5 










10 




15 


He 


Glu 


He 


Ser 


Asn 


Leu 


Thr 


Lys Val Phe Arg Thr 


20 










25 






30 


Glu 


Thr 


Vol 


Ala 


Leu Asp Gly 


Val 


Ser Leu Lys Val Asp 










40 








45 


Phe 


He 


Ala 


He 
55 


Met 


Gly 


Pro 


S»r 


Gly Cys Gly Lys Ser 
60 


Asn 


He 


Leu 


Gly Leu 


Leu 


Asp 


Asn 


Pro Thr Ser Gly He 






70 










75 


80 


Asp 


Gly 


Ala 


Glu 


Val 


Gly Asn 


Leu 


Arg Glu Lys Asp Arg 




85 










90 




95 


Arg 


Lys 


Gly Asn 


He 


Gly 


Phe 


Val 


Phe Gin Ser Phe Asn 


100 










105 






110 


Glu 


Met 


Thr 


Val 


Ser 


Glu 


Asn 


Val 


Glu Leu Pro Leu Val 


Val 








120 








125 


Lys 


Ala 


Ser 
135 


Glu 


Arg 


Lys 


Glu 


Arg Val Glu Glu Ala 
140 


Het 


Ser 


He 


Ser 


His 


Arg 


Ala 


Gly 


His Phe Pro Asn Gin 


Gly 




150 










155 


160 


Gin 


Gin 


Gin 


Arg 


Val 


Ala 


He 


Ala Arg Ala Val Val 


Lys 


165 










170 




175 


Lou 


He 


Leu 


Ala 


Asp 


Glu 


Pro 


Thr Gly Asn Leu Asp 
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180 185 190 

Ser Lys Asn Gly Ala Asp Val Met Glu Leu Leu Arg Gly Leu Asn Arq 

195 200 205 

Glu Gly Ala Thr lie Val Met Val Thr His Ser Glu His Asp Ala Arq 

* 10 215 220 

Ser Ala Gly Arg lie lie Asn Leu Phe Asp Gly Lye Tie Art] 
225 230 235 

12) INFORMATION FOR SEQ ID HO: 275 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: €04 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protain 

(iii) HYPOTHETICAL: YES 

(vi) ORICIMAL SOURCE: 

(A) ORGANISM; Porphyromonas gingival as 

tlx) FEATURE: 

(A) NAME/5CEY: misc feature 

(B) LOCATION 1...604 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

Ser Thr Glu Thr Asn Ser Lys Ser Glu Met Lys Glu Phe Phe Lys H*t 

b 1° 15 

Phe Phe Ala Ser He Leu Gly Val He Thr Ala Gly He He L-u Ph- 

20 25 30 

Cys He Phe Leu Phe He Phe Phe Gly He Val Ala Gly He Ala Ser 

35 40 45 

Lys Ala Thr Gly Gly Thr He Pro Lys He Glu Ala Asn Ser He Leu 

1,0 55 go 

His He Unk Asn Ser Ser Phe Pro Glu He Val Ser Ala Asn Pro Trp 
65 70 75 go 

Ser Met Leu Thr Gly Lys Asp Glu Ser Val Ser Leu Ser Gin Ala Val 

85 90 95 

Glu Ala He Gly Gin Ala Lys Asn Asn Pro Asn He Thr Gly He Phe 

100 105 no 

Leu Asp Leu Asp Asn Leu Ser Val Gly Met Ala Ser Ala Glu Glu Leu 

115 120 125 

Arg Arg Ala Leu Gin Asp Phe Lys Met Ser Gly Lys Phe Val Val Ser 

130 135 140 

Tyr Ala Asp Arg Tyr Thr Gin Lys Gly Tyr Tyr Leu Ser Ser He Ala 
I t 150 155 160 

Asp Lys Leu Tyr Leu Asn Pro Lys Gly Met Leu Gly Leu He Gly He 

165 1 7 0 175 

Ala Thr Gin Thr Met Phe Tyr Lys Asp Ala Leu Asp Lys Ph- Gly Val 

1°° 185 190 

Lys Met Glu He Phe Lys Val Gly Thr Tyr Lys Ala Ala Val Glu Pro 
195 20" 205 

Met Leu Asn Arg Met Ser Asp Ala Asn Arg Glu Gin He Thr Thr 
210 215 22Q 

Tyr He Asn Gly Let, Trp Asp Lys He Thr Ser Asp He Ala Glu Ser 
230 235 240 

Arg Lys Thr Ala Met Asp Ser Val Lys Met Phe Ala Asp Lys Gly Glu 

245 250 255 

Met Phe Gly Leu Ala Glu Lys Ala Val Glu Met Lys Leu Val Asp Glu 

260 265 270 

Leu Ala Tyr Arg Thr Atp Val Glu Lys Glu Leu Lys Lys Met Ser Gin 
275 260 285 

G1U LyS ASP Glu Leu Ar 9 Pne Val Leu Ser Gin Val I.-u 

Z9 ° 295 300 

Ala Asn G±y Pro Met Asn Lys Thr Lys Gly Ser Arg He Ala Val L-u 

310 315 
Phe Ala Glu Gly Glu He Thr Glu Glu He He Lys Lys Pro Phe Asp 

Thr Asp Gly Ser Ser Tie Thr Gin Glu Leu Ala Lys Glu He Lys Ala 

B1 340 345 350 

Ala Ala Asp Asp Asp Asp He Lys Ala Val Val Leu Arg Val Asn S-r 

360 3 6 5 
Pro Gly Gly Sor Ala Phe Thr Ser Glu Gin He Trp Lys Gin Val Ala 
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370 375 380 

Asp Leu Lys Ala Lys Lys Pro lis Val Val Ser Hot Gly Asp Val Ala 
385 390 395 4 00 

Ala Ser Gly Gly Tyr Tyr He Ala Cys Ala Ala Asn Ser lie Val Ala 

405 410 415 

Glu His Thr Thr Leu Thr Gly Ser He Gly He Pho Gly Met Phe Pro 

420 425 430 

Asn Phs Ala Gly Val Ala Lys Lys He Gly Val Asn Met Asp Val Val 

435 440 445 

Gin Thr Ser Lys Tyr Ala Asp Leu Gly Asn Thr Phe Ala Pro Met Thr 

450 455 460 

Val Glu Asp Arg Ala Leu He Gin Arg Tyr He Glu Gin Gly Tyr Asp 
465 470 475 480 

Leu Phe Leu Thr Arg Val Ser Glu Gly Arq Asn Arg Thr Lys Ala Gin 

485 490 495 

He Asp Ser He Ala Gin Gly Arg Val Trp Leu GJy Asp Lys Ala Leu 

500 505 sio 

Ala Leu Gly Leu Val Asp Glu Leu Gly Gly Leu Asp Thr Ala He Lys 

515 520 525 

Arg Ala Ala Lys Leu Ala Gin Leu Gly Gly Asn Tyr Ser Tie Glu Tvr 

530 535 b4 0 , 

Gly Lys Thr Lys Arg Asn Phe Phe Glu Glu Leu Leu Ser Ser Ser Al« 
545 550 555 , S60 

Ala Asp Met Lys Scr Ala He Leu Ser Thr He Leu Ser Asp Pro Glu 

565 570 575 

He Glu Val Leu Arg Glu Leu Arg Ser Met Pro Pro Arg Pro Ser Gly 

S80 585 590 

He Gin Ala Arg Leu Pro Tyr Tyr Phe Met Pro Tyr 
595 600 



(2) IMPORTATION FOR SEQ ID HO: 27 6 

til SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 
<B) TYPE: amino acid 
CD) TOPOLOGY: lineal 

tii) MOLECULE TYPE; protein 

<iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyroroona.<» gingival is 

lix) FEATURE: 

(A) MAI IE /KEY : miscjeature 
(B> LOCATION 1. . .324 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 



Leu 


Thr 


Leu 


Phe 


Cys 


Cys 


Asn 


Tyr 


Phe 


Lys 


Gin 


Met Arg Aid Asn 


He 


1 








5 










10 




15 




Trp Gin 


lie 


Leu 


Ser 


Val 


Ser 


Val 


Leu 


Phe 


Phe 


Phe Gly Thr Ala 


He 








20 










25 






30 


Gly Gin Ala 
35 


Gin 


Ser Arg 


Asn 


Arg 
40 


Thr 


Tyr 


Glu 


Ala Tyr Val Lys 
45 


Gin 


Tyr 


Ala 


Asp Glu 


Ala 


He 


Arg 


Gin 


Met 


Ser 


Arg 


Tyr Asn lie Pro 


Ala 




50 










55 








60 




Ser 


Tie 


Thr 


He 


Ala 


Gin 


Ala 


Leu 


Val 


Glu 


Thr" 


Gly Ala Gly Ala 


Ser 


65 










70 










75 


60 


Thr 


Leu 


Ala 


Ser 


Val 


His 


Asn 


Asn 


His 


Phe 


Gly 


He Lys Cys His 


Lys 




Trp 






B5 










90 




95 


Ser 


Thr 


Gly 


Lys 


Arg 


Thr 


Tyr 


Arg 


Thr 


Asp 


Asp Ala Pro Asn 


Glu 


Cys 


Phe 




100 










105 






110 




Arg 


Ser 


Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Ser 


Tyr Glu Asp His 


Ser 


Arg 




115 










120 








125 


Phe 
130 


Leu 


Leu 


Gin 


Pro 


Arg 

135 


Tyr Arg 


Pro 


Leu 


Phe Lys Leu Asp Arg 

140 


Glu 
145 


Asp 


Tyr 


Arg 


Gly Trp Ala 


Thr 


Gly 


Leu 


Gin 


Arg Cys Gly Tyr Ala 


Asn 








150 










155 




160 


Thr 


Arg 


Gly 


Tyr 


Ala 


Asn 


Leu 


Leu 


He 


Lys 


Met Val Glu Leu 


Tyr 


Glu 


Leu 






165 










1/0 




175 


Tyr 


Ala 


Leu 


Asp Arg 


Glu 


Lys 


Tyr 


Pro 


Ser Trp Phe His 


Lys 


Ser 






190 










185 






190 


Tyr 


Pro 


Gly 


Ser 


Asn 


Lys 


Lys 


Ser 


His 


Gin 


Thr Thr Lys Gin 


Ly3 
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195 ZOO 205 

Ser Gly Leu Lys His Glu Ala Tyr Phe Ser Tyr Gly Leu Leu Tyr 
Z1 ° 215 220 

lie lie Ala Lys Gin Gly Asp Thr Phe A.rp Ser Leu Ala Glu Glu Phe 
fc " 5 230 235 240 



Asp Met Arg Ala Ser Lys Leu Ala Lys Tyr Asn A*p Ala Pro Val A^p 
2-JS 250 - * 

Lys Gly Asp Val He 
260 265 27Q 



B L ~ — ^ 2S5 

Phe Pro He Glu Lys Gly Asp Val He Tyr Leu Glu Lys Lys His Ala 

260 2*5 270 

Cys Ser lie Ser Lys His Thr Gin His Val Val Arg Val Gly Asp Ser 

280 295 

Met His Ser He Ser Gin Arg Tyr Gly He Arg Met Lys Asn Leu Tyr 

£yu 295 300 

305 ^ ** P Gli " GlU Tyr 116 Pr ° Gln Glu G1 *' A3 P Tle 



305 

Leu Arg Leu Arg 



310 315 * 32Q 



(?) II I FORMATION FOR SEQ ID HO: 277 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 
(B> TYFE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TY:?E: protein 

(iii) HYPOTHETICAL: YES 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME /KEY : mi ec _f eature 

(B) LOCATION 1. . .533 

(Xi> SEQUENCE DESCRIPTION: SEQ ID HO: 277 

Arg He Pro Asp Glu Gin Thr Gly Arg Il s Met Asp Gly Arg Arg Tyr 

i 5 10 15 

Ser Asp Gly Leu H is Gin Ala He Glu Ala Lys Glu His Val Lys Val 

Glu Ala Ala Thr Gin Thr Phe Ala Thr II- Thr Leu Gin A^n Tyr Phe 

Arg Met Tyr His Lys Leu Ala Gly Met Thr Gly Thr Ala Glu Thr Glu 

55 $q 
Ala ely Glu Leu Trp Asp He Tyr Lys I.cu Asp Val Val Val He Pro 

Thr Asn Lys Pro He Ala Arg Lys Asp Met As*n Asp Arg He Tyr Lys 

85 go 95 

Thr Ala Arg Glu Lys Tyr Ala Ala Val He Glu Glu lie Val Arg Leu 

105 no 

Val Glu Glu Gly Arg Pro Val Leu Val Gly Thr Thr Ser Val Glu II- 

1X5 120 125 

Ser Glu Leu Leu Ser Axg Met Leu Arg T.eu Arg Gly He Gin His Asn 

Vol Leu Asn Ala Lys Leu His Gin Lys Glu Ala Glu He Val Ala Gin 
150 155 . „ 

Ala Gly Gin Lys Gly Thr Val Thr He Ala Thr A*n Met Ala Gly Arg 

165 170 175 

Gly Thr Asp He Lys Leu Ser Ala Glu Val Lys Lys Ala Gly Gly L«m, 

185 i9 0 
Ala He He Gly Thr Glu Arg His Glu Ser Arg Arg Val Asp Ary Gin 

200 205 
Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp Pro Gly Ser Ser He Ph- 
210 215 220 



Tyr Val Ser Leu Glu Asp His Leu Met Arg Leu Phe Ala Thr Glu Ly; 

Ala Ser Leu Met Asp 

245 250 255 



He Ala Ser Leu Met Asp Arg Leu Gly Phe Lys Glu Gly Glu Val Leu 



Glu Asn Asn Met Leu Ser Lys Ser Val Glu Arg Ala Gin Lys Lys Val 

0 265 270 

Glu Glu Asn Asn Phe Gly n e Axg Lys His Leu Leu Glu Tyr Asp Asp 

280 285 

Val Met Asn Ser Gin Arg Glu Val He Tyr Thr Arg Arg Arg His Ala 
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290 










295 










300 








Leu 


Hut 


Gly 


Glu 


Arg 


He 


Gly 


Met 


Asp 


Val 


Leu 


Asn 


Thr 


He 


Tyr Asp 


305 










310 










315 








320 


Val 


Cys 


Lys 


Ala 


Leu 


lie 


Asp 


Asn 


Tyr 


Ala 


Glu 


Ala 


Asn 


Asp 


Phe Glu 










325 










330 








335 


Gly 


Phe 


Lys 


Glu 
340 


Asp 


Leu 


Hat 


Arg 


Ala 
34S 


Leu 


Ala 


He 


Glu 


Ser 

350 


Pro He 


Thr 


Cln 


Glu 


lie 


Phe 


Arg Gly 


Lys 


Lys 


Ala 


Glu 


Glu 


Leu 


Thr 


Asp Het 






355 










360 










365 




Leu 


Phe 


Asp 


Glu 


Ala 


Tyr 


Lys 


Ser 


Phe 


Gin 


Arg 


Lys 


Met 


Asp 


Leu He 




37 0 










375 










380 






Ala 


Glu 


Val 


Ala 


His 


Pro 


Val 


Val 


His 


Gin 


val 


Phe 


Glu 


Thr 


Gin Ala 


365 










390 










395 








400 


Ala 


Val 


Tyr 


Glu 


Arg 


Tie 


Leu 


He 


Pro 


He 


Thr Asp Gly Lys Arg Val 










405 










410 










415 


Tyr Asn 


lie 


Gly 


Cys 


Asn 


Leu 


Arg 


Glu 


Ala 


Asp Glu 


Thr 


Gin 


Gly Lys 








420 










425 










430 


Ser 


lie 


He 


Lys 


Glu 


Phe 


GlU 


Lys 


Ala 


He 


Val 


Leu 


His 


Thr 


He Asp 






435 










440 










445 




Glu 


Ser 


Trp 


Lys 


Ulu 


His 


Leu 


Ary 


Glu 


Met 


Asp 


Glu 


Leu 


Arg 


Asn Ser 




4 50 










4 55 








460 






Val 


Gin 


Asn 


Ala 


Ser 


Tyr Glu 


Asn 


Lys 


Asp 


Pro 


Leu 


Leu 


lie 


Tyr Lys 


465 










470 










475 








480 


I«cu 


Glu 


Ser 


Tyr 


Glu 

48S 


Leu 


Phe 


Arg 


Lys 


Met 
4 90 


Val 


Glu 


Ala 


Met 


Asn Arg 
495 


Lys 


Thr 


Val 


Ala 


He 


Leu 


Met 


Arg 


Ala 


Arg 


He 


Pro 


Val 


Pro 


Glu Ala 








500 










505 








510 




Pro 


Ser 


Gin 


Glu 


Glu 


Leu 


Glu 


His 


Arg 


Arg 


Gin 


He 


Glu 


He 


Arg Hlf 






515 










520 










525 




Ala 


Thr 


Gin 


Gin 


Arg 
























530 



























(2) INFORMATION FOR SEQ ID HO: 276 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill I HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyr onion as gingiva lis 

(ix) FEATURE: 

(A) NAME /KEY : ml cofeature 

( B ) LOCATION 1 ... 7 20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 



Lys 


Ser 


Cys 


Arg 


Val 


He 


Gly 


Gin 


Thr 


Arg 


Arg 


Tyr 


Gly Cys Cys 


Pro 


1 








5 










10 






15 




Val 


Gly 


Leu 


Arg 


Pro 


Cy3 


His 


Gin 


Ser 


Leu 


Ser 


Gly 


Val Cys Arg 


Cys 








20 










25 








30 


Leu 


Phe 


Gin 


Ser 


Arg 


Pro 


Ala 


Val 


Ala 


Phe 


Ala 


Arg 


Lys Gly Gin Arg 






35 










40 










45 




Arg 


His 


Axg 


Arg 


Ser 


Glu 


Ser 


Gly 


Arg 


Arg 


lie 


Arg 


Ala Leu Gin 


Gly 




50 










55 










60 




Val 


Gin 


Has 


His 


Gin 


Thr 


Asn 


Glu 


His 


Glu 


Val 


Met 


He Ser Val 


Asn 


65 










70 










75 






60 


Asn 


Leu 


Thr 


Val 


Asp 
85 


Phe 


Gly 


Thr 


Arg 


Leu 
90 


Leu 


Phe 


Asp Gin Val 
95 


Ser 


Phe 


Val 


He 


Asn 


Arg 


Arg 


Asp 


Arg 


He 


Ala 


Leu 


Val 


Gly Lys Asn 


Gly 


Ala 






100 










105 








110 


Gly 


Lys 


Ser 


Thr 


Leu 


Leu 


Lys 


Leu 


lis 


Ala 


Gly 


Met Glu Glu 


Pro 






115 










120 








125 




Thr 


Ser 
130 


Gly 


His 


He 


Ala 


Arg 
135 


Pro 


Lys 


Gly 


He 


Arg 
140 


He Gly Tyr 


Leu 


Pro 


Gin 


Val 


Met 


Arg 


Leu 


Gin 


Asp 


Gly 


Hia 


Thr 


Val 


Tyr Glu Glu 


Val 


145 


Gin 








150 










1SS 




160 


Glu 


Ala 


Phe 


Asn 


Asp 


He 


Arg 


Gin 


He 


Glu 


GlU 


Glu Tie Arg 


Arg 


Lsu 








165 










170 






175 


Ser 


Asp 


Glu 


Met 


Ala 


Gly 


Arg 


Thr 


Asp 


Tyr 


Glu 


Ser Asp Asp 


Tyr 
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180 185 190 

He Arg Leu He Glu His Tyr Thr Asn Met Ser Glu Thr Leu Scr 

1?5 200 205 

Het Gin Gin Gly Asn Tyr His Ala Ala He Glu Gin Thr Leu He Gly 

210 215 220 

Leu Gly Phe Gly Arg Glu Asp Phe His Arg Pro Thr Ala Asp Phe Ser 
22S 230 235 240 

Gly Gly Trp Arg Met Arg life Glu Leu Ala Lys Leu Leu Leu Gin Arg 

245 250 255 

Pro Glu Val Leu Leu Leu Asp Glu Pro Thr Asn His Leu Asp He Glu 

260 265 270 

S~r He Gly Trp Leu Glu Gin Phe He Ala Thr Asn Ala Gly Ala Val 

275 280 285 

He Leu Val Ser His Asp Arg Ala Phe He Asp Asn Thr The Thr Arg 

290 295 300 

Thr Tic Glu He Glu Leu Gly His He Tyr Asp Tyr Lys Thr Asn Tyr 
30S 310 315 320 

Ser H15 Tyr Val Glu Lau Arg Glu Glu Arg Leu Arg Gin Gin Hot Arg 

32 & 330 335 

Ala Tyr Glu Asn Cln Gin Lys I let He Arg Asp Thr Glu Asp Phe He 

340 345 
Glu Arg Phe Arg Tyr Lys Ala Thr Lys Scr Val Gin Val Gin Ser Arg 

355 360 365 

He Lys Gin Leu Glu Lys Val Glu Arg Val Glu He Asp Glu Arg Asp 

37 0 375 380 

Arg Ser Ala Phe His Phe Arg Phe lie Pro Ala Gin Pro Ser Gly S«r 
385 390 395 400 

Tyr Pro Leu He Val Asp Asp Leu Ala Lys Ala Tyr Gly Asp His Gin 

405 410 41b 

Vai Phe Ser Gly Ala Thr Tyr Thr He Glu Arg Gly Glu Lys Val Ala 

420 425 430 

Phe Val Gly Lys Asn Gly Ala Gly Lys Ser Thr Met Val Lys Cys He 

435 440 445 

Met Gly Glu Leu Thr Asp Tyr Thr Gly Lys Leu Glu Leu Gly His Asn 

450 455 460 

Val Gin Leu Gly Tyr Phe Ala Gin Asn Glu Ala Gin Glu Leu Arg Gly 
465 470 47b 480 

Asp Leu Thr Val Phe Asp Thr He Asp Arg Glu Ala Val Gly Asp He 

485 490 495 

Arg I.eu Arg Leu Asn Asp Leu Leu Gly Ala Phe Leu Phe Gly Gly Glu 

500 505 510 

Ala Ser Glu Lys Lys Val Ser Val Leu Ser Gly Gly Glu Arg Ala Aro 

515 520 525 

Leu Ala He He Arg Leu Leu Leu Gin Pro Ala Asn Pho Leu He Leu 

530 535 540 

Asp Glu Pro Thr Asn His Leu Asp Met Arg Ser Lys Asp Val Leu Lys 
545 550 555 560 

Glu Ala He Lys Asn Phe Asp Gly Thr Val He Val Val Ser His Asp 

565 570 575 

Arg Glu Phe Leu Asp G] y Leu Val Ser Lys Val Tyr Glu Phe Ala Asp 

580 sas 590 

Gly Gin Val Asn Glu His Leu Gly Gly He Tyr Asp Tyr Leu Arg Thr 

595 eoo 605 

Arg Arg Met Gin Thr Leu Thr Glu Leu Glu Arg Thr Thr Thr He Glu 

6 10 615 620 

Thr Lys Thr Thr Arg Glu Ala He Pro Glu Thr Glu Ala Lys Ala Asp 
625 630 6 35 640 

Tyr Arg Arg Gin Lys Glu Val Ala Lys Gin Leu Arg Thr Leu Glu Aro 

^ , 645 650 6S5 

Thr Val Ala Thr Cys Glu Glu Arg He Gly Lys Leu Glu Ser Glu Leu 

660 665 670 

Gin Ala He Glu Met Leu Leu Gin Asp Pro Lys His Ala Thr Asp Ala 

675 680 685 

Asn Leu Phe Glu Arg Tyr Ala Gly llet Lys Gin Glu Leu Glu Lys Ala 

690 695 700 

Mot Glu Asp Trp Glu Gin Ala Ser Glu Ala Leu Ser Glu Ala Gin G]y 
705 ^lo 715 720 

12) INFORMATION FOR SEQ ID IIO:27 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE; amino acid 
{D) TOPOLOGY: linear 
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tii] MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

5 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

<ix> FEATURE: 

(A) NAME/KEY: misc_featUK 
10 <B> LOCATION 1...3B6 

(xJ ) SEQUENCE DESCRIPTION: SEQ ID NO:279 

Tyr Leu Lys Leu Leu lie Leu Gin lie Ala Leu HeL Asn Phe Leu Lvs 
15 1 5 XO 15 

Lys Glu Pro Phe Lys lie Phe Ser Met lie Tyr Leu Leu Leu Asp Thr 

20 25 30 

He Thr Asn Arg Ala Gly Thr Glu Arg Ala Val lie Asn Leu Ala Asn 

35 40 45 

Asn Leu His Ala Asn Gly His Arg Val Ser L=u Val Ser Val Cys Thr 

SO 55 60 

Lys Glu Gly Glu Pro Ser Phe Gin Val Glu Lys Gly lie Glu Val His 
65 70 75 80 

His Leu Gly He Arg Leu Tyr Gly Asn Ala Lsu Ala Arg Lys Thr Val 
25 es 90 95 

Tyr Phe Lys Ala Tyr Arg Arg He Lys Ala Leu Tyr Lys Lys Arg Glu 

100 10S no 

Pro Val Leu Leu He Gly Thr Asn He Phe He Asn Thr He Leu Ser 

115 120 175 

Gin He Ser Asn Arg Gly Arg He Phe Thr He Gly Cys Glu His II- 

130 135 140 

Ser Tyr Asp He Ala Arg Pro He Thr Lys Arg He Arg Gly Phe Leu 
145 150 155 160 

Tyr Ser Gly Leu Aep Ala Val Val Ala Leu Thr Lys Arg Asp Gin Gin 
ob 165 170 175 

Ser Phe Glu Ala He Leu Arg Gly Arg Ser Lys Ala Tyr Val He Pro 

ISO 185 190 

Asn Gin Val Ser Phe Thr Thr Val Gin Arg Asp Ala Thr Thr His Lys 
195 200 205 

W Gin Net. Leu Ala He Gly Arg Leu Thr Tyr Gin Lys Gly Phe Glu Phe 
210 215 220 

Met. He Glu Asp Ala Ser Arg Val Leu Arg Glu Arg Pro Asp Trp Lys 
225 230 235 240 

Leu Ho He Val Gly Asp Gly Glu Asn Glu Ser Met Leu Arg Lys Glu 
45 245 250 255 

Tie Ala Ser Arg Asn Met Glu Ser Gin He Glu He His Pro Ser Thr 

260 265 270 

Pro Glu Ho Arg Lys Tyr Tyr Glu Ser Ser Ala He Tyr Leu Met Thr 

275 280 285 

Ser Arg Phe Glu Gly Leu Pro Met Val Leu Leu Glu Ala Glu Ala Tyr 

290 295 300 

Ala Leu Pro He He Ser Tyr Asp Cys Pro Thr Gly Pro Arg Glu Leu 
305 310 315 320 

He Glu Asn Gly Arg Asn Gly Phe Leu Val Pro Met Glu Ala His Glu 
Dij 325 330 335 

Asp Phe Ala Asp Lys Leu Arg Leu Leu Met Asp Asp Glu Thr Leu Arg 
"\~ 340 345 350 - 

Lys Lys Met Gly Gin Glu Ser Glu Leu Met Val Lys Ser Tyr Ser Pro 

35b 360 365 

Ala Asn He Tyr Glu Cys Trp Lys Lys Leu Phe Val Glu He Gly Tyr 

370 375 360 

Met Asn 
3BS 

65 (2) Hi FORMAT I OH FOR SEQ ID MO: 280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 Amino acids 

(B) TYPE: amino acid 
( D) TOFOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 



50 



60 



70 



75 
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10 



20 



(vi» ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivaUs 

(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 
<B) LOCATION 1. . .655 

(xi) SEQUENCE DESCRIPTION: SEQ ID UO:2B0 

Thr Thr Lys Glu Asn Glu Thr Thr Thr Lye Asn Glu Tyr Arg He Me*- 
1 5 10 15 

Gly Lys He He Gly He Asp Lsu Gly Thr Thr Asn Ser Cys Val S-r 

20 25 30 

Val Leu Glu Gly Asn Glu Pro He Val He Thr Asn Ser Glu Gly Lys 
lO 35 4Q 45 

Arg Thr Thr Pro Ser Val Val Ala Phe Val Asp Gly Gly Glu Arg Lys 

50 55 60 

Val Gly Asp Pro Ala Lys Arg Gin Ala He Thr Asn Pro Thr Lys Thr 
65 7 0 7b 80 

He Tyr Sor He I.y- Arg Phe Met Gly Glu Thr Tyr Asp Gin Val Ser 

85 90 95 

Arg Glu Val Glu Arg Val Pro Phe Lys Val Val Arg Gly Asp Asn Asn 

100 X05 no 

Thr Pro Arg Val Asp He Asp Gly Arg Leu Tyr Thr Pro Gin Glu He 
" 115 120 125 

Ser Ala I let He Leu Gin Lys MeL Lys hys Thr Ala Glu Asp Tyr Leu 

130 135 140 

Gly Gin Glu Val Thr Glu Ala Val He Thr Val Pro Ala Tyr Ph- Asn 
14b ISO 155 160 

Asp Ala Gin Arg Gin Ala Thr Lys Glu Ala Gly Glu He Ala Gly Leu 

165 no 175 

Lys Val Arg Arg He Val Asn Glu Pro Thr Ala Ala Ser Leu Ala Tyr 

180 185 190 

Gly Leu Asp L}s Ser Asn Lye Asp M~t Lys He Ala Val Phe A^p Leu 

195 200 205 

Gly Gly Gly Thr Phe Asp He Ser He Leu Glu Leu Gly Asd Gly Val 

210 215 220 

Phe Glu Val Lys Ser Thr Asn Gly Asp Thr His Leu Gly Gly Asp Asp 
225 Z30 235 240 

Fhe Asp Hi a Val He He Asp Trp Leu Ala Glu Glu Phe Lys Ser Gin 

245 250 255 

Glu Gly Val Asp Leu Arg Gin Asp Pro Wet Ala Mot Gin Arg Leu Ly*. 

260 265 270 

Glu Ala Ala Glu Lys Ala Lys He Glu Leu Ser Ser Thr Ser S~r Thr 
™ 275 280 285 

Glu He Asn Leu Pro Tyr He MeL Pro Val Asn Gly He Pro Lys His ■ 

230 29b 300 

Leu Val Hit Thr Leu Thr Arg Ala Lys Phe Glu Gin Leu Ala Asp Arg 
305 310 315 320 

Leu lie Gin Ala Cys Val Ala Pro Cye Glu Thr Ala Leu Lys Asp Ala 

325 330 335 

Gly Met Ser Arg Gly Asp He Asp Glu Val He Leu Val Gly Gly Ser 

340 345 350 

Thr Arg He Pro Ala He Gin Glu Ha Val Glu Lys He Phe Gly Lys 

355 360 365 

Ala Pro Ser Lys Gly Val Asn Pro Asp Glu Val Val Ala Val Gl" Ala 

3 7 0 375 380 

Ala He Gin Gly Gly Val Leu Thr Gly Glu Val Lys Asp Val Leu Leu 
38b 3 90 305 400 

Leu Asp Val Thr Pro Leu Ser Leu Gly He Glu Thr Met Gly Gly Val 

405 410 415 

Met Thr Arg Lou He Asp Ala Asn Thr Thr He Pro Thr Lys Lys Ser 

420 425 430 

Glu He Phe Thr Thr Ala Val Asp Asn Gin Pro S=r Val Glu He His 
DO 435 4 4 o 445 

Val Lsu G3n Gly Glu Arg Ser Leu Ala Lys Asp Asn Lys Ser He Gly 

450 455 460 

Arg Phe Asn Leu Asp Gly He Ala Pro Ala Pro Arg Gin Thr Pro Cln 
?f 5 470 475 480 

He Glu Val Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Thr 

485 400 495 

Ala His Asp Lys Ala Thr Gly Lys Lys Gin Asn He Arg He Glu Ala 

500 505 sio 

Ssr Ser Gly Leu Ser Asp Asp Glu He Lys Arg Met Lys Glu Glu Ala 
515 520 525 



30 



35 



40 



50 



55 



60 



70 



75 
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35 



40 



Gin Ala Asn Ala Glu Ala Asp Lys Lys Glu Lys Glu Arg He Asp Lys 

530 535 540 

He Asn Gin Ala Asp Ser Met He Phe Gin Thr Glu Lys Gin Leu Lys 
54 5 550 555 560 

Glu Leu Gly Asp Ly* Phe Pro Ala Asp Lys Lys Ala Pro He Asp Thr 



Ala Leu Asp Lys Leu Lys Glu Ala His Lys Ala Gin Asp Val Ala A3 a 

SGO 585 590 

He Arp Thr Ala I let Ala Glu Leu Gin Thr Ala Leu Ser Ala Ala Gly 
10 595 600 605 

Glu Glu Leu Tyr Lys Asn Ala Gly Ala Ala Gin Gly Gly Ala Gin Pro 

610 615 620 

Gly Pro A3p Phe Gly Gly Ala Gin Gly Pro Ser Ala Gly Asp Gin Pro 
625 630 635 640 

15 Ser Asp Asp Lys Asn Val Thr Asp Val Asp Phe Glu Glu Val Lys 
645 650 655 

(2) IN FORMAT T OH FOR SEQ ID NO: 281 

20 <i| SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 4 67 amino acids 
(B> TYPE: amino acid 
ID) TOPOLOGY: linear 

25 (lij MOLECULE" TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi] ORIGINAL SOURCE: 
30 (A) ORGAN TS; I: Porphyromonas gingivalis 

(Ix) FEATURE : 

(A) NAME/ KEY: miscjeature 
{ B ) LOCATION 1 ... 4 67 



txi) SEQUENCE DESCRIPTION: SEQ ID NO! 281 



Lys 


Trp 


Ala 


Arg 


Thr 


Thr 


Thr 


Leu 


Arg 


He 


Ser 


Asn 


Ala 


Thr 


He 


Tyr 


1 








5 










10 










15 


Lys 


Thr 


Met 


Arg 


Tyr 


Asp 


Leu 


Ala 


He 


He 


Gly Gly Gly 


Pro 


Ala 


Gly 








20 










25 










30 




Tyr 


Thr 


Ala 


Ala 


Glu 


Arg 


Ala 


Ala 


Lys 


Gly Gly Leu 


Lys 


Thr 


Leu 


Leu 






35 










40 










45 








He 


Glu 


Lys 


Asn 


Ala 


Leu 


Gly 


Gly Val 


Cys 


Lou 


Asn 


Glu 


Gly 


Cys 


He 




50 










5b 










60 








Pro 


Thr 


Lys 


Thr 


Leu 


Leu 


Tyr 


Ser 


Ala 


Lys 


Val 


Leu 


His 


Gin 


He 


Ala 


65 










70 








75 










80 


The 


Ala 


Ser 


Lys 


Tyr 


Ala 


Val 


Ser 


Gly Thr Ala 


Af=p Gly 


Leu 


Asp 


Leu 










85 










90 










95 




Gly 


Lys 


Val 


He 


Ala 


Arg 


Lys 


Gly Lys 


He 


He 


Arg 


Lys 


Leu 


Thr 


Ala 








100 










105 








110 






Gly 


He 


Arg 


Ser 


Arg 


Leu 


Thr 


Glu 


Ala 


Gly Val 


Glu 


Met 


Val 


Thr 


Ala 


Glu 




115 










120 










125 








Ala 


Thr 


Val 


Thr 


Gly 


Cys 


Asp Ala Asp Gly He 


He Gly He 


Thr 




130 










135 










140 










Ala 


Gly 


Glu 


Ala 


Gin 


Tyr 


Lys 


Ala 


Ala 


Asn 


Leu 


Leu 


Leu 


Cys 


Thr 


Gly 


145 










150 










155 








160 


Ser 


Glu 


Thr 


Phe 


He 


Pro 


Pro 


He 


Pro 


Gly Val 


Glu 


Gin 


Thr 


Glu 


Tyr 










165 










170 










17 5 




Tip 


Thr 


Aen 


Arg 
180 


Glu 


Ala 


Leu 


Gin 


Asn 
185 


Lys 


Glu 


He 


Pro 


Thr 
190 


Ser 


Leu 


Val 


He 


He 


Gly 


Gly 


Gly 


Val 


He 


Gly Met 


Glu 


Phe 


Ala 


Sec 


Phe 


Phe 






195 










200 










205 








Asn 


Gly 


He 


Gly 


Thr 


Gin 


Val 


His 


Val 


Val 


Glu 


Met 


Leu 


Pro 


Glu 


He 




210 










215 










220 








Leu 


Asn 


Gly 


He 


Asp 


Pro 


Glu 


His 


Ala 


Ala 


Met 


Leu 


Arg 


Ala 


His 


Tyr 


275 










230 










235 








240 


Glu 


Ly E 


Glu 


Gly 


He 


Lys 


Phe 


Tyr 


Leu 


Gly 


His 


Lys 


Val 


Thr 


Ser 


Val 










245 










250 








255 




Arg 


Asn 


Gly 


Ala 


Val 


Thr 


Val 


Glu 


Tyr 


Glu 


Gly Glu 


Ser 


Lys 


Glu 


11 s 








260 










265 










270 






Glu 


Gly 


Glu 


Arg 


He 


Leu 


Met 


Ser 


Val 


Gly Arg 


Arg 


Pro 


Val 


Leu 


Gin 






27 5 










280 










285 








Gly 


Phe 


Glu 


Ser 


Leu 


Gly 


Leu 


Val 


Leu 


Ala 


Gly 


Lys 


Gly Val 


Lys 


Thr 




290 










295 










300 









565 570 575 " . &-\fj\ V 
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Asn Glu Arg Met Gin Thr Ser Leu Pro Asn Val Tyr Ala Ala Gly Asp 
*? 5 3 10 315 320 

He Thr Gly Phe Ser Leu Leu Ala His Thr Ala Val Arg Glu Ala Glu 

325 330 335 

Val Ala Val Asp Gin He Leu Gly Lys Thr Asp Glu Thr Met Ser Tyr 

340 345 350 

Arg Ala Val Pro Gly Val Val Tyr The Asn Pro Glu Val Ala Gly Val 

355 360 365 

Gly Glu Thr Glu Glu Ser Leu Arg Lye Ala Gly Arg Ala Tyr Thr Val 

370 37b, 3B0 

Arg Arg Leu Pro Met Ala Phe Ser Gly Arg Phe Val Ala Glu Asn Glu 
385 390 395 4 oo 

Gin Gly Asn Gly Glu Cys Lys Leu Leu Leu Asp Glu Glu Asn Arg Leu 

405 410 415 

He Gly Ala His Leu He Gly Asn Pro Ala Gly Glu Leu lie Val Thr 

420 4 2 5 430 

Ala Ala Met Ala He Glu Thr Gly Met Thr Asp Arg Gin He Glu Arg 

435 4 4o 445 

He He Ph~ Pro His Pro Thr Val Gly Glu He Leu Lys Glu Thr Leu 
455 4 6 o 

Ala Gly Gly 
465 

(2) INFORMATION FOR SEQ ID HO: 282 

(1) SEQUENCE CHARACTERISTICS; 

<A> LENGTH: 945 amino acids ' 
<BJ TYPE: amino acid 
(D) TOPOLOGY; linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: miscjeature 

(B) LOCATION 1...945 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO:282 

Pro Lye He Leu Met Glu Leu Lys Arg Phe Leu Ser Leu Gly Leu Leu 

r 5 10 15 

Leu Val Gly Phe He Pro Met Lys Leu Ser Ala CI n Gin Ala Gin Pro 

20 25 30 

Leu Pro Thr Asp Pro Ala Val Arg Val Gly Lys Leu Asp Asn Gly Leu 

35 4 0 4S 

Thr Tyr Phe He Arg His Asn Glu Asn Pro Lys Asp Arg Ala Asp Ph» 

50 55 60 

Phe He Ala Gin Lys Val Gly Ser He Leu Glu Glu Asp Ser Gin Ser 



65 7Q 



75 00 



Gly Leu Ala His Phe Leu Glu His Met Ala Phe Asn Gly Thr Lys Asn 

85 9 o Q 5 

Phe Pro Gly Lys Asn Leu He Asn Tyr Leu Glu Thr He Gly Val Arg 

100 105 no 

Phe Gly Gin Asn Leu Asn Ala Ser Thr Gly Phe Asp Lys Thr Glu Tyr 
^ _ 115 120 125 

Thr He Met Asp Val Pro Thr Thr Arg Gin Gly He He Asp Ser Cys 

13 <> 135 140 

Leu Leu He Leu His Asp Trp Ser Asn Asn He Thr Leu Asp Gly His 
i? , 150 155 160 

Glu He Asp Glu Glu Arg Gly Val He Gin Glu Glu Trp Arg Ala Arg 

165 170 i7S 

Arg Asp Ala Asn Leu Arg Met Phe Glu Ala He Leu Ala Lys Ala M-t 

loo 185 190 

Pro Gly Asn Lys Tyr Ala Glu Arg Met Pro He Gly Leu Met Asp Val 

195 200 205 

Val Leu Asn Phe Lye His Asp Glu Leu Arg Asn Tyr Tyr Lys Lys Trp 

21° 215 220 

Tyr Arg Pro Asp Leu Gin Gly Leu Val He Val Gly Asp He Asp Val 
230 235 240 

Asp Tyr Val Glu Asn Lys He Lys Glu Leu Phe Lya Asp Val Pro Ala 
245 250 255 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Pro Val Asn Pro Ala Glu Arg Tie Tyr Thr Pro Val Clu Asp Asn Asp 

260 265 270 

Glu Pro Tit Val Ala lie Ala Thr Asp Ala Glu Ala Thr Thr Thr Gin 

275 280 285 

Leu Ser lie Ser Phe Lys Ser Asp Pro Thr Pro Gin Glu Val Arg Gly 

290 295 30O 

Ser He Phe GJy Leu Val Glu Asp Tyr Met Lys Gin Val He Thr Thr 
305 310 315 320 

Ala Val Asn Glu Arg Leu Ser Glu He Thr His Lys Pro Asn Ala Pro 

325 330 335 

Phe Leu Ser Ala Gly Ala Phe Phe Ser Asn Phe Met Tyr He Thr Gin 

340 345 350 

Thr Lys Asp Ala Phe Asn Phe Val Ala Thr Val Arg Glu Gly Clu Ala 

355 360 365 

Glu Lys Ala Het Asn Ala Leu Val Ala Glu He Glu Ser Leu Arg Gin 

370 375 360 

Phe Gly He Thi Lys Gly Glu Tyr Asp Arg Ala Arg Thr Asn Val L-u 
385 390 395 400 

Lys Arg Tyr Glu Asn Gin Tyr Asn Glu Arg Asp Lys Arg Lys Asn Asn 

405 410 4i5 

Ala Tyr Ala Asn Glu Tyr Ser Thr Tyr Phe Thr Asp Gly Gly Tyr lie 

420 425 430 

Pro Gly He Glu Val Glu Tyr Gin Thr Val Asn Ala Phe Ala Pro Gin 

435 440 445 

Val Pro Leu Glu Ala Phe Asn Gin Ala He Ala Gin Met He Asp Pro 

450 455 460 

Val T.ys Asn Ala Val Val Thr Leu Thr Gly Pro Ser Lys Ala Glu Ala 
465 470 475 4 80 

Lys He Pro Ser Glu Ala Asp Phe Leu Ala Ala Phe Lys Ala Ala Arg 

4B5 490 495 

Gin Gin Lys Val Glu Ala Lys Lys Asp Glu Val Ser Asp Gin Lys L»u 

500 505 510 

Met Glu Lys Ala Pro Lys Ala Gly Lys He Val Ser Glu Lys Lys Asp 

515 520 525 

Gin Lys Phe Gly Thr Thr Glu Leu Thr Leu Ser Asn Gly He Lys Val 

530 535 5.JO 

Tyr Leu Lys Lys Thr Asp Phe Lys Ser Asn Glu He Leu Met Ser Ala 
545 SS0 555 560 

Leu Scr Pro Gly Gly He Leu Ser Gly Lys His Ala Pro Asn Gin Ser 

565 570 575 

Val Met Asn Ser Phe Met Asn Val Gly Gly Leu G1 y Asn Phe Asp Ala 

580 585 590 

He Gin I.eu Asp Lys Val Leu Thr Gly Arg Ser Ala Ser Val Ser Pro 

595 600 605 

Ser Leu Ser Leu Leu Ser Glu Gly Leu Ser Gly Lys Thr Thr Val Glu 

610 615 620 

Asp Met Glu Thr Phe Phe Gin Leu He Tyr Leu Gin Met Thr Ala Asn 
625 630 635 640 

Arg Lys Aep Pro Glu Ala Phe Lys Ala Thr Gin Glu Lys Leu Tyr Asn 

645 650 655 

Asn Leu Lys Asn Gin Glu Ala Asn Pro Met Ala Ala Leu Met Asp Ser 

660 665 670 

He Arg His Thr Met Tyr Gly Asp Asn Pro Met Met Lys Pro Het Lys 

675 680 60S 

Ala Ala Asp Val Glu Lys Val Asn Tyr Asp Gin Val Met Ala Ph- Tvr 

690 695 700 

Asn Glu Arg Phe Ala Asp Ala Gly Asp Phe Met Phe Phe Phe He Glv 
705 710 715 720 

Asn Leu Asp Glu Ala Lys Met Lys Pro Leu He Glu Thr Tyr Leu Ala 
e , 725 730 735 

Ser Leu Pro Asn Leu Lys Arg Gly Asp Lys Met Asn Lys Ala Gin Val 

7 40 745 750 

Pro Ala Ala Arg Ser Gly Lys He Asp Cys Lys Phe Glu Lys Glu Met 

755 760 765 

Asp Thr Pro Ser Thr Thr He Phe Asp Val Val Ser Gly Asn Val Glu 

770 775 780 

Tyr Thr Leu Lys Asn Ser Leu Leu Leu Glu Val Phe Ser Ala Val Met 
785 i 790 795 800 

Asp Gin Val Tyr Thr Ala Thr Val Arg Glu Lys Glu Gly Gly Ala Tyr 

805 810 ei5 

Ser Val Ala Ala Phe Gly Gly Leu Glu Gin Tyr Pro Gin Pro Lys Ala 

B20 825 830 

Leu Met Gin He Tyr Phe Pro Thr Asp Pro Ala Arg Ala Glu Glu Met 

835 840 845 

Asn Ala He Val Phe Ala Glu Leu Glu Lys Leu Ala Lys Glu Gl- Pro 
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850 










B55 










860 










Asn 


Val 


Glu 


Tyr 


Phe 


Lys 


L>s 


Thr 


He 


Glu 


Asn 


Leu 


Asn 


Ly3 


Gin 


His 


865 










870 










875 








880 


Lys 


Glu 


Ser 


Leu 


Arg 


Glu 


Asn 


Arg 


Ph- 


Trp 


Leu 


Glu 


Ala 


Met 


Lys 


Ala 










88b 










890 










895 




Ser 


Phe 


Phe 


Glu 


Gly Asn 


Asp 


Phe 


Ile 


Thr 


Asp 


Tyr 


Glu 


.Ser 


Val 


Leu 








900 










905 






910 






Asn 


Gly 


Leu 


Thr 


Pro 


Ala 


Glu 


Leu 


Gin 


Lys 


Phe 


Ala 


Ala 


Asp Leu 


Leu 






915 










920 










92 b 








Lys 


Gin 


Gin 


Asn 


Arg 


Val 


Val 


Val 


Mot 


Ilet 


Ala 


Pro 


Val 


Ala 


Lys 


Ala 




930 










935 










940 









Gin 



(2) INFORMATION FOR SEQ ID NO: 283 

< 1 > SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 ami no acids 
(B» TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonae gingivalis 

<ix) FEATURE: 

(A) HAftE/KEY: misc_f eature 

(B) LOCATION 1...68 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 



Tyr 


Thr 


Men 


Ser 


Lys 


Lys 


Gly 


Thr 


Ho 


Gly Val 


Thr 


Ser Asp Asn 


He 


1 








5 










10 








15 




Phe 


Pro 


Val 


11 = 


Lys 


Lys 


Phe 


Leu 


Tyr 


Ser 


Asp 


His 


Glu 


He Phe 


Leu 








20 










25 








30 




Arg 


Glu 


He 


Val 


Ser 


Asn 


Ala 


Val 


Asp 


Ala 


Thr 


Gin 


Lys 


Leu Lys 


Thr 






35 










40 










45 




Leu 


Thr 


Ser 


Val 


Gly 


Glu 


Phe 


Lys Gly Glu 


Thr 


Gly Asp Leu Arg 


Val 




50 










55 










60 








Thr 


Val 


Ser 


Val 


Asp 


Glu 


Val 


Ala 


Arg 


Thr 


He 


Thr 


Val 


Ser Asp 


Arg 


65 










70 










75 






80 


Gly Val 


Gly 


Met 


Thr 


Glu 


GlU 


Glu 


Val 


Glu 


Lys 


Tyr 


He 


Arn Gin 


He 










B5 










90 








95 




Ala 


Phe 


Ser 


Ser 


Ala 


Glu 


GlU 


Phe 


Leu 


GlU 


Lys 


Tyr 


Lys 


Asp Asp 


Lys 








100 










105 








110 




Ala 


Ala 


lis 


He 


Gly 


His 


Phe 


Gly 


Leu 


Gly 


Phe 


Tyr 


Ser 


Ala Phe 


Met 


Val 




115 










120 








125 






Ser 


Glu 


Arg 


Val 


Asp Vol 


He 


Thr 


Arg 


Ser 


Phe Arg Glu Asp Ala 




130 










135 










140 








Thr 


Ala 


Val 


Lys 


Trp 


Ser 


Cys 


Asp 


Gly Ser 


Pro 


Glu 


Tyr 


Thr Leu 


Glu 


145 










150 










155 






160 


Pro 


Ala Asp 


Lys 


Ala 


Asp 


Arg 


Gly Thr 


Asp 


He 


Val 


Met 


His He 


Asp 










165 










170 








175 


Glu 


Glu 


Asn 


Ser 
180 


Glu 


Phe 


Leu 


Lys 


Lys 
185 


Glu 


Lys 


He 


Clu 


Gly Leu 
190 


Leu 


Gly Lys Tyr 


Cys 


Lys 


Phe 


Leu 


Thr 


Val 


Pro 


He 


He 


Phe 


Gly Lys 


Lys 






195 










200 










20b 




Gin 


Glu 


Trp 


Lys 


Asp 


Gly 


Lys 


Met 


Gin 


Asp 


Thr 


Asp 


Glu 


Asp Asn 


Gin 




210 










215 








220 






He 


Asn 


Afcp 


Thr 


His 


Pro 


Ala 


Trp 


Thr 


Lys 


Lys 


Pro 


Ala 


Asp Leu 


Lj s 


225 










230 










235 






240 


Asp Glu Asp 


Tyr Lys 


Glu 


Phe 


Tyr Arg 


Ser 


Leu 


Tyr 


Pro 


Met Ser 


Glu 










245 










250 








255 




Glu 


Pro 


Leu 


Phe 


Trp 


He 


His 


Leu 


Asn 


Val 


Asp 


Tyr 


Pro 


Phe Asn 


Leu 








260 










265 








270 




Thr 


dy 


He 


Leu 


Tyr 


Phe 


Pro 


Lys 


He 


Lys 


Asn 


Asn 


Leu 


Asp Leu 


Gin 






275 










280 










285 




Arg 


Asn 


Lys 


He 


Gin 


Leu 


Tyr 


Cys 


Asn 


Gin 


Val 


Tyr 


Val 


Thr Asp 


Glu 




290 










295 










300 




Val 


Gin 


Cl- 


He 


Val 


Pro 


Asp 


Phe 


Leu 


Thr 


Leu 


Leu 


His 


Gly Val 


He 


305 










310 










315 






320 


Asp 


Ser 


Pro 


Asp 


He 


Pro 


Leu 


Asn 


Val 


Ser 


Arg 


Ser 


Tyr 


Leu Gxn 


Ser 
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10 



15 



20 



25 



30 



35 



40 



45 



55 



60 



65 



70 



75 











325 










330 










335 




Asp 


Ala 


Asn 


Val 
340 


Lys 


Lys 


He 


Ser 


Ser 
345 


His 


He 


Thr 


Lys 


Lys 
350 


Val 


Ala 


Asp 


Arg 


L-u 


Glu 


Glu 


He 


Phe 


Lys 




Asp Arg 


Pro 


Thr 


Phe 


Glu 


Glu 






355 










360 










365 








Lyr 


Trp 


Asp 


Ser 


Leu 


Lys 


Leu 


Phe 


Val 


Glu 


Tyr 


Gly Met 


Leu 


Thr 


Asp 




370 










375 










380 










Glu 


Lys 


Phe 


Tyr 


Glu 


Arg 


Ala 


Ala 


Lys 


Phe 


Phe 


Leu 


Phe 


Thr Asp Met 


3RS 










390 








39S 










400 


Asp 


Gly 


Hie 


Lys 


Tyr 
405 


Thr 


Phe 


Asp 


Glu 


Tyr 
410 


Arg 


Thr 


Leu 


Val 


Glu 
415 


Gly 


Val 


Gin 


Thr 


Asp 


Lys 


P 


i»j.y 


Gin 


Val 


VqI 


Tyr 


Leu 


Tyr Ala 


Thr Aap 








420 










425 










430 






Lys 


His 


Gly 


Gin 


Tyr 


Ser 


His 


Val 


Lys 


Arg 


Ala 


Ser Asp 


Lys 


Gly Tyr 






4 35 










44 0 










445 








Ser 


Val 


Met 


Leu 


Lau 




Gly 


Gin 


_ 


Asp 


Pro 


His 


He 


Val 


Ser 


Leu 




450 










455 








4 60 










Lou 


Glu 


Gin 


L 


Lsu 


Glu 


L>s 


Thr 


His 


Phe 


Val 


Arg 


Val 


Asp 


Ser 


Asp 


465 










470 










475 










480 


Thr 


lie 


Asn 


Asr 


Leu 


I1g 




Vs 


Glu 


Glu 


Arg Ala 


Glu 


Val 


Lys 


Leu 










4 85 










490 










4 95 




Ser 


Asp 


Thr 


Glu 


Arg 


Ala 


Thr 


Leu 


Val 


Lys 


Leu 


Phe 


Glu 


Ala 


Arg 


Leu 








500 










505 










510 




Pro 


Arg 


Asp 


Glu 


Lys 


Lys 


His 


Phe 


Asn 


Val 


Ala 


Fht 


Glu 


Ser 


Leu 


Gly 






51 S 








520 










525 






Ala 


Glu 


Gly 


Glu 


Ala 


He 


Leu 


He 


Thr 


Gin 


Ala 


Glu 


Phe 


Met 


Arg Arg 




530 










535 










540 










Met 


Arg 


Asp 


I let 


Ala 


Gin 


Leu 


Gin 


Pro 


Gly Het 


Ser 


Phe 


Tyr 


Gly 


Glu 


545 










55 0 










555 










560 


Leu 


Pro 


Asp 


Ser 


Tyr 

565 


Asn 


Leu 


Val 


Leu 


Asn 
57 Q 


Thr 


Asp 


His 


Pro 


Leu 
S7 5 


lie 


Asp 


Arg 


Val 


Leu 

580 


Ser 


Gly 


Glu 


Lys 


Glu 
565 


Ser 


Val 


Glu 


Pro 


Ser 
590 


Leu 


Thr 


Glu 


Leu 


Arg 


Ala 


Lys 


Tie 


Ala 


Glu 


Leu 


Lys 


Ala 


Glu 


Glu 


Ala 


Lys 


Leu 






595 










600 










605 






Leu 


Asp 
610 


Glu 


Glu 


Lys 


Gly 


Lys 
615 


Lys 


Pro 


Glu 


Glu 


He 
620 


Pro 


Val 


Ala 


Thr 


Lye 


Glu 


Ala 


T.ys 


Glu 


Asn 


Asn 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


Thr 


Glu 


Gly 


625 










630 










635 










64 0 


Ser 


He 


Asn 


Asp 


Gin 


Leu 


Thr 


Lys 


Tyr 


Ala 


Gin Asp Asn 


Glu 


Leu 


He 










64 5 










650 










655 




Gly 


Gin 


Leu 


lie 


Asp 


Leu 


Ala 


Leu 


Leu 


Gly Ser 


Gly Leu 


Leu 


Thr 


Gly 








660 










665 










670 




Glu 


Ala 


Leu 
675 


Ala 


Glu 


Phe 


He 


Arg 
660 


Arg 


Ser 


Gin 


Arg 


Leu 
685 


Leu 







(2) INFORMATION FOR SEQ ID MO: 284 



(l) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 4 82 amino acids 

(BJ TYPE: amino acid 
<D) TOPOLOGY: linear 



(il> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix> FEATURE: 

(A) NAME/ KEY : miscfeature 
(6 ) LOCATION 1...4 82 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

Asp Tie Arg Gin Lys Arg Pro Cys Phe Asn Ala Asn Leu Tyr Phe Tyr 

15 10 15 

Arg Ala Met Glu Lys Leu He Asp He Leu Val Val Asp Asp Asp Val 

20 25 30 

Ala Val Cys Ala Ala Leu Arg Leu Val Leu Lys Arg Ala Gly Tyr Asn 

35 40 45 

Pro Val He Ala Asn Ser Pro Asp Glu Ala Leu Ser He Met Arg Asn 

50 55 60 

Pro Asp Gly Gly Cys Lys Pro Ala Val He Leu Het Asp Het Asn Phe 
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65 7 0 75 SO 

Ser Leu Scr Thr Ser Gly Arq Glu Gly Leu Glu Leu Leu Glu Lys Met 

65 90 95 

Gin lie Phe Thr Sex Cys Pro Val He Leu Met Thr Ala Trp Ala Ser 

100 lob no 

Ho Pro Lou Ala Val Glu Gly Met Arg Leu Gly Ala Phe Asp Phe He 

115 120 125 

Gly Lys Pro Trp Asp Asn Asp Arg Leu Leu Arg Thr He Asp Thr Ala 

130 135 140 

Lsu His Leu Ala Ala Pro Ser Ala Val Ala Asn Pro Ser Glu Gin Ser 
145 150 155 160 

Asp Arg Asp Thr Ala Arg Gin Pro Lys Ala Thr Val Gin Glu Asn Asp 

165 170 175 

Pro Cys Ala His II- He Gly Arg Ser Asp Ala He Cys Lys He Lys 

180 185 iso 

Glu Arg He Arg Arg He Ala Pro Thr His Ala Ser Val Leu II- Thr 

195 200 205 

Gly Glu Ser Gly Thr Gly Lys Glu Leu He Ala Glu Ala Leu His Arg 

210 21S 220 

Gly Ser Lys Arg Ala Ser Ala Pro The Val Lys Val Asn Leu Glv Glv 
225 230 235 240 

lie Pro Glu Ser Leu Phe Glu Ser Glu Leu Phe Gly His Lys l.ya Gly 

?45 250 255 

Ala Phe Thr Asn Ala Phe Ser Asp Arg Lys Gly Arg Phe Glu Leu Ala 

260 265 270 

Asp Gly Gly Thr He Phe Leu Asp Glu He Gly Glu Leu Pro Val Gly 

275 280 28b 

Asn Gin Val* Lys Leu Leu Arg Val L«u Gin Glu Gin Thr Phe Glu Pro 

290 295 300 

Leu Gly Glu Ser Val Ser Ills Arg Val Asp He Arg Val Val Scr Ala 
305 310 315 320 

Thr Asn Ala Ser Leu Glu Arg Met Val Ala Glu Gly Arg Phe Arg Glu 

325 330 335 

Asp Leu Tyr Tyr Arg He Asn Leu He His Leu His Leu Pro Pro Leu 

340 345 350 

Arg Glu Arg Gin Glu Asp He Gin Leu Leu Val Glu Ala Phe Ser Glu 

355 360 365 

Ala phe Ala Gin Ser Asn Gly Leu Pro His Ala Val Trp Ser Ala Glu 

370 375 380 

Ala Met Arg Arg He Cys Ala Met Pro Leu Pro Gly Asn Val Arq Glu 
305 390 395 400 

Leu Lys Asn Val Val Glu Arg Thr Leu Leu Leu Ser Gly Ser Arg Glu 

405 4io 
He Ser Ala Arg Asp Val Ala Asp Phe Gly Ser Gin Val Thr Ala Ala 

420 4?5 430 

Asp His ser Asp Glu Arg Ala Leu Thr Asp Uet Glu Glu Ala Ala He 

435 440 445 

Arg Glu Thr Leu Thr Lys Tyr Asn Gly Asn Val Ser Arg Ala Ala Arq 

450 455 460 

Ala Leu Gly Leu Ser Arg Ala Ala Leu Tyr Arg Arg Met Glu Lys Tyr 

lf S r 470 475 480 

Gly Leu 



(2) INFORMATION FOR SEQ ID NO: 285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

[A} ORGANISM: Porphyromonas gingival! a 

(ix) FEATURE: 

(A) HA1-1E/ KEY : misc_f oature 
IB) LOCATION 1 . . .263 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 

\rg Set Leu Gin Ser Phe Gin Asn Lys Lys His Ser Ser Met L 
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15 10 15 

lie Lys Asn Leu His Ala Thr Val Gin Gly Lys Glu He Leu Lys Gly 

20 25 30 

He Asn Leu Glu He Asn Ala Gly Glu He His Ala He Met Gly Pro 
5 35 40 45 

Asn Gly Ser Gly Lys Ser Thr Leu Ser Ser Val Leu Val Gly His Pro 

50 55 60 

Ser Phe Glu Val Thr Glu Gly Glu Val Thr Phe Asn Gly He Asp Leu 
65 70 75 80 

10 Leu Glu Leu Glu Pro Glu Glu Arg Ala His Leu Gly Leu Phe Leu Ser 
85 90 95 

Phe Gin Tyr Pro Val Glu lie Pro Gly Val Ser Met Val Asn Phe Met 

100 105 HO 

Arg Ala Ala Val Asn Glu His Arg Lys Ala He Gly Ala Glu Pro Val 
15 115 120 125 

Ser Ala Sex Asp Phe Leu Lye Met Met Arg Glu Lys Arg Ala He Val 

130 135 140 

Glu Leu Asp Asn Lys Leu Ala Ser Arg Ser Val Asn Glu Gly Phe Ser 
145 150 155 160 

20 Gly Gly Glu Lys Lys Arg Asn Glu He Phe Gin Met Ala Met Leu Glu 
165 no 175 

Pro Lys Leu Ala lie Leu Asp Glu Thr Asp Ser Gly Leu Asp He Asp 

180 185 190 

Ala Leu Arg He Val Ala Gly Gly Val Asn Arg Leu Arg Sor Pro Glu 
25 195 200 205 

Asn Ala A3 a He Val He Thr His Tyr Gin Arg Leu Leu Glu Tyr He 

210 715 220 

Lys Pro Asp Phe Val His Val Leu Tyr Ly? Gly Arg He Val Lys Ser 
225 230 235 240 

Gly Gly Ala Glu Leu Ala Leu Thr Leu Glu Glu Lys Gly Tyr Asp Trp 

245 250 255 

Tie Lys Glu Glu He Gly Glu 
260 

35 (2) INFORMATION FOR SEQ ID NO: 286 



30 



40 



45 



55 



60 



65 



70 



75 



(i> SEQUENCE CHARACTERISTICS; 

<A) LEHGTH: 4 62 amino acids 
(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

|ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE; 
50 (A) NAME/ KEY: mi sc_ feature 

<B) LOCATION 1...462 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 286 



Ser 


Met 


Ala 


Lys 


Glu 


Lys 


Thr 


He 


Tyr 


Val 


Cys 


Arg 


Ser Cys Gly Thr 


1 








5 










10 






15 


Lys 


Tyr 


Ala 


Lys 


Trp 


Gin 


Gly 


Asn 


Cys 


Asn 


Ala 


Cys 


Gly Glu Trp Asn 








20 










25 








30 


Cys 


lie 


Asp 


Glu 


Glu 


I.ys 


Val 


Pro 


Ala 


Pro 


Ala 


Ser 


Gly Lys His Ala 






35 










40 










45 


Ala 


Lys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro Arg Leu Leu 




50 










5S 










60 


Gin 


Asp 


Val 


Glu 


Ser 


Gly 


Asp 


Glu 


Glu 


Arg 


He 


Arg 


Leu Gly Asp Glu 


6b 










70 










75 




80 


Glu 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


He 


Val 


Lys 


Gly Ala Phe Val 










85 










90 






95 


Leu 


Leu 


Gly 


Gly 


Glu 


Pro 


Gly 


He 


Gly 


Lys 


Ser 


Thr 


Leu He T.eu Gin 








100 










105 








110 


Thr 


Val 


Leu 


Arg 


Leu 


Pro 


Gin 


Leu 


Arg 


Thr 


Leu 


Tyr 


Val Ser Gly Glu 






115 










120 










125 


Glu 


Ser 


Ala 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu Gly Gin Ala 




130 










135 










140 


Met 


Aen 


Gly 


Cys 


Tyr 


Val 


Tyr 


Cys 


Glu 


Thr 


Asn 


lie 


Glu Arg He Leu 


145 










150 










155 




160 


Ser 


Arg 


Ala 


Glu 


Glu 


Leu 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


He Asp Ser He 
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165 170 175 

Gin Thr Val Tyr Thr Glu Lilu Met Glu Ser Sex Ala Gly Ser Val Gly 

180 195 190 

Gin He Arg Glu Cys Ala Ala I.eu Leu Leu Lys Tyr Cys Lys Thr Thr 

195 200 205 

Gly He Pro Val He Val He Gly His He Thr Lys Glu Gly Ser He 

210 215 220 

Ala Gly Pro Lys Val Leu Glu His He Val Asp Thr Val Leu Leu Phe 
225 230 235 240 

Asp Gly Asp Lys His His Leu Tyr Arg He Leu Arg Gly Gin Lys Asn 

245 250 255 

Arg Tyr Gly Ser Thr Ser Glu Leu Gly He Tyr Glu Met Arg Gin Asp 

260 265 270 

Gly I.-sii Arg Gly Val Glu Asn Pro Ser Glu His Leu He Thr Arg Asn 

275 280 285 

Arg Glu Asp Leu Ser Gly Tift Ala He Ala Val Ala Met Glu Gly He 

290 295 300 

Arg Pro He Leu He Glu Aid Gin Ala Leu Val Ser Ser Ala He Tyr 
305 310 315 32Q 

Ala Asn Pro Gin Arg Ser Ala Thr Gly Phe Asp He Arg Arg Met Asn 

325 330 335 

Met Leu Leu Ala Val Leu Glu Lys Arg Ala Gly Phe Lys Leu He Gin 

340 345 350 

Lys Asp Val Phe Leu Asn He Ala Gly Gly He Lys He Ala Asp Pro 

355 360 365 

Ala Thr Asp Leu Ala Val He Ser Ala Val Leu Ala So.r Ser Leu Asp 

370 375 380 

He Val He Pro Pro Ala Val Cys Met Thr Gly Glu Val Gly Leu Ser 
385 390 395 400 

Gly Glu He Arg Pro Val Ser Axg He Glu Gin Arg He Thr Glu Ala 

405 410 415 

Arg Arg He Gly Phe Lys Glu He Leu Val Pro Ala Asp Asn Phe Arg 

420 425 430 

Gin Glu Asp Ala Gly Arg Phe Gly Tie Arg Leu Val Pro Val Arg Lys 

435 440 445 

Val Glu Glu Ala Phe Arg His Leu Phe Ser Lys Gly Arg Glu 
450 455 460 

(2) INFORMATION FOR SEQ ID HO: 287 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids; 
<B] TYPE: amino acid 
(D) TOPOLOGY: linear 

Ml) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival Is 

Ux) FEATURE: 

(A) NAME/ KEY : mlsc_ieature 
{&} LOCATION 1-..373 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 287 ' * 



Gly 


Ser 


Cys Gly 


Ser 


Ser 


Pro 


Ala 


He 


Gly 


Arg 


Ser 


Gly 


Cys 


Ser 


Arg 


1 






5 










10 










15 


Cys 


Asp Arg Arg 


He 


Gly 


His 


Gly 


Leu 


Ser 


Thr 


Ser 


Ala 


Phe Arg Thr 






20 










25 










30 






Tyr 


Ala 


Gly Leu 


Val 


Cy 5 


His 


Ser 


Arg 


Leu 


Arg 


Asp 


Gin 


Ser 


Ser 


Cys 


Arg 




35 








40 










4S 






Val 


Cys Gly 


Met 


Arg 


Pro 


He 


Gly 


Gin 


Leu 


Phe 


Phe 


Pro 


Ser 


Cys 




50 








55 








60 








Pro 


Arg 


Arg Ser 


Ala 


Ser 


Asp 


Ala 


Gly 


Arg 


Thr 


Glu 


Arg 


He 


Gin 


Ser 


65 


Leu 






70 








75 








8G 


Pro 


Gin Asn 


Leu 


Leu 


Gin 


Glu 


Met 


Gin 


Phe 


Ser 


Pro 


Thr 


Asn 


Arg 








85 










90 










95 


Arg Asn 


Pro Phe 


Asp 


Asn 


Met 


Asn 


Ser 


Arg 


His 


Leu 


Thr 


He 


Thr 


He 






100 










105 










110 






He 


Ala 


Gly Leu 


Ser 


Leu 


Phe 


Val 


Leu 


Thr 


Leu 


Gly 


Gly 


Cys 


Ser 


Val 


Ala 




115 








120 










125 








Gin 


Gin Asp 


Thr 


Gin 


Trp 


Thr 


Leu 


Gly 


Gly 


Lys 


Leu 


Phe 


Thr 


Ser 
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50 



60 



65 



70 



75 





130 










135 










14 0 






Ala 


Trp 


He 


Gin 


Arg 


Scr 


Ala 


Glu 


Tyr 


Gin 


Ala 


Leu 


y 9 lit: uxu 


Al a 


145 










150 










1 55 




160 


Tyr 


Asn 


He 


Ala 


Thr 


Glu 


Arg 


Val 


Asp 


Ala 


Leu 


Pro 


Ala Glu Arg 


Lys 










165 










170 






175 


Gin 


Gly 


Asp Arg 


Pro 


Tyr Ala 


He 


Val 


Thr 


Asp 


He 


Asp Glu Thr 


He 








1B0 










185 








190 




Leu 


Asp 


Asn 


Thr 


Pro 


Asn 


Ser 


Val 


Tyr 


Gin 


Ala 


Leu 


Arg Gly Lys 


Asp 






195 










200 










205 


Tyr 


Asp 


Glu 


Glu 


Thr 


Trp Gly 


Lys 


Trp 


Cys 


Ala 


Gin 


Ala Asp Al a 


Asp 




210 










215 










220 


Thr 


Leu 


Ala 


Gly 


Ala 


Leu 


Ser 


Phe 


Phe 


Leu 


His 


Ala 


Ala Asn Lys 


Gly 


225 










230 










235 




240 


He 


Glu 


Val 


Phe Tyr 


Val 


Thr 


Asn 


Arg 


Arg 


Asp 


Asn 


Leu Arg Glu 


Ala 










245 










250 






255 




Thr 


Leu 


Gin 


Asn 
260 


Leu 


Gin 


Arg 


Tyr 


Gly 
265 


Phe 


Pro 


Phe 


Ala Asp Glu 
270 


Glu 


His 


Leu 


Leu 

27 5 


Thr 


Thr 


His 


Gly 


Pro 
280 


Ser 


Asp 


Lys 


Glu 


Pro Arg Arg 
285 


Leu 


J.ys 


11- 


Gin 


Glu 


Gin 


Tyr 


Glu 


He 


Val 


Leu 


Leu 


He 


Gly Asp Asn 


Leu 




290 










295 










300 




Gly 


Asp 


Phe 


His 


His 


Phe 


Phe 


Asn 


Thr 


Lys 


Glu 


Glu 


Ser Gly Arg 


T.y* 


305 










310 










315 




320 


Gin 


Ala 


Leu 


Gly 


Leu 


Thr 


Ala 


Gly 


Glu 


Phe 


Gly 


Arg 


His Phe He 


Met 










325 










330 




33S 




Leu 


Fro 


Asn 


Pro 


Asn 


Tyr Gly 


Ser 


Trp 


Glu 


Pro 


Ala 


Trp Tyr Gly Gly 








340 










34 5 








350 




Lys 


Tyr 


Pro 
355 


Pro 


Leu 


Pro 


Glu 


Arg 
360 


Asp 




Ala 


Leu 


Lys Gin Leu 

365 


His 


Sex 


Gin 
370 


Asn 


Ser 


Arg 





















10 



15 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 268 

35 (i) SEQUENCE CHARACTERISTICS: 

<AJ LENGTH: 426 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOCY: linear 

40 (ii) HOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vif ORIGINAL SOURCE: 
45 (A) ORGANISM: Porphyroraonas gingiva lis 

(irj FEATURE: 

(A) NAME/ KEY: raiecj eature 
{D> LOCATION 1. . .4 26 



SEQUENCE DESCRIPTION: SEQ ID MO: 288 



Arg 


Trp 


Gly 


Phe 


Ser 


Asn 


Phe 


Val 


Ala 


Met 


Ser 


Thr 


Asn 


He 


Asp Val 


1 








5 










10 










15 




Gin 


Gin 


He 


Lys 
20 


Gin 


Arg 


Phe 


Gly 


He 
25 


He 


Gly 


Ser 


Ser 


Pro 
30 


Leu 


Met 


Glu 


His 


Ala 


He 


Arg 


Val 


Ala 


Ala 


Gin 


Val 


Ala 


Pro 


Thr Asp 


Met 


Ser 






35 










40 










45 








Val 


Leu 
50 


Val 


Thr 


Gly 


Glu 


Ser 
55 


Gly 


Ser 


Gly 


Lys 


Glu 
60 


Phe 


Phe 


Pro 


Gin 


He 


He 


His 


Tyr 


Tyr 


Ser 


Ala 


Arg 


Lys 


His 


lli- 


Ser 


Tyr 


He 


Ala 


Val 


65 










70 










75 








80 


Asn 


Cys 


Gly 


Ala 


He 


Pro 


Glu 


Gly 


Thr 


He 


Asp 


Ser 


Glu 


Leu 


Phe 


Gly 










85 










90 










95 


His 


Arg 


Lys 


Gly 


Ser 


Phe 


Thr 


Gly 


Ala 


Val 


Ser 


Asp 


Arg Lys 


Gly 


Tyr 








100 










105 










110 


Phe 


Glu 


GlU 


Ala 


Ser 


Gly 


Gly 


Thr 


He 


Phe 


Leu 


Acp 


Glu 


Val 


Gly Glu 






115 










120 










125 








Leu 


Pro 


Leu 


Pro 


Thr 


Gin 


Ala 


Arg 


Leu 


Leu 


Arg 


Val 


Leu 


Glu 


Thr 


Gly 




130 










135 








140 










Glu 


Phe 


He 


Pro 


Val 


Gly 


Ala 


Ser 


Gin 


Ser 


Gin 


Lys 


Thr 


Asp 


Val 


Arg 


14b 










150 










155 








150 


He 


Val 


Ala 


Ala 


Thr 


Asn 


Val 


Asn 


Leu 


Lys 


Glu 


Ala 


Val 


Ala 


Asn 


Gly 










165 










170 










175 


Lys 


Phe 


Arg 


Glu 


Asp 


Leu 


Phe 


Phe 


Arq 


Leu 


Asn 


Thr 


Val 


Pro 


He 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



10 



20 



206/ 490 

180 185 190 

Val Pro Ala Leu Arq Met Arg Pro Asp Asp Val Pro Leu Leu Phe Arg 

195 200 205 

Arg Phe Ala Ala Asp Ser Ala Glu Lys Tyr Arg Met Pro Pro Leu Arg 

210 215 220 

Leu Ser Asp Glu Ala Arg Thr lie Leu Met Arg Tyr Arg Trp Pro Glv 
225 230 235 240 

Asn Val Arg Glu Leu Arg Asn He Thr Asp Arg Leu Ser He Leu Glu 

245 250 255 

Glu Glu Arg Thr Val Ser Ala Glu Thr He Thr Arg Tyr Leu Asp Ala 

260 265 270 

Glu Gly Met Gin Asp Leu His Pro Val Val He Arg Arg Ann Glu Thr 

2"*5 280 285 

Thr Glu Ala Asp Lys Gin He Pro His Tyr Glu Arg Glu He He Tyr 
ID 290 295 300 

Gin Val Leu Tyr Asp Met Lys Lys Glu He Ala Asp Leu Lys Glv Mer 
305 310 315 320 

Met Asn Arg Leu Ala His His Glu Gin Pro Ser Trp Pro Val Gly Ser 

325 330 335 

Asp Val Trp Gly Asn Asp Asp Lys Arg Thr Ala Asp Pro Lys Trp Glv 

340 345 - 350 

Val Scr Thr His Lys Ala Pro He Ala Asn Ala Ala Glu Pro Val Glu 

355 360 365 

Pro He Gin Glu Ala Ser Glu Tyr Thr Glu Asp Pro Val Ser Leu Glu 

370 375 380 

Glu Val Glu l,yr? Lys Met He Ser Leu Ala Leu Glu Arg His Glv Glv 
385 390 395 * 4 0 0 

Arg Arg Lys Gin Thr Ala Glu Glu Leu Lys He Scr Glu Arg Thr Leu 

JU Tyr Arg Lys lie Lys Glu Tyr Gly Leu Glu 
420 425 

(2) INFORMATION FOR SEQ ID NO: 289 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 amino acids 

(B) TYPE: amino acid 
tE) TOPOLOGY: linear 

40 <ii) MOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE: 
^° [A) ORGANISM: Porphyromonas gingival Is 



50 



(iX) FEATURE: 

[A> NAME/KEY: mi3C_featur- 
£ B ) LOCATION 1 . . .653 

(Xi) SEOUEHCF. DESCRIPTION: SEQ ID HO: 28 9 





Lys 
1 

Glu 


Asn 


Leu 


Pro 


Arg 


Tyr 


Ala 


Pro 


Ala 


Phe 


Gly Arg Asn Arg 


Lys 


Arg 


55 


Gin 


Pro 


Asp 


5 

Pro 


Asp Asn 


Asp Asp 


10 
Ser 


Arg Gly Arg Ser 


15 
His 


Arg 




Gin 






20 










2S 






30 






Ser 


Gly 


Arg Ala 


Val 


Thr 


Val 


Val 


Cys 


Arg Arg He 


Gin 


Thr 


Pro 




Leu 


Pro 
50 


35 










40 






45 








60 


His 


Met 


He 


Leu 


Leu 

55 


Phe 


Gly Gly 


Thr Thr Glu 
60 


Gly Arg Ala 




Ala 
65 


Ala 


Arg 


Val 


Leu 


Asp 


Glu 


Ala 


Gly 


Ser 


Pro Phe Phe 


Tyr 


Ser 


Thr 












70 










75 




80 




Lys 


Gly Asn 


Leu 


Gin 


Glu 


He 


Gin 


Ser 


Ser 


His Gly His 


Arg 


Leu 


Thr 


65 


Gly 


Ala 


Met 


Thr 


85 
Val 


Ala 


Asp 


Met 


Val 


90 
Ser 


Phe Cys Arg 


Lys 


95 
Glu 


Glu 




He 


Arg 




100 










105 






110 








Leu 


He 


Val 


Asp 


Ala 


Ala 


His 


Pro 


Pho Ala Glu 


Glu 


Leu 


His 




Ala 




115 










120 






125 






70 


Ser 
130 


Val 


Ala 


Glu 


Ala 


ser 
135 


Glu 


Gin 


Thr 


Gly He Pro 
140 


Val 


Val 


Arg 




Tyr 
145 


Gl u 


Arq 


Gin 


Tyr 


Pro 
150 


Pro 


Arg 


Glu 


Glu 


Gly He Val 
155 


Trp 


Cys 


Ala 
160 




Asn 


Tyr Asp 


Thr 


Ala 


Ala 


Glu 


Arg 


Het 


Leu 


Gly Asp Gly Val 


Gin 


Arg 


75 


Leu 


Leu 


IleL 


Leu 


165 
Thr 


Gly 


Val 


Asn 


Thr 


170 
He 


Pro Lys Leu 


Ala 


175 
Ala 


Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



207/ 490 



10 



20 



1BO 1R.S 190 

Trp Lys Glu Arg Thr Thr Phe Cys Arg He Leu Lys Arg Asp Glu Ser 

195 200 205 

Val Ala Leu Ala Glu Lys Asn Gly Phe Pro Ala Glu Arg He Val Phe 

210 215 220 

Phe Glu Pro His Ala Asp Glu Glu Leu Met Gin Ala Val Arg Pro Asp 
225 230 23S 7 40 

Ala He He Thr Lys Glu Ser Gly Glu Ser Gly Tyr Phe Arg Glu Lys 

245 250 255 

He Glu Ala Ala Arg Arg Met Gly He Arg He Tyr Ala Val Val Arg 

260 265 270 

Pro Pro Leu Pro Pro Ser Phe He Pro Val Gly Gly Pro Val Gly Leu 

275 280 285 

Arq Arg Ala Val Glu Arg Leu Val Pro Gly Phe Phe Ser Leu Arg Ser 
XO 2 90 295 300 

Gly Phe Thr Thr Gly Thr Thr Ala Thr Ala Ala Val Val Ala Ala Het 
305 310 3is 320 

Tyr Arg Leu tlet Gly Leu Gly Ser Leu Ala Glu Aid Pro Val Glu L*»u 

325 330 335 

Pro Ser Gly Glu He Val Ser Leu Pro He Ala Glu He Arg Glu Glu 

3-10 345 350 

Glu Asp Ala Val Val Ser Ala Val Leu Lys Asp Ala Gly Asp Asp Pro 

355 360 365 

Asp Val Thr Acn Gly Het Ala Val Cys Ala Thr He Arg Leu Asa Pro 
25 370 375 380 

Glu His Glu Glu Val Arg Phe Leu Gin Gly Glu Gly Val Gly Val Val 
385 390 395 400 

Thr Leu Pro Gly Leu Gly Leu Glu Val Gly Gly Pro Ala He Asn Leu 

405 410 415 

Val Pro Arg Arg Met Met Thr Ala Glu Val Arg Arg Leu Tyr Ala Gin 

420 425 430 

Gly Gly Val Arp He Thr He Ser Val Pro Glu Gly Arg Glu Ala Ala 

•135 440 44s 

Thr Gin Thr Phe Asn Pro Arg Leu Gly He Arg Asp Gly He Ser II- 
«55 450 455 460 

He Gly Thr Ser Gly Val Val Lys Pro Phe Ser Ala Glu Ala Phe Val 
465 470 475 4 8 o 

Gly Ala He Arg Lys Gin Val Gly He Ala Thr Ala Leu Gly Ala Asn 

485 490 495 

His He Val Leu Asn Ser Gly Ala Lys Ser Glu Arg Tyr Val Lys Gly 

500 505 510 

Ala Tyr Pro Ala Leu He Pro Gin Ala Phe Val Gin Tyr Gly Asn Ph~ 

515 ' 520 525 

Val Gly Glu Ser Leu Ser Cys Val Ala Ser Phe Pro Ser Val Arg S-r 
45 530 535 540 

Val Thr Val Gly He Het Leu G)y Lys Ala Val Lys Leu Ala Glu Gl^ 
545 550 555 560 

Tyr Leu Asp Thr His Ser Lys Lys Val Val I let Asn Arg Asp Phe J.eu 

565 57C 575 

His Glu Leu Ala Arg Gin Ala Gly Cys Ser Glu Asp He His Ala He 

580 5B5 590 

He Asp Ser Leu Asn Leu Ala Arg Glu Leu Trp Thr Met Pro Ser Ala 

595 600 605 

Glu Asp Ser Asp Arg Leu Leu Arg Lys He Ala Glu Arg Ser Trp Glu 
55 610 615 620 

Thr Cys Arg Pro Ser Val Pro Ser Ala Glu Leu Glu Leu Leu Leu lie 
625 630 635 640 

Asp Glu Ser Gly Ala He Arg Phe Arg He Gly Gly Glu 
645 650 



30 



40 



50 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 290 

(i| SEQUENCE CHARACTERISTICS: 

(A] LENGTH: 451 amino acid? 
(BJ TYPE: amino acid 
(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

70 (III) HYPOTHETICAL: YES 

|vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas Gingival is 

75 (ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



208/ 490 



(A) NAME/ KEY: ml rc_f eature 

(B) LOCATION 1 . . .451 

SEQUENCE DESCRIPTION: SEQ ID HO: 290 

Gin Gin Arg Pro His Arg He He Met. T.eu Arg Thr Phe Arg He Gly 

* 5 10 15 

Gly He His Pro Pro Glu Asn Lys Leu Ser Ala Gly Lys Pro Val Glu 

20 25 30 

Val Leu Pro lie Pro Ser Gin Val Val He Pro Leu Gly Gin His He 

35 40 45 

Gly Ala Pro Ala Thr Ala Thr Val Lys Lys Gly Asp Glu Val Lys Val 

50 55 60 

Gly Thr II- He Ala Gin Ala Gly Gly Phe Val Ser Ala Asn Tie His 
I s 70 75 80 

Ser Ser Val Ser Gly Lys VaJ Leu Lys He Asp Asn Val Tyr Asp Ser 

85 oo g 5 

Ser Gly Tyr Pro Lys Pro Ala Val Phe He Ser Val Glu Gly Asp Glu 

100 10b HO 

Trp Glu Glu Gly He Asp Arg Ser Pro Ala He Val Lys Glu Cys Asn 

115 120 125 

Leu Asp Ala Lys Glu He Val Ala Lys He Ser Ala Ala Gly II- Val 

330 135 140 

Gly Leu Gly Gly Ala Thr Phe Pro Thr His Val Lys Leu Ser Pro Pro 
J 45 150 155 i60 

Pro Gly Asn Lys Ala Glu lie L=u 13 e Tie Asn Ala Val Glu Cys Glu 

165 170 175 

Pro Tyr Leu Thr Ser Asp His Val Leu Met L*u Glu His Gly Glu Glu 

ISO las 190 

lie Met He Gly Val Ser He Leu Met Lys Ala He Gin Val Asn Lvs 

195 200 205 

Ala Val He Gly Val Glu Asn Asn Lys Lys Asp Ala He Ala His Leu 

210 215 220 

Thr Lys Leu Ala Thr Ala Tyr Pro Gly He Glu Val Met Pro Leu Lys 
225 230 235 240 

Val Gin Tyr Pro Gin Gly Gly Glu Lys Gin Leu He Asp Ala Val He 

245 250 255 

Arg Lys Gin Val Lys Ser Gly Ala Leu Pro He Ser Thr Gly Ala Val 

260 265 270 

Val Gin Asn Val Gly Thr Val Phe Ala Val Tyr Glu Ala Val Gin Lvs 

275 280 285 

Asn Lys Pro Leu Val Glu Arg He Val Thr Val Thr Gly Lys Lys Leu 

290 295 300 

Ser Arg Pro Ser Asn Leu Leu Val Arg He Gly Thr Pro He Ala Ala 
305 310 315 3 20 

Leu He tflu Ala Ala Gly Gly Leu Pro Glu Asn Thr Gly Lys II- He 

325 330 335 

Gly Gly Gly Pro Met tlet Gly Arg Ala Leu Leu Ser Pro Asp Val Pro 

340 345 350 

Val Thr Lys Gly Ser Ser Gly Val Leu He Leu Asp Arg Glu Glu Ala 

355 360 365 

Val Arg Lys Pro Met Arg Asp Cys He Arg Cys Ala I.ys Cys Val Gly 

370 375 3Q0 

Val Cys Pro Met Gly Leu Asn Pro Ala Phe Leu Met Arg Asp Thr Leu 
i? 85 T 390 395 400 

Tyr Lys Ser Trp Glu Thr Ala Glu Lys Gly Asn Val Val Asp Cys He 

405 410 415 

Glu Cys Gly Ser Cys Ser Phe Thr Cys Pro Ala Asn Arg Pro Leu Leu 

* «. 420 425 «0 

Asp Tyr He Arg Gin Ala Lys Lys Thr Val Met Gly He Gin Arg Ala 

435 440 445 

Arg Lys Gin 
450 



(2) I N FORMAT 1 Of J FOR SEQ ID UO:291 

ID SEQUENCE CHARACTERISTICS: 

IA) LEMGTH: 4 89 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



209/490 



Ivil ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(iX) FEATURE: 

(A) NAME/ KEY: miscjeature 
(BJ LOCATION 1. . . 48 9 







SEQUENCE DESCRIPTION: 


SEQ 


ID MO: 291 










Met 


Asn 


Ser 


Gin 


Lys 


Lys 


Glu 


Ala 


Phe 


Asn 


Hot 


Lys 


Arg 


He 


Gin 


Leu 


1 








5 










10 










15 




Thr 


Leu 


He 


Al a 


Lou 


Phe 


Ala 


Ala 


Val 


Ala 


Gly 


Leu 


Val 


Ala 


Gin 


At n 








20 










25 










30 






Ala 


Tyr 


Glu 


Gly 


Val 


He 


Ser 


Tyr 


Lys 


He 


Ser 


T.eu 


Asp 


Lys 


Thr 


Gly 






35 










40 










45 








Asn 


Lis 


Val 


Val 


Leu 


Asn 


Gly 


Ala 


Ala 


Asp 


Met 


Ser 


Asn 


Leu 


Lys 


Leu 




50 










55 










60 








Lys 


Ser 


Thr 


Gin 


Met 


He 


He 


Val 


Thr 


Pro 


Tie 


I.eu 


Arg 


Ser 


Glu 


Asp 


65 










70 










7 5 










80 


Gly 


Thr 


Ser 


Arg 


Val 


Glu 


Phe 


Pro 


Ser 


Val 


Val 


He 


Thr 


Gly 


Arg 


Asn 










85 










90 










95 




Arg 


Thr 


Lys 


Ala 


Leu 


Lys 


Arg 


Glu 


Ho 


Ala 


Phe 


Ser 


Ser 


Ala 


Leu 


Pro 








100 










105 










110 






Gin 


Ala 


Lyr 


His 


Ala 


Ala 


Gin 


Tyr 


He 


Arg 


Arg 


His 


Asn 


Gly 


Lys 


Ser 






115 










120 










125 








Glu 


Gin 


Ph« 


Ala 


Phe 


Thr 


Gly 


Glu 


Ills 


Ala 


Tyr 


Ala 


Ser 


Trp 


Met 


Met 




130 










135 










14 0 










Arp 


Ala 


Lys 


The 


Val 


"Val 


Arg 


Glu 


Glu 


Val 


Arg 


Gly 


Cys 


Ma 


Lys 


Cy? 


145 










3 50 










155 








160 


Pro 


Val 


Gly 


Leu 


Ser 


Ser 


Asn 


Tl e 


Val 


Pro 


Phe 


Asp 


Pro 


Leu 


Phe 


Asn 










165 










170 










175 




Pro 


Ala 


GlU 


Ala 


Pro 


Tyr 


Leu 


Leu 


Ala 


His 


He 


Thr 


Pro 


Ala 


Glu 


Glu 








180 










185 










190 






Val 


Glu 


Lys 


Gin 


Arg Glu 


Ser 


Ser 


Phe 


Asp 


Ala 


Tyr 


He 


Asn 


Phe 


Lys 






19b 










200 










205 






Val 


Am 


Lys 


Ala 


Asp 


Val 


Leu 


Pro 


Glu 


Tyr 


Arg 


Asn 


Asn 


Lys 


Ala 


Glu 




210 










215 










220 










Levi 


Glu 


Lys 


He 


Lys 


Glu 


Phe 


Val 


ser 


Thr 


Val 


Lys 


Ala 


Asn 


Pro 


Asn 












230 










235 










240 


Tyr 


Ser 


Val 


Asn 


Lys 


Met 


He 


lie 


Glu 


Gly 


Phe 


Ala 


Ser 


Pro 


Glu 


Ala 










245 










250 










255 




Ser 


lie 


Ala 


His 


Asn 


Lys 


Ala 


Leu 


Ser 


Glu 


Arq 


Arg 


Ala 


Lys 


Arg 


Leu 








2G0 










265 










270 






Ala 


Glu 


Glu 


Leu 


Val 


Arq 


Lys 


Tyr 


Gly 


Lys 


Thr 


Leu 


Pro 


Asn 


He 


Thr 






275 










280 










285 








Thr 


Glu 


Phe 


Gly 


Gly 


Glu 


Asp 


Trp 


Lys 


Gly 


T.eu 


T.ys 


Leu 


Ala 


He 


Glu 




2 SO 










2y"b 










300 










Lys 


Ser hep 


He 


Ala A^p 


Arg 


Asp 


Arg 


Val 


Leu 


Glu 


He 


He 


Aen 


Ser 


30b 










31 0 










315 










320 


Asp 


Lys 


Tyr 


Ala 


Asp Asp 


Aep 


Ala 


Arg 


Glu 


Gin 


Ala 


Leu 


Lys 


Gin 


Leu 










32b 










330 










335 




Scr 


Ser 


Tyr 


Arg 


Tyr 


He 


Leu 


Asp 


Gin 


He 


Tyr 


Pro 


Asn 


Leu 


Arg Arg 








34 0 










34 5 










350 






Asn 


Thr 


He 


Thr 


Met 


Gly 


Tyr 


He 


Val 


Arg 


Asp 


Tyr 


Thr 


Leu 


Glu 


Glu 






355 










360 










365 








Ala 


Arg 


Glu 


He 


He 


Lys 


Thr 


Ala 


Pro 


Lys 


Glu 


Leu 


Ser 


Glu 


Ala 


Glu 




370 










375 










380 










Hat 


Tyr Arg 


Val 


Ala 


Met 


Ser 


Tyr 


Pro 


Glu 


Gly 


Hie 


Gin 


Glu 


Arg 


Leu 


38b 










390 










395 








4 00 


Phe 


Ala 


Leu 


Asn 


Thr 


Thr 


Leu 


Lys 


Tyr 


Phe 


Pro 


Glu 


Ser 


Val 


Thr 


Gly 










4 05 










410 










415 


Arg 


lie 


Acn 


Leu 


Ala 


Val 


Ala 


Ala 


Phe 


Asn 


Gly 


Gly 


Asp 


Val 


Gin 


Gin 








420 










425 










4 30 






Ala 


lie 


Ala 


Leu 


Leu 


Ser 


Pro 


He 


Gin 


Thr 


Glu 


Lys 


Gly 


Val 


Ser 


Asn 






435 










440 










445 








lie 


Leu 


Gly 


Ala 


Ala 


Tyr 


Ala 


Arg 


Thr 


Gly 


Asp 


Phe 


Ala 


Arg 


Ala 


Glu 




4 50 










4 55 










4 GO 








Thr 


Phe 


Phe 


Arg 


Lys 


Ala 


Val 


Ala 


Glu 


Gly 


Asp 


Ala 


Asn 


Ala 


Gin 


Arg 


465 










470 










475 










480 


Asn 


Leu 


Asp 


Met 


Leu 


Leu 


Gly 


Lys 


Lys 

















465 



(2) IMF0W1ATIOIJ FOR SEQ ID HO: 292 
(j.) SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



210/490 



10 



15 



(AJ LEHGTH: 384 nmino acids 
(81 TYPE: amino acid 
(D) TOPOLOGY: linear 

In) MOLECULE TYPE: prorein 

(ili) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyiomonas gingival^ 

(i-'.t FEATURE : 

(A) NAME/ KEY : nuec feature 

(B) LOCATION 1...38 4 



Ui» SEQUENCE DESCRIPTION: SEQ ID NO: 292 

Asp Mat Ala Glu Ly, Arg A,p Tyr Tyr Glu Vnl T.cu Gly Val S « Lys 

20 ton Ala Thr Asp Asp Glu Leu Ly 5 Ly. aL Tyr Arg Lys Ly£ He 

Gin Tyr His Pro Asp Lys Asn Pro Gly Asp Lys Glu Ala Glu Glu His 

25 PhS IV Ala G1U '^* r Asp Val L « p« °m i.y* 

Arg Ser Gin Tyr Asp Gin Phe Gly Hi. Ala Gly Leu Gly Gly Ala Ala 

Gly Gly Gly Phe Ser Gly Gly Gly Met iL Glu Asp Ilo phe °° r 

30 Arg Phe Gly Asp Leu Phe Gly Gly Phe Gly Gly P hs Giy Giy £ B s « 

Asp Met Gly Gly Gly Ser Arg Arg Arg Val Arg Arg Gly ^ Asp Leu 

120 

35 Val Li ' S LSU ?« L =" ~» *-P II- l£ Gly val Glu 

Lys Lys val Ly s Val Lys Lys" cm Val Val cys Ser Lys C ys Arg Gly 
Asp Gly Tnr Glu Glu Ala Asn Gly r>ya Thr * = * Cys Gln Thr ^ ™ 
40 Gly Thr Gly Val Val Thr Arg Val Ser £° Thr p hs Le u ely Ht _, 

Gin Thr Gin Ser Thr Cys Pro Thr Cys Bly Glu Gly G ?° 

45 2^0 S " ' ,yS ^" °» Glu Gly Val Mu He Gly Glu 



Glu Val He Ssr Phe „„ Pxo Ala Gly Val I™ Glu Gly Met eln 

Met Ser val A,n Gly L ys Glv A. n AIa ^ f£ Ary Gly Qly y ^ 
50 Gly Asp Leu 11. val Val lie Ala Glu Glu Pro Asp Pro Asn Leu He 

55 ^ G1V Gly ™ Val Pro T " «• ^ Gly Arg Ala 

Ly- H= Arg 11* Glu Ala Gly Thr Gin Pro Gly Lys Met Leu Arg Leu 
Arg Asn Lys Gly Leu Pro Ser Val Asn Gly Tyr Gly Met Gly Asp 
60 Leu Val Asn Val Asn Val Tyr He Pro "u Ser He Asp Ala lyt Asp 
Glu Gin Ala He Ala Ala Met Glu ^ S9 r Asp Ser Phe Ly" Pro Thr 



Arg Asn Gly Asn Asp Leu lie Tyr Asn Leu Leu He Ser vll Pro Leu 

200 

Val Pro Thr 

300 



65 37U Ar9 AS " 1% ^ *-r* T V* "2 Met Leu Asp 



380 



(2) INF ORMAT I ON FOR SEQ ID NO : 293 

7n (*> SEQUENCE CHARACTERISTICS- 

/U <*> LENGTH: 309 amino acid- 

<B> TYPE: amino acid 
(D) TOPOLOGY: linear 

ti*> HOLECULC TYPE: protein 
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tiii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona-f? gingiva)!? 

ilx) FEATURE: 

(A) NAHE/KEY: misc_feature 
( B J LOCATION l . . . 309 

(xa.) SEQUENCE DESCRIPTION; SEQ ID NO: 293 

Arg Phe Asp Ser Thr Thr Asn Val Ser Gin lie Asn Leu Arg Thr Glu 

15 10 is 

Met Lyf Lys Leu lie Leu Ala Thr Leu Gly Leu Met Ala lie Ala M°L 

20 25 30 

Leu Ser Cys Ser Ser Asn Asn Lys Asp Leu Glu Asn Lys Gly Glu Ala 

35 40 45 

Thr Leu Leu Val Thr Phe Gly Ser Ser Tyr Lys Ala Pro Arg Glu Thr 

50 55 go 

Tyr Ala Lys lie Glu Lys Thr Phe Ala Ala Ala Tyr Pro Asp Gin Arg 
65 70 75 go 

He Ser Trp Thr Tyr Thr Ser Ser He lie Arg Lys Lys Leu Ala Gin 

05 90 q 5 

Gin Gly He Tyr He Asp Ala Pro Asp Glu Ala Leu Glu Lys Leu Ala 

1°° " 105 no 

Arg Leu Gly Tyr Lys Lys He Asn Val Gin Ser Leu His Val II- pro 

IIS 120 12 5 

Gly Arg Glu Tyr Asp Glu Met He Asp Phe Val Asn Lys Phe Lys Ala 

130 135 i 4 o 

Ala Hi* Ser Asp He Thr Val Lys Val Gly Ala Pro Leu Phe Asp Thr 
145 150 155 160 

Asp Glu Asp Met Arg Glu Val Ala Glu He Leu His Lys Arg Phe Gin 

165 170 175 

Gin Thr He Glu Lys Gly Glu Ala He Val Phe Met Gly His Glv Thr 

l fl 0 IBS 190 

Glu His Ala Ala Asn Asp Arg Tyr Ala Arg He Asn Lys He Met Lys 

195 200 205 

Asn Tyr Ser Lys Phe Met lie Val Gly Thr Val Glu Ser Asp Pro S-r 

210 215 220 

He Asn Asp Val He Ala Glu Leu Lys Glu Thr Gly Ala Thr Ala Val 
225 230 235 240 

Thr Met Met Pro Leu Met Ser Val Ala Gly Asp Hie Ala Thr Asn Asp 

245 250 255 

Met Ala Gly Acp Glu Asp Asp Ser Trp Lys Thr Leu Leu Thr Asn Ala 

260 265 270 

Gly Tyr Thr Val Ser He Asp Lys Leu Asp Asn Gly Asn Phe Ser Ala 

275 280 285 

Leu Gly Asp He Glu Glu He Arg Asn He Trp Leu Lys His Met Lys 

290 295 3Q0 

Ala Thr Ser Ala Arg 
305 

(2) INFORMATION FOR SEQ ID HO: 2 94 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

liii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix> FEATURE: 

(A) NAME/ KEY: miscfeature 
(Q) LOCATION 1. . .491 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2 94 

Arg Arg Asp Ala Arg Arg Gin Leu Val Arg Arg Arg Arg Lys Glu Leu 

10 15 
Ser Gly Thr Ala Asn Thr Arg Cys Arg Arg Arg Val Phe Pro Glu Arg 
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20 










25 










30 






Arg 


Set 


Ala 


He 


Ary 


Trp 


Cys 


Phe 


Pro 


Arg 


Leu 


Arg 


A.?p 


Gly 


Leu 


Gin 






35 










40 










45 








Pro 


He 


Arg 


Phe Cly 


Ser 


Leu 


Leu 


Ala 


Asn 


Pro 


Gin 


Gin 


Leu 


Ser 


Arg 




50 










55 










60 










Gly 


Val 


Axg 


Leu 


Arg 


Ser 


Phe 


Glu 


Ser 


Arg 


Arg 


Gin 


Pro 


He 


cys 


Arg 


65 










"70 










75 










80 


Phe 


Gin 


Gly 


He 


Gin 


Gly 


Lvs 


Ar 


eg 


Asp 


He 


Leu 


Gin 


Pro 


Glu 


Gly 










35 










90 










95 




Asn 
n 


Str 


Leu 


Leu 


Axg 


Thr 


His 


Ala 


His 


Ala 






eu 


Gly Ala 


Gly 








100 










105 










110 








Ser 


j yr 


Arg 


Tyr 


Leu 


Ala 


Pro 


Arg 


Thr 


Thr 


Ser 


Arg 


Pto 


Pro 


Thr 






115 










120 










125 








Ala 


Phe 


Leu 


Leu 


Phe 


Ala 


Pro 


Met 


Thr 


» or 


Val 


S«r 


His 


Leu 


Arg 


r 




130 










135 










14 0 










He 


Se r 


Va 1 


Ala 


Gly 


He 


Leu 


Ala 


Ala 


eu 




Gly 


Ala 


Val 


Leu 


11 ** 


14 5 










150 










1 55 








160 




Phe 


(31 v 


Val 


Asn 


Leu 


Phe 


Leu 




Ser 


Val 


Al a 


T 1 f 


Pro 


Met 


Ser 










1G5 










170 










175 




Glu 


He 


Phe 


Arg 


His 


Leu 


Phe 


Ser 


Asp 


Arg 


Pro 


Gl 
o u 




Gly 


Glu 


a 








ISO 










185 










190 






Leu 


Va 1 


His 


Tyr Acn 


Tie 


Leu 


Trp 


Lys 


Ser 


Arg 


Leu 


Pro 


Glu 


Ala 








195 










200 










205 








Thr 


Ala 


Ala 




Ala 


Gly 


Ala 


Gly 


i#2U 


Ser 


Val 




Gl" 


T.eu 


Gin 


M«t 




2.1 o 










215 










220 










Gin 


Thr 


Va 1 


Phe 


Arg 


A«n 


Pro 


Leu 


Ala 


Gly 


Pro 


Ser 


Val 


Leu 


Gly 


He 


22 5 










230 










235 










240 


Ser 


- - 


^J-i 


Ala 


Ser 


Leu 


Gly 


Val 


Ala 


Leu 


Val 


Va 1 


Leu 


Leu 


Ser 


Gly 










245 










250 










255 




Ser 


L°u 


Gly 


Gly Val 


A3 a 


Leu 


Ser 


Ser 


Leu 


Gly 


Tyr 


e 


Gly 


Glu 


Val 








260 










265 










270 






Ala 


Het 


Asn 


He 


Ala 


Ala 


Ala 


Val 


Gly 


Set 


Leu 


Ala 


Val 


Met 


Gly 








2*7 5 










280 










285 








He 


Val 


Phe 


Val 


Ser 


Thr 




Val 


Arg 


Ser 


His 


Val 


Thr 


Leu 


Leu 


He 




290 










295 










300 










He 


Gly 


Val 


Met 


He 


Gly Tyr Val 


Ala 


Thr 


Ala 


Val 


He 


Gly Val 


Phe 


305 










310 










315 










320 


Lys 


Phe 


Phe 


S«r 


He 


Glu 


Glu 


Asp 


He 


Arg 


Ala 


Tyr 


Val 


He 


Trp 


Gly 










325 










330 










335 




Leu 


Gly 


Ser 


Phe 


Ser 


Axg 


Ala 


Thr 


Asp 


Ser 


Gin 


Leu 


Ser 


Phe 


Phe 


Ala 








34 0 










345 










350 






He 


Leu 


Met 


Leu 


He 


Phe 


He 


Pro 


Ala 


Gly Itet 


Leu 


Leu 


Val 


Lys 


Gin 






355 










360 










365 








Leu 


Asn 


Leu 


Leu 


Leu 


Leu 


Gly 


Glu 


Ser 


Tyr Al a 


Arg 


Arn 


Leu 


Gly 


Leu 




370 










375 










380 










Asn 


Thr 


Arg 


Arg 


Ala 


Arq 


Leu 


Leu 


Val 


He 


Ssr 


Sex 


Aid 


Gly 


Leu 


Leu 


385 










390 










395 








400 


He 


Ala 


Thr 


Val 


Thr 


Ala 


Tyr 


Cys 


Gly 


Pro 


lie 


Gly 


Phe 


Leu 


Gly Met 










4 05 










410 










415 




Ala 


Val 


Pro 


His 


Leu 


Ala 


Axg Val 


He 


Phe 


His 


Thr 


Ser 


Aap 


Hxs 


Arg 








420 










425 










430 






He 


Leu 


Het 


Pro 


Ala 


Thr 


Cys 


Leu 


He 


Gly 


Ser 


Ala 


Leu 


Ala 


Leu 


Phe 






435 










440 










445 








Cya 


Asn 


He 


He 


Ala 


Arg 


Met 


Pro 


Gly 


Fhe 


Glu 


Gly 


Ala 


Leu 


Pro 


Val 














455 










460 










Asn 


Ser 


Val 


Thr 


Ala 


Leu 


Val 


Gly Ala 


Pro 


He 


He 


Val 


Thr 


Val 


Leu 


465 










470 










475 










4 80 


Phe 


Arg 


Arg 


Arg 


Arg 


Phe 


Lys 


Glu 


Glu 


The 


Asp 




















485 










4 90 














(2) 


INFORMATION 


FOR 


SEQ 


ID HO:295 

















<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY; linear 



Ui> MOLECULE TYPE: protein 

lilli HYPOTHETICAL: YES 

(Vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gmgivalis 

(IX) FEATURE: 
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(A) NAUR/KEY: misc_f eature 
tB) LOCATION 1 . . .763 

(Xii SEQUENCE DESCRIPTION: SEQ ID NO: 2 9b 

His Phe Leu Val Thr lie Met Arg Thr Lys Thr Tie Phe Phe Ala Il» 

15 i0 is. 

lie S»r Phe He Ala Leu Leu Ser Ser Ser Leu Ser Ala Gin Ssr Lys 

20 25 30 

Ala Val Leu Thr Gly Ser Val Ser Asp Ala Glu Thr Cly Glu Pro L-u 

35 40 45 

Ala Gly Ala Arg lie Glu Val Lys His Thr Asn He Val Ala Gly Ala 

50 55 60 

Asp Ala Gly Gly His Phe Glu He Lys Asn T.eu Pre Ala Gly Gin Hi- 
65 70 75 eo 

Thr lie Tie Cys Ser Leu Gly Gly Tyr Gly Gin Lys Glu Glu Val Val 

85 90 9b 

Ala He Glu Ala Gly Gin Thr Lys Thr Tic Ser Phe Ala Leu Arg L-u 

100 105 110 

Arg Thr Asn Asn Leu Glu Glu Val Val Val Thr Gly Thr Gly Thr Aro 

115 12 0 125 

Tyr Arg Leu Val Asp Ala Pro Val Ala Thr Glu Val L~u Thr Ala t -.-<. 

130 135 140 

Asp He Ala Ser Phe Ser Ala Pro Thr Ser Glu Ala Leu Leu Gin r=l-.- 
145 150 155 i 6 6 

Leu Ser Pro Sei Phe Asp Phe Gly Pro Asn Leu Met Gly S-r Ph- Met 

16b 170 175 

Gin Leu Asn Gly Leu Ser Ser Lys Tyr He Leu He Leu He Asp Gl- 

180 a8 5 i 9Q 

Lys Arg Val Tyr Gly Asp Val Gly Gly Gin Ala Asp Leu Ser Arg II- 

195 200 205 

Ser Pro Asp Gin He Glu Arg He Glu Leu Val Lys Gly Ala Ser Ser 

210 215 220 

Ser Leu Tyr Gly Ser Asp Ala Tie Ala Gly Val He Asn Val He Thr 

235 240 



225 230 

Lys Lys Asn Thr Asn Arg Leu Ser Ala Tyr Thr Ser His Arg He Ser 

245 250 255 

Lys Tyr Asn Asp Arg Gin Thr Asn Thr Ser Leu Asp He Asn He Glv 

260 265 270 

Lys Phe Ser Ser Asn Thr Asn Tyr Phe Phe Tyr His Thr Asp Gly Trp 

275 2B0 285 

Gin Asn Ser Pro Phe Glu He Lys Lys Lys L y s Gly Ser Gly Glu Pro 

290 295 300 

Val Leu Glu Glu Thr Tyr Lys Lys Thr Phe Arg Ala Gin Glu Asn Gin 
305 310 315 320 

Gly Val Ser Gin Ser Leu Ser Tyr Tyr Ala Thr Asn Asn Leu Ser Ph- 



325 



330 335 



Ser Gly Asn Val Gin Tyr Asn Lys Arg Gin He Phe Thr Pro Thr Ph« 

340 3-15 350 

Ser Glu Lys Lys Ala Tyr Asp llet Asp Tyr Arg Ala Leu Thr Ala Ser 

355 36O 365 

Leu Gly Thr Asn Tyr Leu Phe Pro Asn Gly Leu His Thr Leu Ser Phe 

370 375 39 0 

Asp Ala Val Tyr Asp Arg Phe Arg Phe Gly Tyr Leu Tyr His Asp Lyr 
385 ^ 390 3gb * 4 J 0 

Asp Ser Ser Glu Sex Leu He Asn Asn Gin Gly Gin Thr Glu Gin Pro 



40S 



410 415 



Thr Phe Phe Pro Gly Gin Leu Arg Asn Lys Asn Asp Gin He Arg Tyr 

420 425 430 

Thr Ala Glu Ala Arg Gly Val Phe Thr Leu Pro Tyr Ala Gin Lys Leu 
435 4 4 o 44S 

Thr 5iX Gly LeU G1U TVr Phe Arg Glu Glu Lsu He Ser Pro Tyr Asn 

450 4 55 460 J 

Leu He Thr Asp Lys Ala Asp Ala Ser Thr Leu Ser Ala Tyr Val Gin 

J - 470 475 480 

Asp l,1u Trp Lys Pro Leu Asp Trp Phe Asn Met Thr Ala Gly Phe Arg 

485 490 r 495 y 

Leu Val His His Gin Glu Phe Gly Thr Axg Met Thr Pro Lys Val Ser 

500 505 sio 

lie Leu Ala Lys Tyr Gly Pro Leu Asn Phe Arg Ala Thr Tyr Ala Asn 

515 520 £25 

Gly Tyr Lys Thr Pro Thr Leu Lys Glu Leu Phe Ala Arg Asn Glu Leu 

Thr Thr Het Gly Ser His Asn Leu Tyr Leu Gly £sn Ala Asp Leu Ly~ 
550 555 560 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Pro 


Gin 


Met 


Ser 


Asp 


Tyr 


Tyc 


Ala 


Leu 


Gly 


Leu 


Glu 


Tyr Asn 


Gin 


Gly 










565 










57 0 










575 




Pro 


He 


Ser 


Phe 
580 


Ser 


Ala 


Thr 


Val 


Tyr 
585 


Asp 


Asn 


Glu 


Leu 


Arg 
590 


Asn 


Leu 


lie 


Ser 


Phe 
595 


Met 


Asp 


He 


Pro 


Thr 
600 


Ser 


Pro 


Glu 


His 


Glu 
605 


Ala 


Gin 


Gly 


lie 


Lys 
610 


Lys 


Thr 


Lys 


Cln 


Tyr 

615 


Ala 


Asn 


He 


Gly 


Lys 
620 


Ala 


Arg 


Ser 


Arg 


Gly 


Leu 




Val 


Leu 


Cys 


Asp 


Ala 


Ser 


He 


Sly 


Trp 


Gly lie 


Lys 


Leu 


625 










630 










635 










640 


Gly Ala 


Gly 


Tyr 


Ser 


Leu 


Vol 


GlU 


Ala 


Lys 


Asn 


Leu 


Gin 


Thr 


Asp 


Glu 










645 










650 










655 




Trp 


Leu 


Glu 


Gly Ala 


Ala 


Arg 


His 


Arg 


Ala 


Asn 


Val 


His 


Ala 


Asp 


Trp 








660 










665 










670 






Val 


Hi s 


Tyr 
675 


Trp 


Gly 


Gin 


Tyr 


Arg 
680 


Leu 


Gly 


Val 


Ser 


Leu 
685 


Phe 


Gly 


Arg 


lie 


Gin 


Ser 


Glu 


Arg 


Tyr 


Tyr 


I.ys 


Anp Gly Asn 


Ala 


Pro 


Asp 


Tyr 


Thr 




690 










695 










700 










Leu 


Trp 


Arg 


Leu 


Ala 


Thr 


Ser 


His 


Arg 


Phe 


Ala 


His 


Phe 


Arg 


Hla 


He 


705 










73 C 










715 










720 


lie 


Leu 


Asp 


Gly Thr 


Leu Gly 


He 


Asp 


*Ksn 


Leu 


Phe 


Asp 


Tyr 


Val 


Asp 










725 










730 










735 




Asp 


Arg 


Pro 


Het 


Gly 


Val 


Asn 


Tyr 


Ala 


Thr 


Val 


Thr 


Pro 


Gly 


Arg 


Thr 








140 










745 










750 




Phe 


Phe 


Ala 


Gin 


He 


Ala 


He 


Arg 


Phe 


Asn 


A/?n 













(2) INFORMATION FOR SEQ ID HO*. 296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

(iir) HYPOTHETICAL: YES 

Ivi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gmgivalis 

I Ix) FEATURE: 

(A) NAME/ KEY: raiscjeature 

(B) LOCATION 1. . .365 



(Xi) SEQUENCE DESCRIPTION: SEQ TD NO: 296 



Cys 


Cys 


Ala 


Arg 


Lsu 


Arg 


Asp 


Thr 


Ser 


Pro 


Arg 


Lys 


Lys 


Ser 


Phe 


Arg 


1 








5 










10 










15 




Asn 


Asp 


Thr 


Val 


Phe 


Pro 


Tyr 


Phe 


Val 


He 


His 


Leu 


He 


Lys 


His 


He 








20 










25 










30 






He 


He 


Met 


Thr 


Asp 


Asn 


Lys 


Gin 


Arg 


Asn 


He 


Val 


Phe 


Pro 


Ala 


Phe 






35 










40 










45 








Leu 


Leu 


Leu 


Leu 


Gly 


Val 


He 


Ala 


Val 


Val 


Thr 


He 


Val 


Gly 


Phe 


Phe 




50 










55 










60 








Met 


Leu 


Arg 


Pro 


Ala 


GlU 


GlU 


He 


He 


Gin 


Gly 


Gin 


He 


Glu 


Val 


Thr 


65 










70 










75 










80 


Glu 


Tyr 


Arg 


Val 


Ser 


Ser 


Lys 


Val 


Pro 


Gly 


Arg 


He 


Lys 


Glu 


Leu 


Arg 










85 










90 










95 




Val 


Ser 


Gl u 


Gly 


Gin 


Gin 


Val 


Gin 


Ala 


Gly 


Asp 


Thr 


Leu 


Ala 


Val 


He 








100 










105 










110 






Glu 


Ala 


Pro 


Asp 


Val 


Ala 


Ala 


Lys 


Met 


Glu 


Gin 


Ala 


Lys 


Ala 


Ala 


Glu 






115 










120 










125 








Ala 


Ala 


Ala 


Gin 


Ala 


Gin 


Asn 


Ala 


Lys 


Ala 


Leu 


Lys 


Gly Ala Arg 


Ser 




130 










135 










140 










Glu 


Gin 


He 


GLn 


Ala 


Ala 


Tyr 


Glu 


Met 


Trp 


Gin 


Lys 


Ala 


Gin 


Ala 


Gly 


145 










150 










155 










160 


Val 


Ala 


He 


Ala 


Thr 


Lys 


Thr 


His 


Gin 


Arg 


Val 


Gin 


Asn 


Leu Tyr 


Asp 










165 










170 










3 75 




Gin 


Gly 


Val 


Val 


Pro 


Ala 


Gin 


Lys 


Leu 


Asp 


Glu 


Ala 


Thr 


Aid 


Gin 


Arg 








180 










195 










190 






Asp Ala 


Ala 


He 


Ala 


Thr 


Gin 


Lys 


Ala 


Ala 


Glu 


Ala 


Gin 


Tyr 


Asn 


Hot 






195 










200 










205 








Ala 


Arg 


Arn 


Gly 


Ala 


Clu 


Arg 


Glu 


Asp 


Lys 


Leu 


Ala 


Ala 


Ser 


Ala 


Leu 




210 










215 






220 
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Val 


Asp 


Arg 


Ala 


Arg 


Gly 


Ala 


Val 


Ala 


Gl u 


Val 


Glu 


Ser 


Tyr lie 


Asn 


225 










230 










235 






240 


Glu 


Tlit 


Tyr 


Leu 


He 


Ala 


Pro 


Arg 


Ala 


Gly 


Glu 


Val 


Ser 


Glu He 


Phe 










245 










750 








255 




Pro 


Lys 


Ala 


Gly 


Glu 


Leu 


Val 


Gly 


Thr 


Gly 


Ala 


Pro 


He 


Met Asn 


He 








260 










265 










270 




Ala 


Glu 


Hfet 


Gly 


Asp 


Met 


Trp 


Ala 


Ser 


Phe 


Ala 


Val 


Arg 


Glu Asp 


Phe 






275 










280 










285 




Leu 


Ser 


Ser 


Met 


Thr 


Met 


Gly Ala 


Vdl 


Leu 


Glu 


Thr 


Val 


Val Pro 


Ala 




290 










295 










300 








Leu 


Asn 


Glu 


Glu 


Lys 


Vol 


Arg 


Phe 


Lys 


He 


Thr 


Phe 


He 


Lys Asn 


Met 


305 










310 










315 






320 


Gly Thr 


Tyr 


Ala 


Ala 


Trp 


Lys 


Ala 


Thr 


Lys 


Thr 


Thr 


Gly Gin Tyr Asp 










325 










330 








335 




Leu 


Lys 


Thr 


Phe 


Glu 


Val 


Lys 


Ala 


Thr 


Leu 


Ala 


Asp 


Lys 


Asp Ly3 


Ala 








34 0 










345 










350 




Gin 


Lys 


Leu 


Axg 


Pro 


Gly 


Met 


Ser 


Val 


He 


lie 


Arg 


Lys 










355 










360 








365 







12 J INFORMATION FOR SEQ ID MO: 297 



\i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 320 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

tii) MOLECULE TYPR: protein 

(ill) HYPOTHETICAL : YES 



(vi) ORIGINAL SOURCE: 

<A} ORGAN I SH; Porphyromonas gxngivalis 

( J X ) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...320 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 2 97 

Pro Gin Ser Ser Pro Asp Arg Arg Ser Phe Gin Asn Val Met Asn Lys 

1 5 io 15 

Tyr His Ser Gin Ser Val Leu Glu Val Gly Lys He Gly He Val He 

20 25 30 

He Phe Ala Pro He Val Arg Asn Val His Gin Gin Pro Pro Phe Leu 

35 40 45 

Ser His Lys Ser He Met Arg He Val Ser Asn Phe Leu Phe Val Ser 

50 55 60 

Phe Ser V«.l Leu Lou Phe Ala Ser Cys Arg Ser Gin Arg Glu Lys Val 
65 70 75 80 

Val Tyr Leu Gin Asp He Gin Thr Phe Asn Arg Glu He He Ala Lys 

B5 90 95 

Pro Tyr Asp Val Lys He Glu T.ys Asp Asp Val Leu Asn He Leu Val 

lOO 105 HO 

Ser Ser Arg Asp Pro Glu Leu Ser Thr Pro Tyr Asn Gin Val Leu Thr 

115 120 125 

Thr Arg Ala Leu Ala Arg Asn Gly Tyr Gly Thr Asn Ser Asn Glu Gly 

130 135 140 

Phe Leu Val Asp Ser Lys Gly Tyr He Asn Tyr Pro lie Leu Gly Gin 
145 150 15 5 160 

He Tyr Val Glu Gly Leu Thr Arg Thr Glu Leu Glu Lys Glu He Gin 

165 170 175 

Lys Arg He He Ser Ser Gly Phe He Lys Asp Pro Thr Val Thr Val 

100 , 185 190 

Gin Leu Gin Asn Phe Lys Val Ser Val Leu Gly Glu Val Asn His Pro 

195 200 205 

Gly Ser Met Ser Val Lys Gly Glu Arg He Thr Leu Leu Glu Ala He 

210 215 220 

Gly MeL Ala Gly Asp Leu Thr He Tyr Gly Arg Arg Asp Arg Val Phe 
225 230 235 240 

Val He Arg Glu Thr Asp Gly His Arg Glu Val Pne Gin Thr Asp Leu 

245 250 255 

Arg Lys Ala Asp Leu Leu Ala Ser Pro Val Tyr Tyr Leu His Gin Asn 

260 265 " 270 

Asp Val He Tyr Val Glu Pro Asn Asp Lys Lys Thr Gin Met Ser Glu 
275 - 280 285 
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He Am G] n Asn Asn Asn Val Asn Val Trp Leu Ser Val Thr Ser Thr 

290 295 300 

Leu Val Ser lis Ser Thr Leu Thr He Thr lie lie Asp Lys Thi Lys 
305 310 315 320 

5 

(2) INFORMATION FOR SEQ TD 110:290 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 
10 (B) TYPE: amino acid 

(DJ TOPOLOGY: linear 

ItW MOLECULE TYPE: protein 

15 <iii> HYPOTHETICAL: YES 

[vi) ORIGINAL SOURCE: 

{A) ORGANISM: Forphyromonas gingival 

20 (ix) FEATURE: 

(A) NAI IE /KEY : misc_ feature 

(B ) LOCATION 1. . . 502 
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{Xl) SEQUENCE DESCRIPTION: SEQ ID 11C:29P 



Ser Lys 


He Val 


J.CHI 


Arg 


Lys 


Phe 


Cys 


Thr 


Leu 


ALa 


Arg 


Met 


Lys 


l.y- 


1 




5 










10 










15 




Thr Ai?n 


Leu Phe 


Leu 


Ser 


Leu 


Leu 


Val 


Il« 


Phe 


He 


Thr 


Gly 


Ser 


Phe 




20 










25 










30 






Met Thr 


Ala cys 


Ala 


Gin 


Lys 


Ser 


Lys 


Thr 


Asn 


Lys 


Leu 


Thr 


Glu 


Glu 




35 








40 










45 








Asp Arg 


Ser Arg 


Asn 


Glu 


Tyr 


Val 


Gin 


Ser 


Met 


Asp 


Val 


Leu 


Ser 


Asn 


50 








55 










60 










He He 


Gly Ann 


Val 


Arg 


Leu 


Tyr 


Phe 


Val 


Aep 


Thr 


He 


Scr 


He 


Lys 


65 






70 










75 










80 


His Met 


Thr Arg 


Arg 


Gly 


lie 


Asp 


Ala 


I1et 


Leu 


Gly 


Gly 


Leu 


Asp 


Pro 






85 










90 










95 




Tyr Thr Glu Tyr 


He 


Pro 


Tyr 


Glu 


Glu 


Met 


Asp 


Glu 


Leu 


Lys 


Leu 


Met 




100 










105 










110 






Thr Thr Gly Glu 


Tyr Ala 


Gly 


Val 


Gly 


Ala 


He 


He 


Ser 


Gin 


Arg 


Pro 




115 








120 










125 








Asp Ser 


Ala Val 


He 


He 


Gin 


Arg 


Pro 


Met 


Glu 


Gly 


Met 


Pro 


Ala 


Asp 


130 








135 










140 










Glu Ala 


Gly Leu 


He 


Ala 


Gly 


Asp 


Arg 


He 


Leu 


Thr 


He 


A-fcp 


Gly 


Lye 


145 






150 










155 










160 


Asp Phc 


Arg Lys 


Ser 


Thr 


Thr 


Pro 


Lys 


Val 


Scr 


Gin 


Ala 


Leu 


Lys 


Gly 






165 










170 










175 




He Ala 


Gly Thr 


Val 


Ma 


Lys 


Val 


Thr 


Val 


Met 


Arg 


Tyr Gly 


Glu 


Thr 




100 










185 










190 






Lys Pro 


Arg Thr 


Phe 


Ser 


Val 


Lys 


Arg 


Gin 


Lys 


Val 


He 


Met 


Asn 


Ser 




195 








200 










205 








Val Thr 


Tyr Ser 


Gly Met 


Leu 


Asp 


Gly 


Ser 


He 


Gly 


Tyr 


He Arg 


Leu 


210 








215 










220 










Asn Asn 


Phe Thr 


Asp 


Lys 


Ser 


Ala 


Glu 


Glu 


Val 


Arg 


Thr 


Ala 


Leu 


Leu 


225 






230 










235 










240 


Asp Leu Arg Asp 


Lys 


Gin 


Gly 


Ala 


Lys 


Gly 


Leu 


He 


Leu 


Asp 


Leu 


Arg 






215 










250 










255 




Gly Asn 


Gly Gly 


Gly 


Leu 


Met 


Gin 


Ala 


Ala 


He 


Glu 


He 


Val 


Asn 


Leu 




260 










265 










270 






Phe Val 


Pro Lys 


Gly 


Lys 


Glu 


Val 


Val 


Thr 


Thr 


Lys 


Gly 


Arg 


He 


Ala 




27 5 








280 










205 








Glu Sec 


Ala Ser 


Val 


Phe 


Arg 


Thr 


Leu 


Thr 


Glu 


Pro 


He 


Asp 


Thr 


Lys 


290 








295 










300 








Leu Pro 


He Val 


Val 


Leu 


He 


Asp 


Gly 


Gin 


Ser 


Ala 


Ser 


Ser 


Ser 


Glu 


305 






310 








315 










320 


He Val 


Ala Gly 


Ala 


Leu 


Gin 


Asp 


Met: 


Asp 


Arg 


Ala 


Val 


Leu 


Met 


Gly 






325 










330 










335 


Gin Lys 


Ser Tyr 


Gly 


Lys 


Gly 


Leu 


Val 


Gin 


Thr 


Thr 


Arg 


Gin 


Leu 


Pro 




340 










345 










350 






Tyr Asn 


Gly Val 


He 


Ly3 


Leu 


Thr 


Thr 


Ala 




Tyr 


Tyr 


He 


Pro 


Ser 




355 








360 










365 








Gly Arq 


Cys He 


Gin 


Arg 


Leu 


Asp 


Tyr 


Ser 


Arg 


Thr 


Asn Arg Thr Gly 


370 








375 










380 










Met Ala 


Thr Ala 


He 


Pro 


Asp 


Ser 


Leu 


His 


Lys 


He 


Phe 


Tyr 


Thr 


Ala 


385 






390 










395 










400 
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Ala 


Gly 


Ary 


Arg 


Val 
405 


Glu 


Asp 


Ala 


Gly 


Gly 
410 


He 


Leu 


Pro 


Asp 


Il« 
415 


Glu 


Val 


Lys 


Gin 


Asp 


Thr 


Ala 


Ala 


Thr 


Leu 


Leu 


Tyr 


Tyr Met 


Ala 


He 


Asn 








420 










425 










4 30 






Asn 


Asp 


Val 
435 


Phe 


Atp 


Phe 


Val 


Thr 
440 


Gly 


Tyr 


Val 


Leu 


Lys 
445 


His 


Lys 


Thr 


He 


Ala 


Lys 


Pro 


Glu 


Asp 


Phe 


Ser 


He 


Thr 


Asn 


Glu Asp 


Tyr 


Ala 


Ala 




450 










455 










460 










Phe 


Cys 


Lys 


Met 


Met 


Gl u 


Glu 


Lys 


Lys 


fhe 


Asp 


Tyr 


Asp 


Arg 


Gin 


Ser 


465 










470 










475 










480 


Gly 


Lys 


(let 


Leu 


Asp 
485 


Lys 


Leu 


Glu 


Gin 


Leu 
490 


Ala 


Lys 


He 


Glu 


Gly 
495 


Tyr 


Leu 


Pro 


Glu 


Ala 
500 


Asn 


Ser 


Glu 


Leu 


Lys 
505 


Ala 


Leu 


Arg 


Glu 


Lys 
510 


Leu 


Lys 


Pro 


Asn 


Leu 


Ser 


Arg 


Asp 


Leu 


Leu Arg 


Phe 


Lys 


Lys 


Glu 


lie 


Thr 


Asn 






515 










520 










525 








Tyr 


Leu 


As n 


Aim 


Glu 


lie 


Val 


Thr Arg 


Tyr 


Tyr 


Vyr 


Glu 


Arg 


Gly 


Ser 




530 










535 










540 










He 


Arg 


Gin 


Ser 


Leu 


Pro 


Glu 


Asp 


Lys 


Val 


Val 


Lys 


Glu 


Al a 


He 


Lys 


545 










550 










555 










560 


Leu 


Leu 


Lys 


Asp 


His 
565 


Pro 


Glu 


Gin 


He 


Arg 
570 


Gin 


He 


Leu 


Ala 


Ala 
575 


Pro 


Lys 


Ala 




Asn 
590 


Lys 


Gly 























(2) III FORI 1ATION FOR SEQ ID HO: 299 



til SEQUF.1I-E CHARACTERISTICS: 

(A) LENGTH: 985 amino acids 
30 <B) TYPE: amino acid 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 

35 (HI) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonas gingiva lis 

40 tlx) FEATURE: 

(A> NAME/ KEY : misc_f eature 
(B> LOCATION 1...985 



<kU SEQUENCE DESCRIPTION: SEQ ID NO:299 



Val 


Thr Asp Lys 


Met 


Gin 


Asn 


Lys 


Gly 


Phe 


Val 


He 


Val 


He 


Thr 


Ser 


1 








5 










10 










15 




Ala 


Leu 


Ala 


lie 


He 


Cys 


Ala 


Phe 


Tyr 


Leu 


Ser 


Phe 


Ser 


Phe 


Val 


Thr 








20 










25 










30 






Asn 


Arq 


Tyr 


Glu 


Lys 


Lys 


Ala 


Lys 


Ala 


Het 


Gly 


Asp 


Val 


Ala 


Gly Met 






35 










40 










4S 








Ala 


Tyr 


Leu 


Asp 


Ser 


Met 


Ser 


Asn 


Glu 


Lys 


Val 


Trp 


Phe 


Gly Tyr 


Thr 




50 










55 










60 










Leu 


Lys 


Glu 


Ala 


Gin 


Ala 


Gin 


Gin 


He 


Gly 


Leu 


Gly 


Leu 


Asp 


Leu 


Lys 


€5 










70 










75 










80 


Gly Gly 


KFet 


Asn 


Val 


He 


Leu 


Lys 


Leu 


Asn 


Ala 


Ser 


Asp 


Leu 


Leu 


Arg 










85 










90 










95 




Asn 


Leu 


Set 


Asn 


Lys 


Ser 


Lsu 


Asp 


Pro 


Asn 


Phe 


Asn 


Lys 


Ala 


Leu 


Glu 








100 










105 










110 






Azu 


Ala 


Ala 


Lys 


Ser 


Thr 


Glu 


Gin 


Ser 


Asp 


Phe 


He 


Asp 


He 


Phe 


Val 






115 










120 










125 








Lys 


Glu 


Tyr 


Arg 


I.ys 


Leu 


Asp 


Pro 


Asn 


Gly 


Arg 


Leu 


Ala 


Val 


He 


Phe 




130 










135 










140 










Gly 


Ser 


Gly 


Asp 


Leu 


Arg 


Asp 


Gin 


He 


Thr 


Ala 


Lys 


Ser 


Thr 


Asp 


Ala 


145 










ISO 










155 










160 


Asp 


Val 


Val 


Arg 


Leu 


Leu 


Lys 


Glu 


Lys 


Tyr 


Asn 


Ser 


Ala 


Val 


Glu 


Ala 










165 










17 0 










175 




Ser 


?he 


Asn 


Val 


Leu 


Arg 


Al a 


Arg 


He 


Asp 


Ala 


Phe 


Gly Val 


Val 


Ala 








180 










185 










190 






Pro 


Asn 


Levi 


Gin 


Arg 


Leu 


Glu 


Gly 


Gin 


Gly 


Arg 


He 


Leu 


Val 


Glu 


Leu 






195 










200 










2C5 








Pro 


Gly Val 


Lys 


A.-p 


Pro 


Glu 


Arg 


Val 


Arg 


Thr 


Leu 


Leu 


Gin 


Arg 


Ser 




210 










215 










220 










Ala 


Asn 


Leu 


Gin 


Phe 


Trp 


Arg 


Thr 


Tyr 


Lys 


Phe 


Glu 


Glu 


Val 


Ser 


Gly 


225 










230 










2 35 










240 
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Arp 


Leu 


T l- 


Ala 


Ala 


Asn 


Asp 


Arg 










245 








Thr 


Asp 


Ala 


Thr 


Pro 


Glu 


Thr 


GUi 








260 










Ala 


Thr 


Ala 


Asp 


Ser 


Ala 


Ala 


Val 






275 










280 


Val 


Ala 


Gin 


Lys 


Glu 


Ala 


Lys 


Asp 




290 










295 




Ser 


Leu 


Leu 


Thr 


Pro 


Val 


Asn 


Arg 


305 










310 






Arg 


Arg 


Ala 


Am 


Met 


Ala 


Gin 


He 










325 








Asp 


Leu 


Lys 


Val 


Thr 


Arg 


Glu 


Asp 








3-10 










Ala 


lie 


Glu 


Asp 


Pro 


Glu 


Thr 


Lys 






355 










360 


Tyr 


Ma 


lie 


Arq 


Thr 


Asn 


Arg 


Thr 




370 










375 




Val 


Val 


Thr 


Ser 


Ala 


Lys 


Ser 


Asp 


385 










390 






Glu 


Pro 


lie 


Val 


Ser 


Met 


Thr 


list 










405 








Ala 


Arg 


He 


Thr 


Lys 


Asp Asn 


Val 








420 










Asp 


Gly 


Val 


Val 


Tyr 


Ser 


Ala 


Pro 






435 










440 


Gl y 


Arg 


Set 


Gin 


He 


Ser 


Gly 


His 




4 50 










455 




Leu 


Ala 


Asn 


Val 


Leu 


Asn 


Ser 


Gly 


4 65 










470 




Glu 


Gin 


Glu 


Asn 


Val 


He 


Gly 


Pro 










485 








Ala 


Gly 


Phe 


Leu 


Ser 


Phe 


Leu 


Leu 








500 










Met 


Cys 


Leu 


Ala 


Tyr 


Gly 


Phe 


Leu 






515 










520 


Leu 


lie 


Val 


Asn 


Ser 


Phe 


Phe 


Thr 




530 










535 




Ala 


Val 


Leu 


Thr 


Leu 


Ser 


Gly 


He 


54 5 










550 






Met 


Ala 


Val 


Asp 


Ala 


Asn 


Val 


Leu 










565 








Leu 


Arg 


Ala 


Gly 


Lys 


Thr 


Pro 


He 








580 










Asn 


Ala 


Phe 


Ser 


Ala 


He 


Phe 


Asp 






595 










600 


Gly 


He 


He 


Leu 


Phe 


Leu 


Tyr 


Gly 




610 










615 




Thr 


Thr 


Leu 


He 


He 


Gly 


Leu 


He 


625 










630 






Leu 


Thr 


Ary 


He 


Val 


Phe 


Glu 


Lys 










645 








L^s 


He 


Thr 


Phe 


Thr 


Thr 


Ser 


lie 








660 










Ser 


Tyr 


Asn 


He 


Leu 


Gly 


Lys 


Arg 






675 










680 


He 


He 


He 


Val 


Leu 


Gly 


Leu 


H e 




690 








695 




Arg 


Gly 


He 


Glu 


Phe 


Ser 


Gly Gly 


7 0b 










710 






Gin 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Val 










725 








Gin 


Glu 


Lys 


Val 


Leu 


Val 


Thr 


Ser 








740 










Arg 


He 


Ser 


Thr 


Asn 


Tyr Lys 


He 






755 










760 


Ala 


Glu 


He 


Thr 


Asp 


Lys 


Leu 


Tyr 




770 










775 




Glu 


Gin 


Pro 


Thr 


Ala 


Asp 


Gin 


Phe 


785 










790 






Lys 


Val 


Ser 


Pro 


Ser 


Met 


Ser 


Ser 










005 








Ala 


Val 


Leu 


Leu 


Ser 


Met. 


He 


Phe 








820 










Phe 


Arg 


Asp 




Ser 


Phe 


Ser 


Ala 



218/490 



Leu 


Ser 


Glu 


Leu 


Ala 


Met 


Asn 


Asn 




250 










255 




Pro 


Ala 


Thr 


Thr 


Asp 


Ser 


Val 


Ala 


2 65 










270 






Gin 


Ala 


Val 


Ali 


Asp 


Ser 


Ala 


Thr 










285 








Ala 


Thr 


Arg 


Lys 


Asp 


Ala 


Leu 


Phe 








300 










Gly Clo- 


Ala 


Val 


Val 


Gly Val 


Ala 






315 










320 


ser 


Glu 


Ilet 


Leu 


Gin 


Gin 


Ala 


His 




330 










335 




Val 


I.CII 


Fhe 


Leu 


Trp 


Gly Ala 


Lys 


34 5 










350 






Lys 


Glu 


Thr 


Asp 


Leu 


Tyr 


Glu 


l,cu 










365 








Gly Asp 


Pro 


A-p 


Leu 


Gly 


Gly 


Asp 








380 










He 


Gin 


Asn 


Asp 


Phe 


Gly Arg 


Ser 






395 










400 


Asn 


Glu 


Glu 


Gly 


Ala 


Arg 


Lys 


Trp 




410 










415 




Gly Arg 


Ala 


lie 


Ala 


He 


Val 


Leu 


4 25 










430 






Asn 


Val 


Acn 


Acp 


GlU 


He 


Thr 


Gly 










445 








Phe 


Thr 


Val 


Glu 


Glu 


Ala 


Gly 


Asp 








460 










Lys 


Met 


Afp 


Ala 


Thr 


Val 


Ser 


He 






475 










480 


Thr 


Leu 


Gly 


Ala 


Glu 


Ser 


He 


Lys 




490 










4 95 




Ala 


Leu 


Val 


He 


Leu 


Met 


Cys 


Tyr 


505 










510 






Pro 


Glj,- 


Leu 


He 


Ala 


Asn 


Gly 


Ala 










525 








Leu 


Gly 


Val 


Leu 


Ala 


Ser 


Phe 


His 








540 










Ala 


Gly 


Leu 


Val 


Leu 


Thr 


Leu 


Gly 






555 










560 


He 


Phe 


Glu 


Arg 


lie 


Lys 


Glu 


Glu 




570 










575 




Arg 


Ala 


Val 


Thr 


Asp 


Gly Tyr 


Gly 


535 










590 






Ser 


Asn 


Val 


Thr 


Thr 


He 


He 


Thr 










605 








Thx 


Glv 


Pro 


He 


Arg 


Gly 


Phe 


Ala 








620 










Ala 


Ser 


Phe 


He 


Thr 


Ala 


Val 


Phe 






635 










640 


Leu 


Ala 


Lys 


Lys 


Gly 


Arg 


Leu 


Asp 




650 










655 




Thr 


Arq 


Asn 


L-u 


Leu 


Val 


Asn 


Pro 


665 










67 0 






Lys 


Thr 


Gly 


Phe 


Tie 


He 


Pro 


Val 










685 








Ala 


Ser 


Phe 


Thr 


He 


Gly Leu 


Asn 








700 










Arg Asn 


Tyr 


v*i 


Val 


Lys 


Phe 


Asp 






715 










720 


Arg 


Sex 


Ala 


Leu 


Ser 


Ser 


Pro 


Leu 




730 










7 35 




He 


Gly 


Thr 


Glu 


Gly Thr 


Glu 


Val 


745 










750 






Gin 


Glu 


Glu 


Ser 


Glu 


Glu 


Thr 


GlU 










765 








Gin 


Ser 


Leu 


Lys 


Gly 


Phe Tyr Thr 








780 










Leu 


Asp 


Asn 


lie. 


-He 


Ser 


Ser 


Gin 






7 95 










800 


Asp 


He 


Thr 


Arg 


Gly Ala 


He 


Trp 




810 










815 




Mot 


Ala 


He 


Tyr 


He 


Leu 


He 


Arg 


825 










S30 




Gly 


Val 


Fhe 


Val 


Ser 


Val 


Ala 


Ala 
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835 










840 










845 








Thr 


Thr 
850 


Phe 


Cye 


He 


He 


Ala 
855 


Leu 


Tyr 


Ala 


Leu 


Leu 
960 


Trp 


Lys 


He 


Leu 


Pro 


Phe 


Thr 


Met 


Glu 


He 


Asp 


Gin 


Aftn 


Phe 


He 


Ala 


Ala 


He 


Leu 


Ala 


865 










870 










875 










980 


He 


lie 


Gly Tyr 


Ser 


Leu 


Asn 


Asp 


Thr 


Val 


Val 


Val 


Phe 


Asp 


Arg 


He 










885 










890 










895 




Arg 


Glu 


Thr 


Met 
900 


Lys 


Leu 


Tyr 


Pro 


Asn 
90S 


Arg 


Asp 


Arg 


Tyr 


Gin 
910 


Val 


lie 


Asn 


Asp 


Ala 


Leu 


Asn 


Ser 


Thr 


Leu 


Gly Arg 


Thr 


Leu 


Asn 


Thr 


Ser 


Leu 






915 










920 










925 








Thr 


Thr 


Phe 


He 


Val 


Het 


Leu 


Val 


He 


Phe 


He 


Phe 


Gly Gly Ala 


Thr 




930 










935 










940 










Met 


Arg 


Ser 


Phe 


Thr 


Phe 


Ser 


He 


Leu 


Leu 


Gly 


He 


Val 


He 


Gly 


Thr 


945 










950 










955 








960 


Tyr 


Ser 


Thr 


Leu 


Phe 
965 


Val 


Ala 


Thr 


Pro 


Leu 
97 0 


Ala 


Tyr 


Glu 


He 


Gin 
97 5 


Lys 


Arg 


Lys 


Leu 


Asn 
980 


Lys 


Ala 


Ala 


Lys 


Lys 
985 

















(2> INFORMATION FOR SEQ ID NO: 300 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 6 amino acids 
25 (B) TYPE: amino acid 

(DJ TOPOLOGY: linear 

(11) MOLECULE TYPE; protein 

30 (lii) HYPOTHETICAL: YES 

(Vi| ORIGINAL SOURCE: 

(A) ORGANISM: Forphyromonas ginglvalis 

35 (ix) FEATURE: 

(A) NAME /KEY: mtnc_f eature 

(B) LOCATION 1...1046 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 



Asp 


Phe 


Pro 


Trp 


Val 


Arg 


Val 


Lys 


Pro 


Glu 


Lys 


Lys 


Arg 


Lys 


Gin 


His 


1 








5 










10 










15 




Asn 


Ser 


Asn 


Asn 


Phe 


l.ys 


Phe 


Asn 


Ala 


Lys 


Glu 


Lys 


Ser 


Het 


Lys 


Arg 








20 










25 










30 






Met 


Thr 


I.SU 


Phe 


Phe 


Leu 


Cys 


Leu 


Leu 


Thr 


ser 


He 


Gly Trp Ala 


Met 






35 










40 










45 








Ala 


Gin 


Asn 


Arg 


Thr 


Val 


Ly3 


Gly 


Thr 


Val 


lis 




Ser 


Glu 


A.*?p 


Asn 




50 










55 










60 










Glu 


Pro 


Leu 


Tie 


Gly 


Ala 


Asn 


Val 


Val 


Val 


Val 


Gly 


Asn 


Thr 


Thr 


He 


65 










70 










75 










80 


Gly Ala 


Ala 


Thr 


Asp 


Leu 


Asp 


Gly 


Asn 


Phe 


Thr 


Leu 


Ser 


Val 


Pro 


Ala 










as 










90 










95 




Asn 


Ala 


Lys 


Met 


Leu 


Arg 


Val 


Ser 


Tyr 


Ser 


Gly 


Met 


Thr 


Thr 


Lys 


Glu 








100 










105 










110 




Val 


Ala 


He 


Ala 


Asn 


Val 


Met 


Lys 


He 


Val 


Leu 


Asp 


Pro 


Asp 


Ser 


Lys 






115 










120 










125 








Val 


Leu 


Glu 


Gin 


Val 


Val 


Val 


Leu 


Gly 


Tyr 


Gly 


Thr 


Gly Gin Lys 


Leu 




130 










135 










140 










Ser 


Thr 


Val 


Ser 


Gly 


Ser 


Val 


Ala 


Lys 


Val 


Ser 


Ser 


Glu 


Lys 


Leu 


Ala 


145 










150 










155 










160 


Glu 


Lys 


Pro 


Val 


Ala 


Asn 


He 


Met 


Asp 


Ala 


Leu 


Gin 


Gly Gin Val 


Ala 










165 










170 










17S 




Gly 


Met 


Gin 


Val 


Met 


Thr 


Thr 


Ser 


Gly 


Asp 


Pro 


Thr 


Ala 


Val 


Ala 


Ser 








180 










1B5 










190 






Val 


Glu 


He 


Hie 


Gly 


Thr 


Gly 


Ser 


Leu 


Gly 


Ala 


Ser 


Ser 


Ala 


Pro 


Leu 






195 










200 








205 








Tyr 


lie 


Val 


Asp 


Gly 


ttet 


Gin 


Thr 


Ser 


T.«u 


Asp 


Val 


Val 


Ale 


Thr 


Hat 




210 










215 










220 










Asn 


Pro 


Asn 


Asp 


Phe 


Glu 


Ser 


Met 


Ser 


Val 


Lau 




Asp 


Ala 


Ser 


Ala 


225 










230 










235 










240 


Thr 


Ser 


He 


Tyr 


Gly 


Ala 


Arg 


Ala 


Ala 


Asn 


Gly 


Val 


Val 


Phe 


He 


Gin 










245 










250 










255 




Thr 


Lys 


Lys 


Gly 


Lys 


Mot 


Ser 


Glu 


Arg 


Gly 


Arg 


lis 


Thr 


Phe 


Asn 


Ala 








260 










265 










270 






Ser 


Tyr 


Gly 


He 


Ser 


Gin 


He 


Leu 


Asn 


Thr 


Lys 


Pro 


Leu 


Aip Asn 


Met- 
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275 










280 


Met 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Asp 




2 20 










2 95 




Gly 


Asn 


Asn 


Gin 


Thr 


Val 


Gin 


L s 


305 










310 






Ala 


Glu 


Asp 


Leu 


Tyr 


Gly Asn 


Tyr 










325. 








Lys 


Thr 


Leu 


Phe 


Pro 


Val 


Asp 


Phe 








340 








Ala 


Leu 


Phe 


Lys 


Thr 


Ala 


Pro 


Thr 






355 










360 


Gly 


Gly 


Ser 


Gin 


Gly 


Thr 


Ser 


Tyr 




3/0 










375 


Gin 


Glu 


Gly Met 


Ala 


Arg 


Glu 


Pro 


385 










390 






Ary 


Leu 


7\cn 


Phe 


Glu 


Ser 


Arg 


He 










405 








Asn 


Leu 


Ser 


Gly Ala 


He 


Ala 


A<*n 








4 20 










Lye 


Tyr 


ryr 


Met 




Ser 


Gly 


Thr 






435 










44 0 


Tyr 


Tyr 


Asn 


Pro 


Phe 


Asp Val 


Asn 




•150 










455 




Met 


Tyr 


Gl" 


Ala 


Thr 


Arg 


Pro 


Ser 


465 










■370 






Wet 


A 


Pro 


Phe 


Ser 
4 85 


Ser 


Glu 


Ser 


Al a 


Gin 


He 


Thr 


Pro 


He 


Lys 










500 










Val 


~- 1 


He 


Thr 


Asn 


Thr 


Arg 


Thr 






515 










520 


Pro 


Tyr 


Asp 


Ser 


Thr 


Pro 


Leu 


Gly 




530 










535 




Asp 


Val 


Ser 


Lys 


Ser 


Ph« 


Thr 


Asn 


54S 










550 






Asp 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 










565 








Tyr 


Glu 


Gly 


Asp 


Val 


He 


Gly 


Ala 








580 










Lys 


T.QH 


Met 


Lsu 


Lsu 


Ser 


Gin 


Gly 






595 










600 


Pro 


Glu 


His 


Arg 


Val 


Ala 


Glu 


Tyr 




€10 










615 




Phe 


Asn 


Tyr 


Gly 


Phe 


Asp Lys 


Trp 


625 










630 






Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 










645 








Tyr 


Ser 


Val 


Gly 


Gly 


(let 


Phe 


Asp 








660 










Ser 


Asn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 






675 










680 


Gly 


Asn 


Ser 


Glu 


He 


Gly Asn 


Tyr 




6S0 










695 




Asn 


Asn 


Tyr 


Thr 


Glu 


Arp Ala 


Mst 


705 










710 






Asn 


Pro 


Asp 


Leu 


Ssr 


Trp 


Glu 


Lys 










725 






Ala 


Aid 


Gly 


Al a 


Phe 


Asn 


Asn 


Arg 








740 








Val 


Arq 


Thr 


Thr 


Asn 


Acp 


Met 


Leu 






755 










7 60 


Ser 


Gly 


Phe 


Phe 


Ser 


Gin 


Tyr 


Gin 




77 0 










775 




Gly 


Val 


Asp 


Leu 


Ser 


Leu 


Lys 


Gly 


78S 










7 90 






Ai*n 


Val 


Tyr 


Ala 


Ser 


Ala 


Asn 


Phe 










805 








Lys 


Leu 


Phe 


Phe 


Gly 


Lsu 


A-^n 


Lys 








820 










lie 


Trp 


Glu 


Tl- 


Gly 


Tyr 


Pro 


Asn 






835 










840 


Gly 


He 


Asp 


Lys 


Lys 


Thr 


Gly Lys 




850 










855 




Val 


Asp 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


865 










87 0 
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2B5 



Phe 


Gin 


Val 


Lys 


Ala 


Gly 


Phe 


Trp 








300 










Val 


Lys 


Asp 


H^t 


He 


Leu 


Ala 


Gly 






315 










320 


Asp 


Ser 


Leu 


Lys 


Asp 


Glu 


Tyr 


Gly 




330 










335 




Asn 


Hlr 


Asp 


Ala 


Asp 


Trp 


Leu 


Lys 


345 










350 






Ser 


Gin 


Gly 


Asp 


He 


Ser 


Phe 


Ser 










365 








Tyr 


Ala 


Ser 


He 


Gly Tyr 


Phe 


Asp 








380 










Ala 


Asn 


Phe 


Lys 


Arg 


Tyr 


Ssr 


Gly 






3 95 










400 


Asn 


Glu 


Trp 


r.cu 


Lys 


Val 


Gly 


Ala 




410 










415 




Arg 


Arg 


Ser 


Ala 


Asp 


Tyr 


Phe 


Gly 


425 










430 






Ph~ 


Gl\- 


Vai 


Leu 


Thr 


Met 


Pro Arg 










445 








GL v 


Asp 


Leu 


Ala 


Asp 


Val 


Tyr 


ryr 








460 










I l^t 


Thr 


Gl u 


Pro 


Tyr 


Phe 


Ala 


Lys 






475 










480 


Ills 


Gin 


Ala 


Asn 


Val 


Aen 


Gly 


Phe 




4 90 










495 




Leu 


Th r 


Leu 


Lvs 


Ala 


Gin 


Ala 


Gly 


505 










510 






Ser 


Ser 


L"S 


Ara 


Met 


Pro 


Asn 


Asn 










525 








Glu 


Arg 


Ara 


Glu 


Arq 


Ala 


Tyr 


Arg 








540 










Thr 


Ala 


Glu 


Tyr 


Lys 


Phe 


Ser 


He 






555 










560 


Lsu 


Met 


Gly 


His 


Glu 


Tyr 


He 


Glu 




570 










575 




Ser 


Ser 


Lys 


Gly 


Phe* 


Glu 


Ser 


Asp 


505 










590 






Lys 


Thr 


Gly 


Asn 


Ssr 


Leu 


Ser 


Leu 










605 








Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arq 








620 










Met 


Tyr 


He 


Asp 


Phe 


Ser 


Val 


Arg 






635 










640 


Ser 


Asn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 




650 










655 




Il~ 


Tyr 


Asn 


Lys 


Phe 


He 


Gin 


Glu 


665 










670 






Leu 


Lys 


Met 


Ser 


Tyr Gly Thr 


Thr 










685 








Asn 


His 


Gin 


Ala 


Leu 


Val 


Thr 


Val 








700 










Gly 


Leu 


Ser 


He 


Ser 


Thr 


Ala 


Gly 






715 










7?0 


Cln 


Ser 


Gin 


Phe 


Asn 


Phe 


Gly 


Leu 




730 










735 




Leu 


Ser 


Ala 


Glu 


Val 


Asp 


Phe 


Tyr 


74 5 










750 






He 


Asp 


Val 


Pro 


Met 


Pro 


Tyr 


He 










765 








Asn 


Val 


Gly 


Ser 


Met 


hys 


Asn 


Thr 








7B0 










Thr 


He 


Tyr 


Gin 


Asn 


Lys 


Asp 


Trp 






7 95 










800 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 




810 










815 




Tyr 


Mel 


Lsu 


Pro 


Asn 


Thr 


Gly Thr 


625 










830 






Ser 


Phe 


Tyr 


list 


Ala 


Glu 


Tyr 


Ai*. 










845 








Gin 


Leu 


Trp 


Tyr 


Val 


Pro Gly Gin 








860 










Thr 


Thr 


Ser 


Gin 


Tyr 


Ser 


Ala 


Asp 






875 










880 
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25 



30 



45 



50 



55 



60 



65 



70 



75 



Leu 


Glu 


Thr 


Arq 


lie 


Asp 


Lys Ser Val 


Thr 


Pro 


Pro 


He 


Thr 


Gly 


Gl> 










885 






890 










995 




Phe 


Ser 


Leu 


Gly Ala 


Ser 


Trp Lys Gly 


Leu 


Ser 


Leu 


Asp Ala 


Asp 


Phe 








900 






905 










910 






Ala 


Tyr 


He 


Val 


Gly 


Lys 


Trp Het He 


Asn 


Asn 


Asp 


Arg 


Tyr 


Phe 


Thr 






915 








920 








925 








GlU 


Asn 


Ala 


Gly 


Gly 


Leu 


Het Gin Leu 


Asn 


Lys 


Asp 


Lys 


Met 


Leu 


Leu 




930 










935 






940 










Asn 


Ala 


Trp 


Thr 


Glu 


Asp 


Asn Lys Glu 


Thr 


Asp 


Val 


Pro 


Lys 


Leu 


Gly 


945 










950 






955 










960 


Gin 


Ser 


Pro 


Gin 


Phe 


Asp 


Thr His Leu 


Leu 


Glu 


Asa 


Ala 


Ser 


Phe 


Leu 










965 






970 










975 




Aig 


Leu 


Lys 


Am 


Leu 


Lys 


Leu Thr Tyr 


Val 


Leu 


Pro 


Asn 


Ser 


Leu 


Phe 








980 






98 b 










990 






Ala 


Gly 


Gin 


Asn 


Val 


lie 


Gly Gly Ala Arq 


Val 


Tyr 


Leu 


Het 


Ala 


Arg 














1000 








1005 






Asn 


Leu 


Leu 


Thr 


Val 


Thr 


Lys Tyr Lys 


Gly 


Phe 


fKrp 


Pro 


Glu 


Ala 


Gly 




1010 








1015 






1020 






Gly 


Asm 


Val 


Gly 


Lys 


Asn 


Gin Tyr Pro 


Asn 


Ser 


Lys 


Gin 


Tyr 


Vsl 


Ala 


1025 








1030 




103S 








i 


Gly 


lie 


Gin 


T.ftu 


Ser 


Phe 



















10 



15 



20 

Gly He Gin T.ftu 

104 5 

(2) 7 H FORMATION FOR SEQ ID J JO: 301 

(i) SF.QUEUCE CHARACTERISTICS: 

{A) LE1JGTH: 869 amino acids 
( B ) TYPE: amino acid 
<D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

35 (vi) ORIGINAL SOURCE: 

tA> ORGANISM: Porphyromonas ginglvalis 

t ix J FEATURE : 

<A) HAKE /KEY : mlsc_f eature 
40 IB) LOCATION 1...BS9 

Ul) SEQUFNCE DESCRIPTION; SEQ ID MO: 301 



Trp 


His 


Arg 


Asn 


He 


Phe 


He 


Phe 


Ala 


Ser 


Thr 


Phe 


Ser 


Pro 


Lys 


Asn 


1 








5 










10 










15 




Met 


Leu 


Pro 


Leu 


Pro 


Tyr 


Arg 


Tyr 


Ala 


Lys 


Thr 


G1 u 


His 


Leu 


Phe 


Leu 








20 










25 










30 






Ala 


Lys 


Gly 


Tyr 


Cys 


Lys 


Asn 


Pro 


He 


Thr 


Asn 


He 


He 


He 


Leu 


Phe 






35 










40 










45 








Het 


Lys 


Lys 


l.yr 


Asn 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Phe 


Val 


Ala 


Leu 


Leu 




50 










55 










60 










Thr 


Phe 


He 


Gly 


Ser 


Met 


Gin 


Ala 


Gin 


Gin 


Ala 


Lys 


Asp Tyr 


Phe 


Asn 


65 










70 










75 










80 


Phe 


Asp 


Glu 


Arg 


Gly 


Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lys 


Val 


Pro 


Asp 


Arg 










85 










90 










95 




Ala 


Val 


Leu 


Gin 


Glu 


Leu 


Ala 


Leu 


He 


Met 


Ser 


He 


Asp 


Glu 


Phe 


Asp 








100 










105 










110 






Pro 


Val 


Thr 


Asn 


Glu 


Ala 


He 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu 


Glu 


Phe 


Glu 






115 










120 










125 








Ala 


Phe 


Leu 


Arg 


Tyr 


Gly 


Leu 


Lys 


Pro 


Thr 


Phe 


Leu 


Thr 


Pro 


Pro 


Ser 




130 










135 










140 










Het 


Gin 


Arg 


Ala 


Val 


Glu 


Het 


Phe 


Asp 


Tyr 


Arg 


Ser 


Gly Glu 


Lys 


Tyr 


145 










150 










155 








160 


Glu 


Trp 


Am 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


He 


Ser 


Met 


Met 


Glu 










165 










170 










175 




Glu 


Phe 


Gin 


Thr 


Lys 


Tyr 


Pro 


Ser 


Leu 


Cys 


Thr 


Thr 


Ser 


Val 


11- 


Gly 








180 










185 










190 






Lys 


SfeE 


Val 


Lys 


Asp 


Arg 


Lys 


Leu 


Met 


He 


Cys 


Lys 


Leu 


Thr 


Ser 


Ser 






195 










200 










205 








Ala 


Asn 


Thr 


Gly 


Lys 


Lys 


Pro 


Arg 


Val 


Leu 


Tyr 


Thr 


Ser 


Thr 


Met 


His 




210 










215 










220 










Gly Asp 


Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


He 


Asp 


His 


225 










230 










235 








240 


Leu 


Leu 


Ser 


JKsr. 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


Tie 


Lys 


Asn 


He 


Leu 


Asp 










245 










250 










255 
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Lys 


Thr 


Glu 


Val 


Trp 


He 


Cys 


Pro 








2 60 










Arg 


Ala 


Gl / Asn 


His 


Thr 


Val 


Gin 
















280 


Asn 


Val 


Asp 


Leu 


Asn 


Arg 


Asn 


Phe 




290 










235 




Pro 


Asp 


Gly 


Lys 


Pro 


Trp 


Gin 


Pro 


30b 










310 






Glu 


Gly 


Asn 


Thr 


Ser 


Phe 


Val 


Leu 










.375 








Glu 


Val 


Val 


Asn 


Tyr 


Pro 


Trp 


Asp 








340 










Asp 


Glu 


Trp 


Tyr 


Lys 


Leu 


He 


Ser 






355 










360 


Ser 


lie 


Ser 


Ala 


Set 


Tyr 


Met 


Thr 




370 










37b 




Arn 


Gly 


Sor 


Arp 


Trp 


Tyr 


Val 


He 


385 










390 






Asn 


Tyr 


Phe 


His 


Acg 


Leu Arg 


Glu 










405 








Lys 


Leu 


Val 


Pro 


AJ a 


Ser 


Cln 


Leu 








420 










Glu 


Ser 


Leu 


Leu 


Ala 


Leu 


He 


Glu 






435 










440 


Thr 


Val 


Thr 


Ser 


Ala 


Ala 


Asn 


Gly 




450 










455 




lie 


Glu 


Asn 


His 


Asp 


Lys 


Arg 


Asn 


4 65 










470 






Thr 


Gly 


Tyr 


Tyr 


Val 


Arq 


Pro 


Ue 










485 








Tyr 


Lys 


Ala 


Glu 


Gly 


Tyr 


Pro 


Glu 








500 










A£fp 


Lys 


Glu 


Thr 


Val 


lie 


Met 


Asp 






515 










520 


Leu 


Pro 


Val 


Pro 


Asp 


Phe 


Thr 


Ala 




530 










535 




GlU 


Ser 


Val 


Gin 


Phe 


Gin 


Asp 


Gin 


54 5 










550 






Glu 


Trp 


Thr 


Phe 


Glu 


Gly 


Gly Gin 










565 








Pro 


T.eu 


Val 


Ser 


Tyr 


ser 


His 


Pro 








580 










Val 


Tip 


Asn 


Ala 


Ser 


Gly 


Ser 


Asn 






595 










600 


Thr 


Val 


Asn 


Ala 


Val 


Met 


Pro 


Val 




610 










615 




Glu 


He 


Glu 


Glu 


Gly Gin 


Thr 


Val 


625 










630 






Al a 


Thr 


Asn 


Tyr 


Val 


Trp 


He 


Phe 










645 








Glu 


Ai;p 


Glu 


Asn 


Pro 


Thr 


Val 


Leu 








660 










Val 


Thr 


Leu 


Lys 


Ala 


He 


Ser 


Ala 






675 










680 


Glu 


Lys 


Tyr 


lie 


Thr 


Val 


Lys 


Lys 




690 










695 




Asp 


Phe 


Glu 


Gly 


Thr 


Pro 


Arg 


Lys 


7 05 










710 






Phe 


Lys 


Asp 


T.cu 


Ssr 


Thr 


Asn 


Asn 










725 








Glu 


Gly 


Gly 


Ser 


Pro 


Ala 


Thr 


Ser 








■740 










Tyr 


Asn 


Glu 


Thr 


Gly Lyr 


Tyr 


Asp 






7 55 










760 


Gly 


Gly 


Ser 


As ri 


V<*1 


Lys 


Lys 


Ala 




770 










775 




Asp 


Asp 


Ser 


Val 


Glu 


Asp 


He 


Val 


785 










7 90 






Pro 


Gin 


Asn 


Gly 


Thr 


Lys 


Gin 


He 










805 








Lys 


Ala 


llr 


Val 


Leu 


Tyr Asp 


He 








820 










Thr 


Pro 


Asn 


Gin 


Leu 


Arg 


Ser 


Thr 






335 










840 


Gly 


He 


Tyr 


Thr 


He 


Asn 


He 


Lys 
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Leu 


Thr 


Asn 


Pro 


Asp 


Gly Ala 


Tyr 


265 










270 






Gly Ala 


Thr 


Arg 


Tyr 


Asn 


Ala 


Asn 










285 








Lys 


Asp 


Asp 


Val 


Ala 


Gly Asp 


His 








300 










Glu 


Ala 


Thr 


Ala 


Phe 


Met 


Asp 


Leu 






315 










320 


Gly Ala 


Asn 


He 


His 


Gly 


Gly 


ThL 




330 










335 




Asn 


Lys 


Lys 


Glu 


Arg 


His 


Ala 


Asp 


345 










350 






Arg 


Asn 


Tyr 


Ala 


Ala 


Ala 


Cys 


Gin 










365 








Scr 


Glu 


Thr 


Asn 


Ser 


Gly 


He 


He 








380 








Arg 


Gly 


Ser 


Arg 


Gin 


Asp 


Asn 


Ala 






395 










400 


He 


Thr 


Leu 


Glu 


Ho 


Ser 


Asn 


Thr 




410 










415 




Pro 


Lys 


Tyr 


Trp 


Asn 


Leu 


Asn 


Lys 


425 










430 






Glu 


Scr 


Leu 


Tyr 


Gly 


He 


His 


Gly 










445 








Gin 


Pro 


Leu 


Lys 


Cys 


Gin 


lie 


Leu 








460 










Ser 


Asp 


Val 


Tyr 


Ser 


Asp 


Ala 


Thr 






475 










480 


Lys 


Ala 


Gly 


Thr 


Tyr 


Thr 


Val 


Lys 




4 90 










495 




Ala 


Thr 


Arq 


Thr 


He 


Thr 


lie 


Lys 


505 










510 






He 


Ala 


Leu 


Gly 


Asn 


Ser 


Val 


Pro 










525 








Ser 


Pro 


Met 


Thr 


He 


Ser 


Val 


Gly 








540 










Thr 


Thr 


Asn 


Asn 


Pro 


Thr 


Asn 


Trp 






555 










560 


Pro 


Ala 


Mot 


Scr 


Thr 


Glu 


Gin 


Asn 




570 










57 5 




Gly 


Gin 


Tyr 


Asp 


Val 


Thr 


Leu 


Lys 


565 










590 






Thr 


He 


Thr 


Lys 


Glu 


Lys 


Phe 


He 










605 








Ala 


Glu 


Phe 


Val 


Gly 


Thr 


Pro 


Thr 








620 










Scr 


Fhe 


Gin 


Asn 


Gln 


Ser 


Thr 


Asn 






635 










640 


Asp Gly Gly 


Thr 


Pro 


Ala 


Thr 


Ssr 




650 










655 




Tyr 


Ser 


Lys 


Ala 


Gly 


Gin 


Tyr 


Asp 


665 










67 0 






Ser 


Gly-Glu 


The 


Val 


Lys 


Thr 


Ly- 










685 








Ala 


Pro 


Val 


Pro 


Ala 


Pro 


Val 


Ala 








7 00 










Val 


Lys 


Lys 


Gly 


Glu 


Thr 


Val 


Thr 






715 










720 


Pro 


Thr 


Ser 


Trp 


Leu 


Trp Val 


Phe 




730 










735 




Thr 


Glu 


Gin 


Asn 


Pro 


Val 


Vol 


Thr 


745 










750 






Val 


Gin 


Leu 


Thr 


Ala 


Thr 


Asn 


Giu 










765 








Glu 


Asp 


Tyr 


He 


Glu 


Val 


He 


Leu 








780 










Ala 


Gin 


Thr 


Gly 


He 


Val 


He 


A eg 






795 










000 


Leu 


He 


Glu 


Al a. 


.Asn 


Ala 


Ala 


lie 




810 










815 




Asn 


Gly Arg 


Val 


Val 


Leu 


Lys 


Thi. 


825 










830 






Val 


Asp Lsu 


Ser 


lie 


Leu 


Pro 


GlU 










645 








Thr 


Glu 


Ly3 


Ser 


Ala 


Arg 


Thr 


Glu 
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850 855 
Lys He Hi* He Gly 
865 

5 { 2 J INFORMATION FOR SEQ ID HO: 302 

(i) SEQUENCE CHARACTERISTICS: 

(A» LENGTH: 106 amino acids 
iB) TYPE ; amino acid 
10 (D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyr onion as gingiva lis 



( ix ) FEATURE : 
20 (A) HA1IE/KEY: miscjeature 

(B| LOCATION 1. . .106 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 302 

25 Leu Ser Tyr Ser Gly Glu Ser Asp Ala Lys Gli; Ser Asp Gin Asn Cys 
15 10 15 

Arg Lys Cyt* Thr Phe He Gly Phe Glu Lys Arg Val Asn Thr Met Arg 

20 25 30 

Leu He Lys Ala Phe Leu Val Gin Leu Leu Leu Leu Pro He Phe Phe 
i0 35 40 45 

Tyr Lys Arg Phe He Ser Pro Leu Thr Pro Pro Ser Cys Arg Phe Thr 

SO 55 60 

Pro Ser Cys Ser Ser Tyr Ala He Glu Ala Leu Arg Lys Tyr Gly Pro 
65 7 ° 75 80 

ib Gly Lys Gly Leu Leu Leu Ser He Lys Arg He Leu Arg Cys His Pro 
85 go 95 

Trp Gly Gly Ser Gly Tyr Asp Pro Val Pro 
100 105 

(2) INFOR1IATION FOR SEQ ID IIO:303 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 061 amino acids 
(E) TYPE: amino acid 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyr omonajs ginglvalis 

(ix) FEATURE: 

(A> NAME/ KEY : misc_reaturc 

(B) LOCATION 1. . .861 

<xl> SEQUENCE DESCRIPTION: SEQ ID NO: 303 



Leu 


Glu 


Met 


Ala 


Tyr 


Asp 


Phe 


Thr 


Gin 


Thr 


Phs 


Arg 


Asn 


Ser 


Leu 


Glu 


1 








5 










10 








15 




Tyr 


Ser 


Tyr 


Gin 


Glu 


Ala 


Thr 


Arg 


Leu 


Gly 


Val 


Val 


Ala 


Val 


Thr 


Gin 








20 










25 










30 






A^p 


Met 


Leu 


Val 


Leu 


Gly 


He 


He 


Ary 


Asp 


Gly 


Asp 


Asn 


Gly 


Ala 


He 






35 










40 










45 








Asp 


He 


Met 


Axg 


His 


Tyr 


Gly 


He 


Asn 


Leu 


Tyr 


Glu 


Leu 


Lys 


Arg 


Leu 




50 










55 










60 






He 


Glu 


Leu 


Glu 


Ala 


He 


Ala 


Glu 


Ser 


Leu 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 


65 










90 










75 










80 


Ser 


Pro 


He 


Phe 


Thr 


Pro 


Scr 


A] a 


Arg 


Glu 


Ala 


He 


Asp 


Asp Ala 


Thr 










85 










90 










95 




Asp 


He 


Cys 


Ala 


Asp 


Met 


Glu 


Asp 


Glu 


Ala 


Val 


Ser 


Pro 


Val 


Hjs 


Leu 








100 










105 










110 






Leu 


Leu 


Ser 


Tie 


Leu 


Asn 


Ser 


Thr 


Gin 


Glu 


Ser 


Leu 


Val 


Gin 


Lys 


He 






115 










120 










125 
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Phe 




Lys 


Gin 


Gly 


Il & 




Tyr 


Asp 


r 


He 






Asp 


Tyr 


Phe 




130 










135 










140 










oi y 


Gin 


Ary 


Asn 


Pro 


Ser 


Glu 


Gly I.ys 


er 


Pro 


„ ec 


U 


et 


lj u 


11° 


14 5 










150 










155 










160 




/\_p 


Gly Tyr 


Gin 


A^p 


Asn 


Asp 


Phe 


Asp 




Glu 


Glu 


Asp 


Glu 


Ser 










165 










170 










175 




go r 


Pro 


ro 


Ser 


Gly 


Asn 


Ser 


Gly Thr 


Gl V 


Gl 


Gl v 


Ser 


Gly 


Asp 


Ala 








1B0 










195 










190 






Pro 


Glu 


Gl T\ 


Asn 


Thr 




Gly 


Gly Asp 


Thr 


Thr 


Thr 


Thr 


Thi- 


rg 


Sec 






1 95 










200 










205 








Gl y 




Asp 


Tnr 


Pro 


Ala 


Leu 


Asp Thr 


Phe 


Gl v 


Thr 


Asp 


Ile 


Thr 


Ala 














215 




















H«^t 


Al 3 


Al a 


Ala 


Gly 


j.yo 


Leu 


Asp 


Pro 


Val 


Val 




Arg 


Glu 


Gin 


Glu 












230 




















240 


1 1 e 


u 


Arg 


Val 


He 


Gin 


He 


Leu 


Scr 




Arg 




Lys 


Asn 


Asn 


Pro 










245 










250 










755 




V(~i 1 


Lru 


He 


Gly Glu 


Pro 


Gly 


Val 


Gly 


Lys 


Ser 


Ala 


He 


Val 


Glu 


Gly 








260 










265 










2 10 






Leu 


Ala 


Glu 


Arg 


He 


Val 


Asn 


Arg 


Lys 


Val 


Ser 


Arg 


He 


Leu 


Phe 


Asp 






27 5 










280 










285 










Arg 


He 


He 


Ser 


Leu 


Asp 


Leu 


Ala 


Gin 


Met 


Val 


Ala 


Gly 


Thr 


I.ys 




299 










295 










300 










Tyr 


Arg 


Gly 


Gin 


Phe 


Glu 


Glu 


Arg 


Leu 


I.ys 


Ala 


Val 


Leu 


Asp 


Glu 


Leu 


305 










310 










315 










320 


Lys 


Lys 


Asn 


Pro 


Gin 


He 


He 


Leu 


Phe 


He 


Asp 


Glu 


He 


His 


Thr 


He 










325 










33G 










335 




Vai 


uiy 


Ala 


Gly Ser 


Ala 


Ala 


Gly Ser 


Met 


Asp 


Tnr 


Ala 


Asn 


[let 


Leu 








340 










345 










350 






I.ys 


Pro 


Al a 


Leu 


Ala 


Arg 


Gly Gin Val 


Gin 


Cye 


He 


Gly Ala 


Thr 


Thr 






355 










360 










365 








Leu 


Asp 


Glu 


Tyr Arg 


Lys 


Asn 


He 


Glu 


Lys 


Asp 


Gly 


Ala 


Leu 


Glu 


Arg 














375 










380 














Gin 


Lys 


Val 


Pro 


He 


Ala 


Pro 


Ser 


Thr 




Glu 


Glu 


Thr 


Leu 


385 










390 










395 










400 


Thr 


-Lie 


Lsu 


Gin 


Asn 


He 


Lys 


Glu 


Lys 


Tyr 


Glu 


Asp 


Tyr 


His 


Gly Val 










405 










410 










41b 




Arg 


Tyr 


Thr 


Asp 


Glu 


Ala 


lie 


Lys 


Ala 


Al a 


/al 


Gl u 


Leu 


Thr 


Asp 


Arg 








420 










425 










430 






Tyr 


a 


Ser 


Asp 


Arg 


Phe 


Phe 


Pro 


Asp 


Lys 




lie 


Asp 


Ala 


Het 


Asp 
















440 










445 








Glu 


Al a 


Gly 


Ala 


Ser 


Val 


His 


He 


Thr 




Val 


Val 


Ala 


Pro 


Lys 


Glu 




4 50 










455 


















lie 


Glu 


lis 


Leu 


Glu 


Ala 


Glu 


Leu 


Ala 


er 


Va 1 


Ara 


Glu 


Asn 


Lys 


Leu 












470 










475 










480 




Ala 


Val 


Lys 


Ala 


Gin 


Asn 


Tyr 


Glu 


Leu 


Ala 


Al a 


Ser 


Phe 


Arg 


Asp 










485 










4 90 










4 95 




Gin 


Glu 


Arg 


Arg 


Thr 


Gin 


Gin 


Gin 


lie 


Ala 


Glu 


Glu 


Lys 


Lys 


Lys 


Trp 








500 










505 










510 






Glu 


Glu 


Gin 


Met 


Ser 


Lys 


His 


Arg 


Glu 


Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 






515 










520 










525 








Ala 


His 


Val 


Val 


Ala 


Leu 


Met 


Thr Gly 


Val 


Pro 


Ala 


Glu 


Arg 


Leu 


Star 




530 










535 










540 










Thr 


Gly 


Glu 


Gly 


Glu 


Arg 


Leu 


Arq 


Thr 


Met 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


545 










550 










555 








560 


Lys 


Val 


Val 


Gly Gin A;?p 


Thr 


Ala 


He 


Glu 


Lys 


Met 


Val 


His 


Ala 












565 










57 0 










575 




Gin 


Arg 


Asn 


Arg 


Leu 


Gly 


Leu 


Arg Asn 


Glu 


Lys 


Lys 


Pro 


He 


Gly 


Ser 








S80 










585 










590 






Plus 


Leu 


Phe 


Leu 


Gly 


Pro 


Thr 


Gly Val 


Gly 


Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 






595 










600 










605 






Lys 


Leu 


Ala 


Glu Tyr 


Leu 


Phe 


Glu 


Asp 


GlU 


Asn 


Ala 


Het 


He 


Arg 


Val 




610 










615 










620 










A*p 


list 


Ser 


Glu 


Tyr 


Met 


Glu 


Lys 


Phe 


Ser 


Val 


Ser 


Arg 


Leu 


Val 


Gly 


62b 










630 










635 










640 


Ala 


Pro 


Pro 


Gly 


Tyr 


Val 


Gly Tyr Glu 


Glu 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 










645 










650 










655 




Arg 


Val 


Arg 


Arg 


Ly? 


Pro 


Tyr 


Ser 


Val 


Vai 


Leu 


Leu 


Asp 


Glu 


He 


Glu 








660 










665 










67 0 






Lys 


Ala 


His 


Ala 


Asp 


Val 


Phe 


Asn 


Leu 


Leu 


Leu 


Girt 


-Val 


Met 


Asp 


Glu 






67 5 










680 










685 








Gly 


Gin 


Leu 


Thr 


A<fp 


Sar 


Leu 


Gly 


Arg 


Arg 


Val 


Asn 


Phe 


Lys 


Asn 


Thr 














695 










700 








Val 


He 


He 


He 


Thr 


Ser 


Asn 


Val 


Gly 


Thr 


Arq 


Gin 


Leu 


Lys 


Asp 


Phe 


705 










710 










715 








720 


Gly 


Gin 


Gly 


He 


Gly 


Phe 


Arg 


Ser 


Glu 


hys 


Asp 


Glu 


Glu 


AJ A 


Asn 


Lys 
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72b 










73G 










/3S 




GJu 


Hi? 


Ser 


Arg 


Ser 


Val 


He 


Gin 


Lys Ala 


Leu 


Asn 


Lyp 


Thr 


Phe 


Ser 








740 










745 










750 






Pro 


Glu 


Ph* 


Leu 


Aen 


Arg 


Leu 


Asp 


Asp 


He 


lie 


Leu 


Phe 


Asp 


Gin 


Leu 






755 










760 










7 65 








Gly 


Lys 


Thr 


Glu 


Tie 


Axg 


Arg 


Met 


Val 


Asp 


He 


Glu 


Leu 


Lys 


Ala 


Val 




770 










775 










780 










Leu 


Ala 


Arg 


lie 


HI* 


Arg 


Ala 


Gly 




Asp 


Leu 


Val 


Leu 


Thr 


Asp 


Glu 


7B5 










7 90 










7 95 










800 


Ala 


Lys 


Ar=p 


Val 


Tie 


Ala 


Thr 


Lys 


Gly 


Tyr 


Asp 


Leu 


Gin 


Tyr 


Gly Ala 










805 










810 










815 




Arg 


Pro 


Leu 


Lys 


Arg 


The 


Leu 


Gin 


Asn 


Glu 


Val 


Glu 


Asp 


Arg 


Leu 


Thr 








820 










825 










830 






Asp 


Leu 


lie 


Leu 


ser 


Gly 


Gin 


He 


Glu 


Lys 


Gly 


Gin 


THr 


Leu 


Thr 


Leu 






835 










840 










845 








Ser 


Ala 


Arg 


Asp 


Gly 


Glu 


lie 


He 


Val 


Gin 


Glu 


Gin 


Ala 










aso 










855 










860 










<2\ 


INFORHATIOll 


FOR 


S3EQ 


ID NO:304 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 56 amino dcid^ 
[ R) TYPE: amino acid 
{ D) TOPOLOGY: linear 



tii) MOLECULE TYPE: protein 

Uii} HYPOTHETICAL; YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

lix) FEATURE: 

(A) NAME/KEY: miscjeaturo 
<BJ LOCATIOU 1. . .456 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 304 



Ary 


Gly 


Gly 


Gin 


He 


Arg 


Arg 


His 


His 


Thr 


Asp 


Ser 


Ser 


Arg 


Gly 


Ser 


1 








5 










10 










15 




Asp 


Ser 


Lys 


Ala 


Cys 


Arg 


Phe 


Ser 


Phe 


His 


Ala 


Glu 


Thr 


He 


Gly 


Phe 








2C 










25 










30 






Ser 


Asn 


His 


Gin 


Arg 


Ala 


Lys 


Met 


Asn 


Tyr 


Leu 


Tyr 


He 


Leu 


He 


Thr 






3b 










40 










4b 








Leu 


Leu 


Leu 


Ser 


Gly 


Phe 


Phe 


Ser 


Gly 


Ala 


Glu 


He 


Ala 


Phe 


Leu 


Ser 




50 










55 










60 










Ser 


Asp 


Lys 


I.6U 


Arg 


Leu 


Glu 


Leu 


As-p 


Arg 


Asn 


Arg 


Gly 


Asp 


Leu 


Thr 


65 










70 










75 










80 


Gly 


Arg 


Ala 


Leu 


Asn 


Leu 


Leu 


Tyr 


Arg 


His 


Pro 


Asp 


Gin 


Leu 


Val 


Thr 










85 










90 










95 




Thr 


Leu 


Lou 


Val 


Gly 


Asn 


Asn 


He 


Val 


Leu 


Val 


Val 


Tyr 


Gly Leu 


Leu 








100 










105 








110 






Ilet 


Ala 


Gly 


Leu 


Leu 


Ala 


Ala 


Pro 


Leu 


Ala 


Gin 


Trp 


He 


Asp 


Asn 


Asp 






115 










120 










125 






Ala 


Ilet 


He 


Val 


Val 


Leu 


Gin 


Ser 


Val 


Leu 


Ser 


Thr 


He 


He 


He 


Leu 




130 










135 










140 










Phe 


Thr 


Gly 


Glu 


Phe 


Leu 


Pro 


Lys 


Ala 


He 


Phe 


Lys 


Thr 


Asn 


Ala 


Asn 


145 










150 










155 








160 


Ilet 


Met 


Met 


Arg 


Val 


Phe 


Ala 


Leu 


Pro 


He 


Val 


Ala 


He 


Tyr 


Tyr 


Leu 










165 










170 








175 




Leu 


Tyr 


Pro 


Leu 


Ser 


Lys 


Leu 


Phe 


Thr 


Gly 


Leu 


Ser 


Arg 


Ser 


Fhe 


He 








100 










185 










190 






Arg 


T.eu 


Val 


Asp 


Lys 


Asn 


Tyr 


Val 


Pro 


Thr 


Thr 


Val 


Gly 


Leu 


Gly Arg 






1**5 










200 










205 








Val 


Asp 


Leu 


A£TD 


Hi* 


Tyr 


Leu 


Ala 


Glu 


Asn 


Met 


Ser 


Gly 


Glu 


Asn 


Glu 




210 










215 










220 








Gin 


Asn 


Asp 


Leu 


Thr 


Thr 


Glu 


Val 


Lys 


He 


He 


Gin 


Asn 


Ala 


Leu 


Asp 


225 










230 










235 










240 


Phe 


Ser 


Gly 


He 


Gin 


Val 


Arg 


Asp 


Cys 


Ilet 


He 


Pro 


Arg 


Asn 


Glu 


Met 










245 










250 










255 




He 


Ala 


cys 


Glu 


Leu 


Gin 


Thr 


Asp 


He 


Glu 


Val 


Leu 


Lys 


Thr 


Thr 


Phe 








260 










265 








270 






lie 


Asp 


Thr 


Gly 


Leu 


Ser 


Lys 


He 


He 


He 


Tyr 


Arg 


Gin 


Asn 


He 


Asp 






27 5 










280 










285 






Asp 


Val 


Val 


Gly 


Tyr 


He 


His 


Ser 


Scr 


Glu 


Met 


Phe 


Arg 


Gly 


Gin 


Asp 
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290 










295 










'300 












Trp 


Gin 


Lys 


Arg 


He 


Asn 


Thr 


Thr 


Val 


Phe 


Val 


Pro 


Glu 


Ser 


l!et 


Tyr 




305 










310 










315 










320 




Ala 


Asn 


Lys 


Leu 


Met 


Arg 


Leu 


Leu 


Met 


Gin 


Arg 


Lys 


Lys 


Ser 


He 


Ala 


5 










325 










330 








335 






lis 


Val 


lis 


Asp 
340 


Glu 


Leu 


Gly 


Gly 


Thr 
3 15 


Ala 


Gly 


Met 


Val 


Thr 
350 


Leu 


Glu 




Asp 


Leu 


Val 


Glu 


Glu 


IU 


Phe 


Gly 


Asp 


He 


Glu 


Asp 


Glu 


His 


Asp Thr 


10 






35S 










360 










365 








Arg 


Lys 
370 


lie 


lie 


Ala 


Lys 


Gin 


Leu 


Gly 


Pro 


His 


Thr 
380 


Tyr 


Leu 


Val 


Ser 




(31 y Arg 


Het 


Glu 


lis 


Asp 


Asp 


Val 


Asn 


Glu 


Arg 


Phe 


Gly 


Leu 


Ser 


Leu 




385 










390 










395 










400 


15 


Pro 


Glu 


ber 


Asp 


Asp 


Tyr 


Leu 


Thr 


Val 


Ala 


Gly 


Phe 


lie 


I.eu 


A^n 


Ser 










405 










410 








415 






His 


Gin 


Asn 


He 
420 


Pro 


Gin 


Ala 


Asn 


Glu 
425 


Val 


Val 


Glu 


He 


Ala 
430 


Pro 


Tyi 




Thr 


Phe 


Thr 


He 


Leu 


Arg 


Ser 


Ser 


Ser 


Thr 


J.ys 


Tie 


Glu 


Leu 


Val 




20 






435 










440 










445 








Ilet 


Ser 
450 


11- 


Asp 


Asp 


Gin 


Ser 
4 55 


Asn 



















(2) iriE-ORIIATIOII FOR SEQ ID NO: 305 

25 1> SEQUENCE CHARACTERISTICS: 

(Al LENGTH-. 299 amino acids 
(B| TYPE: amino acid 
(D) TOPOLOGY: linear 

30 til) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Porphyromonas gingivali? 

(ijt) FEATURE: 

(A) NAUE/KEY: raiscjeature 
<B) LOCATION 1...299 

40 



45 



50 



55 



60 



65 



70 





(XI. 1 


I SEQUENCE DESCRIPTION: SEQ ID HO: 305 










T.eu 


Lys 


Lys 


Glu 


He 


Thr 


Met 


Lys 


Gin 


Asn 


Tyr 


Phe 


Lys 


Arg 


Val 


Cys 


1 








5 










10 








15 




S«r 


Leu 


Leu 


Trp 


Leu 


Val 


Leu 


Pro 


Met 


Leu 


He 


Met 


Pro 


Leu 


Glu 


Val 








20 










25 










30 






Ala 


Ala 


Gin 


Glu 


He 


He 


Pro 


Asn 


Glu 


Glu 


Val 


Leu 


Glu 


Ser 


Leu 


Thr 






35 










40 










45 








Phe 


Val 


Ala 


Pro 


Val 


G)u 


Glu 


Thr 


Asp 


Ala 


He 


Glu 


Ala 


Glu 


Val 


Glu 




50 










55 










60 










Ala 


Leu 


Gin 


Glu 


He 


Val 


Ala 


Thr 


Glu 


Glu 


lie 


Ala 


Glu 


Gin 


Ala 


Val 


65 










70 










75 










80 


Arg 


Ser 


Tyr 


Thr 


Tyr 


Thr 


Val 


Tyr 


Axg 


Asp 


Gly 


Val 


Lys 


He 


Ala 


Ser 










85 










90 










95 




Gly 


Leu 


Thr 


Glu 


Pro 


Thr 


Phe 


Leu 


Asp 


Glu 


Asp 


Val 


Pro 


Ala 


Gly Glu 








100 










105 










110 






His 


Thr 


Tyr 


Cys 


Val 


Glu 


Val 


Gin 


Tyr 


Gin 


Gly 


Gly 


Val 


Ser 


Asp 








115 










120 










125 








Val 


Cys 


val 


Asp 


Val 


Glu 


Val 


Lys 


Asp 


Phe 


Lys 


Pro 


Val 


Thr 


Asn 


Leu 




130 










135 










140 










Thr 


Gly 


Thr 


Ala 


Ser 


Asn 


Asp 


Glu 


Val 


Ser 


Leu 


Asp 


Trp Asp Gly Val 


145 










150 










155 










160 


Glu 


Glu 


Lys 


Ala 


Glu 


Glu 


Pro 


Ala 


Ser 


Asp 


Lys 


Ala 


Val 


Ser Tyr 


Asn 










165 










170 










175 




Val 


Tyr 


Lys 


Asn 


Gly 


Thr 


Leu 


He 


Gly 


Asn 


Thr 


Ala 


Glu 


Thr 


His 


Tyr 








180 










105 










190 






Val 


Glu 


Thr 


Gly 


Val 


Ala 


Asn 


Gly 


Thr 


Tyr 


He 


Tyr 


Glu 


Val 


Glu 


Val 






195 










200 










205 








Lys 


Tyr 


Pro 


Asp 


Gly 


Val 


Ser 


Pro 


Lys 


Val 


Ala 


Val 


-Thr 


Val 


Thr 


Val 




210 










215 










220 










Thr 


Asn 


Ser 


Ser 


Leu 


Ser 


Asn 


Val 


Asp 


Gly 


Gin 


Ala 


Pro 


Tyr 


Thr 


Le=j 


225 










230 










235 








240 


Arg Val 


Glu 


Gly 


Lys 


Lys 


He 


He 


Ala 


Glu 


Ala 


His 


Gly Met 


He 


Thi 










245 










250 










255 




Leu 


Tyr 


Asp 


He 


Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Val 


Ala 


Pro 


Asn 


Arg 


T.~u 
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260 



265 



27 0 



5 
10 
15 
20 
25 
30 
35 

* 40 
45 
50 
55 
60 
65 
70 
75 



Glu Tyr Met Ala Gin Thr GXy Phe Tyr Ala Val Arg Phe Asp Val Gly 

275 280 285 

Asn Lys His His Val Ser Lys Tie Gin Val Arg 



{2) INFORMATION FOR SEQ ID WO: 306 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3*'7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) HOLECULE TYPE:: protein 

(ill) HYPOTHETICAL: YES 

ivt) ORIGINAL SOURCE: 

ih) ORGANISM: Porphyromonas gingi/alii: 

{ Ui ) FEATURE : 

(A) MAI IE/ KEY : misc_feature 
(R| LOCATION 1 37 I 



(j:i) SEQUENCE DESCRIPTION: SEQ ID NO: 300 



L2U 


?ho 




His 


Lys 


Lys 


Ser 


Trp 


Tyr 


Glu 


Thr 


Phe 


Gin 


Phe 


Tyr 


Leu 


1 








5 










10 










15 




Val 


Val 


Phe 


Aig 


Ser 


Gly 


Phe 


He 


Leu 


Leu 


Cys 


Glu 


Asn 


Thr 


L*u 


Ala 








20 










25 










30 






Gin 


Gin 


T.y* 


Thr 


Glu 


GlU 


Phe 


Ala 


Pro 


Val 


Ser 


Asp 


Leu 


Arg 


Ala 


Glu 






35 










40 










45 








Ala 


Tyr 


Gly 


Ser 


Thr 


Val 


Phe 


Leu 


His 


Trp 


Thr 


Pro 


Pro 


Tyr 


Asp 


Asn 




50 










55 










60 










Pro 


Met 


He 


Pro 


I.cu 


Ser 


Glu 


Ser 


Phe 


Glu 


Ser 


Gly 


He 


Pro 


Ala 


He 


65 










70 










75 










eo 


Trp 


Lys 


Thr 


He 


Asp 


Ala 


Asp 


Gly 


Asp 


Gly 


Tyr 


Asn 


Trp 


Met 


His 


Leu 










es 










90 










9b 




Thr 


Asn 


Phe 


Thr 


Gly 


Gin 


Ser 


Gly 


Leu 


Cys 


Val 


Ser 


Ser 


Ala 


Ser 


Tyr 








100 










105 










110 






lie 


Gly 


Gly 


Val 


Gly 


Ala 


Leu 


Thr 


Pro 


Asp 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 






115 










120 










125 








Glu 


Leu 


Lys 


Leu 


Pro 


Thr 


Asp 


Ala 


Leu 


Val 


Glu 


He 


He 


Tyr 


Trp 


Val 




130 










135 










140 










Cys 


Thr 


Gin 


Asp 


Leu 


Thr 


Ala 


Pro 


Ser 


Glu 


His 


Tyr 


Ala 


Val 


Tyr 


Ser 


145 










150 










155 








160 


Ser 


Ser 


Thr 


Gly 


Asn 


Asn 


Ala 


Ala 


Asp 


Phe 


Val 


Asn 


Leu 


Leu 


Tyr 


Glu 










165 










170 










175 




Glu 


Thr 


Leu 


Thr 


Ala 


Lys 


Arq 


He 


Gin 


Ser 


Pre 


Glu 


Leu 


Tie 


Arg 


Gly 








180 










185 










190 






Asn 


Arg 


Thr 


Gin 


Gly 


Val 


Trp 


Tyr 


Gin 


Arg 


Lys 


Val 


Val 


Leu 


Pro 


Asn 






195 










200 










205 








Asp Thr 


Lys 


Tyr 


Val 


Ala 


Phe 


Arg 


His 


Phe 


Asn 


Ser 


Thr 


Asp 


Asn 


Phe 




210 










215 










220 










Trp 


Leu 


Asn 


Leu 


Asp 


Glu 


Val 


Ser 


He 


Leu 


Tyr 


Thr 


Pro 


Leu 


Pro 


Arg 


225 










230 










235 










240 


Arg 


Ala 


Pro 


Cys 


Pro 


His 


Pro 


Gly 


Gly 


Tyr 


Thr 


Tyr 


Scr 


Val 


The 


Arg 










245 










250 










255 




Asp 


Gly 


Gin 


Lys 


He 


Ala 


Ser 


Gly 


Leu 


Ser 


Ala 


Leu 


Ala 


Tyr 


He 


Asp 








260 










265 










270 






Thr 


Asp 


Val 


Pro 


Tyr 


Gly 


Thr 


Gin 


Asp 


Tyr 


cys 


Val 


Gin 


Val 


Asn 


Tyr 






2*b 










280 










285 








Leu 


Gin 


Gly 


Asp 


ser 


Tyr 


Lys 


Val 


Cys 


Lys 


Asn 


He 


Val 


Val 


Ala 


Asn 




290 










295 










300 










Ser 


Ala 


Asn 


He 


Tyr 


Gl y 


Ala 


Asp 


Lys 


Pro 


Phe 


Ala 


Leu 


Thr 


Val 


Val 


305 










310 










315 










320 


Gly 


Lys 


Thr 


He 


Val 


Ala 


Ser 


Ala 


Phe 


Lys 


Gly 


Glu 


He 


Tnt 


Leu Tyr 










325 










330 










335 




Asp 


He 


Arg 


Gly 


Arg 


Leu 


He 


Ala 


Sec 


Gly 


Cys 


Asp„jThr 


Leu 


Arg 


Tyr 








340 










345 










350 






Lys 


Ala 


Glu 


Asn 


Gly 


PUe 


Tyr 


Leu 


He 


Lys 


lie 


Gin 


Val 


Asn 


Gly Thr 






355 










360 










365 








Val 


Tyr 


Thr 


Glu 


Lye 


He 


Gin 


He 


Gin 


















37 0 










375 
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(2) INFORMATION FOR SEQ ID HO: 307 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 amino acids 
O <B) TYPF. : amino acid 

(D> TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

10 (iiD HYPOTHETICAL: YES 

<vi> ORIGINAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingival!; 
15 (ix) FEATURE: 

(A) NAME/KEY; miscfeature 

(B) LOCATION I...Q49 



(Ki) SEQUENCE DESCRIPTION: SEQ ID MOr307 
Phe Gly Tie Ser Pro Ser Met Lys Lys Ser Phe Leu Uu Ala II. Val 



Met Leu Phe Gly He Ala Hot Gin Gly hL S-r A!a Pro Val Thr Lys 
25 Glu Arg Ala Leu *.r Leu Ala Arg Leu Ala Leu Arg Gin Val Ser Leu 
Arc, Met Gly Gin Thr Ala Val Scr A,p T.y, He Ser lie A*p Tyr Val 

30 Ii" ^ G1V ' ** P ? 0 a G ^ U A ^ G1 * ^hr !« Gin Glu Glu Gly 

Pro Ala Tyr Phe Tyr Val Ala Aim Arg Gly tan Asn Glu Gly £yr 

90 or 
Ala Leu Val Ala Ala Asp tap Arg lie Pro Thr He Leu Al. Tyr Ser 

35 Pro lie Gly Arg Phe Asp Met Asp Ser Met Pro Asp Asn Leu Arc, Met 
Trp Leu Gin He Tyr Asp Gin G^u He Gly Leu He Leu Ser Gly Lys 
40 FA AS " G1U ?i" "° Leu Ar 9 Th ^ «u Gly Val Pro Ala Glu 

J.OU i c c 

Val HiS Ala Leu Met Asp to Gly His Phe Ala Asn Asp Pro Met A^g 

165 170 nr 

Trp fcn Gin Gly Tyr Pro Trp Asn Asn Lys Glu Pro Leu Leu Pro Asn 

45 Gly Asn HI, Ala Tyr Thr Gly Cys Val Al, Thr Ala Ala Ma Gin II- 

200 2 0 c 
Met Arg Tyr H is Ser Trp Pro Leu Oln Gly Glu «Y S er l>„. Asp Tyr 

50 £j Ala «y S5C Leu £J Asn Trp S er Gly Thr Phe Gly Glu Het 

Tyr Asp Trp lie Asn Het Pro Gly Asn Pro As^p Leu Asp Asn Leu Thr 
Gin Ser Gin Val Asp" Ala Tyr Ala Thr Leu Met Arg Asp Val Scr Ala 

55 ser Val Ser Het Ser Ph c Tyr Glu A^n Gly Ser Gly Thr Tyr Ser Val 



t o v p a c 

Tyr Val Val Gly Ala Leu Arg Asn Asn Phe Arg Tyr Lys Arg Ser Leu 

60 SS ^ ^ Si ^ S " G1 " ™" TtP " 1S *»* M=t 

II- Arg <siy Glu Leu Ala Ser Gly Arg Pro Val Tyr Tyr Al a Gly ™ 

Asn Gin Ser lie Gly His A la One Val £s Asp Gly Tyr Ala Ser Asp 

65 Gly Thr Phe His Phe Asn Trp Gly Trp Gly Gly Val Scr Asn Gly Phe 

Tyr Lys L.u Thr Leu Leu Ser Pro Thr Ser Leu Gly "I Gly Gly Glu 

— » ** ' 3 38^ 

70 3fll ° iy Thr Tyc Gin Glu 11= no Thr-Gly ,,. G lu Pro 

Jyu 395 



Ala Lys Thr Pro Ala Gl'u Ala Gly Thr Asp A !a L,u Pro He Leu A?° 



405 



..eu Lys Asp lie Glu Ma Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val 
Gly Tyr Sar He Tyr Asn Thr Gly «u Glu Gin Ser Asn ilu Asp Leu 
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65 







435 










440 










445 








Gly Tyr 


Arq 


Leu 


Asn 


Lys 


Ala 


Asp 


Gly 


Glu 


Val 


He 


Glu 


Val 


Lys 


Thr 




450 










4 55 










460 










Ser 


Ser 


He 


Asn 


He 


Ser 


Trp 


Tyr 


Gly 


Tyr 


Gly 


Glu 


His 


Pro 


Glu 


Ser 


4 65 










4 10 










475 










480 


Phe 


Ser 


L^u 


Ala 


Pro 


Asi\ 


Gin 


Lau 


Ser 


Gin 


Gly 


Tl<* 


Asn 


Thr 


He 


Thr 










485 










490 










4 95 




Leu 


Leu 


Tyr 


Arg 


Arg 


Thr 


Gly 


Thr 


Glu 


Gin 


Trp 


Glu 


Pro 


Val 


Arg 


His 








500 










505 










510 






Ala 


Gin 


Gly 


Gly 


Tyr 


Val 


Asn 


Ser 


He 


Lys 


Val 


Asn 


Thr 


Thr 


Asp 


Pro 






515 










520 










525 








Am 


Asn 


Val 


Val 


Val 


Thr 


Val 


Asp 


Asn 


Asn 


Glu 


Gly Lys 


Leu 


Ser 


He 




530 










535 










540 










Val 


Fro 


Asn 


Ser 


Phe 


Val 


Ala 


Asp 


Leu 


Asn 


Ser 


Tyr 


Glu 


His 


Ser 


Thr 


545 










550 










555 










560 


He 


Thr 


Val 


Gin 


Phe 


Asn 


Ser 


Asp 


Ser 


Pro 


Asp 


Glu 


He 


Arg 


Thr 


Pro 










565 










570 










575 




Val 


Ala 


Phe 


Ala 


Leu 


Ser 


Thr 


Gly Ala Thr 


Ala 


Asp Asp Val 


He 


Ser 








580 










505 










590 






Leu 


Gly 


Trp 


Val 


Met 


Ala 


Glu 


Val 


Pro 


Gly 


Gly Ser 


Ser 


Asn 


Tyr 


Pro 






595 










600 










605 








Val 


Val 


Trp 


Ser 


Lys 


Asp 


Val 


Leu 


Thr 


Leu 


Ser 


Glu 


Gly Asp 


Tyr 


Thr 




610 










615 










620 










Leu 


Tip 


Tyr 


Arg 


Phe 


Szc 


He 


Asn 


Asn 


Gin 


Lys 


Asp 


Glu 


Trp 


Lys 


Lys 


625 










630 










635 










640 


Tl« 


Gly 


Ser 


Val 


ser 


Val 


Lys 


Thr 


Pro 


Thr 


Cl u 


Tyr 


Thr 


His 


pro 


Leu 










64 5 










650 








655 




Phe 


Glu 


Val 


Gly 


His 


Arn 


Gin 


Thr 


Ser 


Thr 


Tyr 


Thr 


Leu 


Asp 


Met 


Ala 








660 










665 










670 






His 


A.-?r. 


Ara 


Val 


Leu 


Pro 


Asp 


Phe 


Thr 


Leu 


Lys 


A-?n 


Leu 


Gly 


Leu 


Pro 






675 










600 










685 








Phe 


Asn 


Gly 


Glu 


Leu 


Val 


Val 


Val 


Phe 


Arg 


Gin 


Thr 


Gin 


Ser 


Ser 


Ser 




690 










695 










700 










Gly 


Ser 


Leu 


Trp 


Ala 


Ala 


Gin 


Glu 


Thr 


Val 


His 


He 


Lys 


Gin 


Gly Glu 


705 










710 










715 










720 


Thr 


Phe 


Val 


Tyr 


Lys 


Pro 


Val 


Val 


Glu 


Gly 


Pro 


He 


Pro 


Asp 


Gly 


Ser 










725 










730 










735 




Tyr 


Arg 


Ala 


Thr 


Leu 


His 


Ala 


Phe 


Val 


Asn 


Gly 


Gin 


Gin 


Gin 


Leu 


Tyr 






740 










745 










750 






Leu 


Lys 


Gly 


Lys 


Arg 


Asn 


Tyr 


Thr 


Val 


Lys 


Ile 


Val 


Asn 


Gly 


Thr 


Ala 






755 










760 










7 65 








Val 


Glu 


Ala 


He 


Glu 


Ser 


Ser 


GlU 


Glu 


He 


Arg 


Val 


Phe 


Pro 


Asn 


Pro 




770 










775 










780 










Ala 


Arg 


Asp 


Tyr 


Val 


Glu 


lis 


Ser 


Ala 


Pro 


Cys 


He 


Pro 


Gin 


Glu 


Thr 


785 










790 










/95 










80C 


Ser 


lie 


He 


Leu 


Phe 


Asp 


Leu 


Scr 


Gly Lys 


He 


Val 


Met 


Lys 


Asn 


Ser 










805 










610 










015 




Leu 


Ser 


Ala 


Gly 


His 


Gly 


Arg 


Wee 


Asp 


Val 


Ser 


Arg 


Leu 


Pro 


Asn 


Gly 








820 










825 










830 






Ala 


Tyr 


He 


Leu 


Lys 


Val 


Asp 


Gly Tyr Thr 


Thr 


Lys 


He 


Asn 


He 


Val 
















B40 










845 








His 

































10 



15 



20 



25 



30 



35 



40 



45 



50 



55 {2) TM FORMATION FOR SfcO ID NO: 308 

(U SEQUENCE CHARACTERISTICS: 

IA> LENGTH: 295 amino acids 
{ B J TYPE: amino acid 
60 ( D) TOPOLOGY: linear 



Hi) HOLECULS TYPE: protein 

(ilii HYPOTHETICAL: YES 

I vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonas gingiva lid 



lix) FEATURE : 
70 (A» NAItE/KEY: miacjeaturc 

(B) LOCATION 1,,.295 

Uil SEQUENCE DESCRIPTION : SEQ ID NO: 308 

75 Thr Lys Lys Thr Leu net Lys Lys Leu Phe T.«u Ser Leu Thr Ser Leu 
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10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 



X 








5 










10 










15 




Val 


Met 


Val 


Phe 
2^ 


Ala 


Val 


Ala 


Ser 


Cys 
25 


Asp 


71- 


11^ 


Asp 


Lys 
30 


Asp 


Gl n 


Thr 


Leu 


Leu 


Pro 


Ala 


Pro 


Thr 


Asn 


Val 


Thr 


Pro 


Asp 


Asn 


Pro 


Asp 


Asp 






35 










40 










45 






Asn 


Pro 


Ser 


Glu 


He 


Asp 


He 


Thr 


Gin 


Thr 


His 


Thr 


Glu 


Lys 


Tyr 


Val 




50 










55 










60 








Leu 


Ala 


Glu 


Gl u 


Phe 


Thr 


Gly 


Gin 


Lys 


Cys 


Leu 


Asn 


Cys 


Pro 


Lys 


Gly 


65 










70 










•75 










80 


Ills Arg 


Ly- 


Leu 


Ala 


Ala 


Leu 


Lys 


Glu 


Gin 


Tyr 


Gly 


Lys 


Arg 


Leu 


Thr 










85 










90 










95 




Val 


Val 


Gly 


He 
100 


His 


Ala 


Gly 


Pro 


Gly 

105 


Ser 


Leu 


Val 


Pro 


Pro 
110 


Lsu 


Phe 


Arg 


Thr 


Glu 
IIS 


Ala 


Gly 


Asp 


Ala 


Tyr 
120 


Tyr 


s?r 


Lys 


Phe 


Ala 

125 


Asn 


Asn 


Thr 


Pro 


Leu 
130 


Pro 


Ala 


Leu 


Met 


Val 
135 


Ser 


Arg 


Lys 


Lys 


Phe 
140 


Gly 


Ser 


Ser 


Tyr 


Val 


Tyr 


Asp 


Lys 


Ser 


Tyr 


Lys 


Thr 


Trp 


Asp 


Val 


Pro 


He 


Ala 


GlU 


Gin 


145 










150 










155 










160 


I let 


Gl u 


Gin 


Lys 


Ala 
165 


Lys 


He 


Asn 


He 


Phe 

no 


Ala 


Val 


Ala 


Glu 


Tyr 
175 


Thr 


Asp 


Thr 


Gin 


Lys 
180 


He 


Lyi> 


Val 


Thr 


Val 
185 


Lys 


Gly 


Lys 


He 


Leu 
190 


Glu 


Gly 


As w 


Thr 


L&u 
1?^ 


Pro 


Lys 


Ser 


flet 


Val 

200 


Gin 


Val 


Tyr 


Leu 


Leu 
20b 


Glu 


Asp 


Lys 


Lsu 


He 


Ala 


Pro 


Gin 


Val 


Asp Gly Arn 


Thr 


Thr 


Val 


Glu 


Asn 


Tyr 


Glu 




210 










21b 










220 










His 


Asn 


His 


Val 


Lgu 


Arg 


Gly Ala 


Val 


Asn 


Gly 


He 


Trp 


Gly 


Glu 


Glu 












230 










235 








2-10 


Phe 


Val 


Asn 


Leu 


Lys 


Asp Tyr 


Leu 


Tyr 


Thr 


Tyr 


Ala 


Val 


Glu 


Pro 


Leu 










245 










250 










255 




Ser 


Gly Het 


Ser 


Phe 


Val 


Ala 


Glu 


Asn 


Tyr 


Ser 


He 


Val 


Ala 


Phe 


Val 








260 










2 65 










2 to 






Tyr 


Asp 


Val 


Gin 


Thr 


Phe 


Glu 


Val 


Tyr Asp Val 


Val 


Mis 


Val 


Lys 


He 






27 5 










280 










285 








Asn 


Pro 

290 


Gin 


Scr 


Asp 


Gly 


Lys 
2 95 





















(2) IN FORI (ATI Oil FOR SEQ ID MO: 309 

ti) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 230 amino acids 
(B) TYPL: amino acid 
(D) TOPOLOGY: linear 

Ui> MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

tix) FEATURE: 

(A) MAI IE/ KEY : misc_f eatur- 
55 ( B J LOCATION 1...230 



60 





Ixi) 


SEQUENCE DESCRIPTION: SEQ : 


ID NO:309 










Thr 


Asn 


Lys 


Lys 


Glu 


Glu 


Thr 


Met 


Lys 


Lys 


Ser 


Ser 


Val 


Val 


Ala 


Sar 


1 








5 










10 










15 




Val 


Leu 


Ala 


Val 


Ala 


Leu 


Val 


Phe 


Ala 


Gly 


Cys 


Gly 


Leu 


Asn 


Asn 


Met 








20 










25 










30 






Ala 


Lys 


Gly 


Gly 


Leu 


He 


Gly 


Ala 


Gly 


Val 


Gly 


Gly 


Ala 


He 


Gly 


Ale 






35 










40 










45 






Gly 


Val 


Cli- 


Asn 


Val 


Ala 


Gly 


Asn 


Thr 


Ala 


Val 


Gly 


Ala 


He 


Val 


Gly 




50 










55 










60 








Thr 


Ala 


val 


Gly 


Gly 


Ala 


Ala 


Gly 


Ala 


Leu 


He 


Gly 


Lys 


Lys 


Wet 


Atp 


65 










70 










75 










80 


Lys 


Gin 


Lys 


Lys 


Glu 


Leu 


Glu 


Ala 


Ala 


Val 


Pro 


Asp. 


Ala 


Thr 


He 


Gin 










35 










90 








95 




Thr 


Val 


Asn 


Aip 


Gly 


Glu 


Ala 


He 


Leu 


Val 


Thr 


Phe 


Asp 


Ser 


Gly He 








100 










105 










110 






Levi 


Phe 


Ala 


Thr 


Asn 


Ser 


Ser 


Thr 


Leu 


Ser 


Pro 


Asn 


Ser 


Arg 


Thr 


Ala 






115 










120 










125 






Leu 


Thr 


Lys 


Phe 


Ala 


Ala 


Asn 


Het 


Asn 


Lys 


Asn 


Pro 


A.*p 


Thr 


Asp 


He 
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130 13b 14D 

Arg He Val Gly His Thr Asp Asn Thr Gly Ser Asp Lys lie Asn Asp 
145 150 15b 160 

Pro Leu Ser Glu Arg Arg Ala Ma Ser Val Tyr Ser Phe Leu Asn Ser 
5 1G5 170 175 

Gin Gly Val Ser Mst Ser Arg Met Ala Ala Glu Gly Arg Gly Ser His 

180 185 190 

Glu Pro Val Ala Asp Asn Ser Thr Val Ala Gly Arg Ser Ala Asn Arq 
195 200 205 

10 Arg Va] Glu Val Tyr He Leu Pro Asn Ala Lys Met He Glu Gin Ala 
210 215 220 

Gin Gin Giy Thr Leu Lys 
225 230 

15 <2) INFORMATION FOR, SEQ ID NO: 310 

(1) SEQUENCE CHARACTERISTICS: 

{Al LENGTH : 34 2 amino acids 
(B) TYPE: amino acid 
20 (P) TOPOLOGY; linear 



25 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(li) MOLECULE TYPE: protein 

(til) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(Al ORGANISM: Porphyromona^ gingival is 



tlx} FEATURE : 
30 (A) NAME/ KEY: miscjeaturs 

(B> LOCATION 1...342 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



Asn 


Arg 


Asn 


Arg 


Asn 


Met 


Ser 


Lys 


Lys 


Ser 


He 


Leu 


Leu 


Leu 


Cys 


Cys 


X 








5 










10 










15 




Scr 


Leu 


Cys 


Phe 


He 


Ser 


Ala 


Thr 


Lys 


Ala 


Val 


Thr 


Pro 


Val 


Arg 


Asn 








20 










25 










30 




Val 


Arg 


Asn 


Ser 


Gin 


Val 


Asn 


Ser 


Lys 


Ala 


Lys 


Thr 


Glu Arg 


Thr 


Lys 






35 










40 










45 








Pro 


Ser 


Asp 


Ser 


Val 


Arg 


Tyr 


He 


Ser 


Asn 


Met 


He 


Ala 


Asp 


Arg 


Leu 




50 










55 










60 










Glu 


Ptie 


Arg 


Asn 


Lys 


He 


Ser 


Ser 


Glu 


Lys 


Glu 


Val 


Arg 


Lys 


Ala 


Glu 


65 










70 










75 










eo 


Tyr 


Glu 


Asn 


Arg 


Leu 


Ala 


Met 


Glu 


Ala 


Leu 


Asn 


Tyr 


Pro 


Ala 


He 


Asp 










85 










90 










95 




Leu 


Tyr 


Gly 


GlU 


Asp 


Ser 


Trp 


Ser 


Glu 


Tyr 


Val 


Asn 


Pro 


Phe 


Val 


Gly 








100 










105 










no 






Ala 


Gly 


Thr 


Asp 


Val 


Glu 


He 


Pro 


Asn 


Ser 


Tyr 


Asp 


He 


Asp 


Cys 


Ser 






115 










120 










125 








Ser 


Phe 


Val 


Met 


Pro 


Val 


Glu 


Asp 


Lys 


Gin 


Val 


Thr 


Ser 


Gin 


Phe Gly 




130 










135 










140 










Tyr 


Acq 


Arg 


Arg 


Phe 


Gly 


Arg 


Met 


His 


Tyr 


Gly 


He 


Asp 


Leu 


Ser 


Val 


145 










150 










155 


( 








160 


Asn 


Arg 


Gly 


Asp 


Thr 


He 


Arg 


Ala 


Ala 


Phe 


Asp 


Gly 


Lys 


Val 


Arg 


Val 










165 










170 










175 




Arg 


Ser 


Tyr 


Glu 


Ala 


Arg 


Gly 


Tyr 


Gly 


Tyr 


Tyr 


He 


Val 


Leu 


Arg 


Hi c 








( 100 










185 










190 






Pro 


Asn 


Gly 


Leu 


Glu 


Thr 


Val 


Tyr 


Gly 


His 


Met 


Ser 


Arg 


Gin 


Leu 


Val 






195 










200 










205 








Asp 


Glu 


Asn 


Gin 


He 


Vdl 


Arg 


Ala 


Gly 


Gin 


Pro 


He 


Gly 


Leu 


Gly Gly 




210 










215 










220 










Ser 


Thr 


Gly 


Arg 


Ser 


Thr 


Gly 


Pro 


His 


Leu 


His 


Phe 


Glu 


Thr 


Arq 


Phe 


225 










230 










235 










240 


Met 


Gly 


lie 


Pro 


He 


Asn 


Pro 


Ser 


Thr 


He 


He 


Asp 


Phe Asp 


Asn Gly 










245 










250 










255 




Val 


Pro 


Leu 


Arg 


Asp 


He 


Tyr 


Thr 


Phe 


Lys 


Arg 


Gly 


Ser 


Asn 


Ser 


Arg 








260 










265 










27 0 






Tyr 


Ala 


Lys 


Ala 


Ser 


Lys 


Thr 


Ser 


Ser 


Arg 


Ty r 


Alq 


Lys 


Lys 


Gly 


Lys 






275 










280 










285 








Lys 


Gly 


Arg 


Gin 


Ala 


Ser 


Ser 


Pco 


Met 


Thr 


Tyr 


Arg 


He 


Lys 


Lye 


Gly 




290 










295 










300 










Asp Thr 


Leu 


Glu 


Thr 


He 


Ala 


Lys 


Arg 


His 


Gly 


Thr 


Ser 


Val 


Gin 


Lys" 


305 










310 










315 










320 


Leu 


cys 


Ala 


Thr 


Atn 


Gly 


He 


Gly 


Lys 


Ser 


Lys 


lie 


Leu 


Thr 


Pro 


Gly 
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32b 330 335 

Lyr A) a L«;u Arg He Lys 

340 

(2) INFORMATION FOR SEQ ID NO: 311 

{!) SF.QUF.IJCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 
(3) TYPE: amino acid 
tD) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

tvl) ORIGINAL SOURCE: 

(A) ORGANISM : Porphyromonas gingivals 

( ix) FKATURF.: 

(A) NAME /KEY : misc feature 
(E) LOCATION l. . .159 

<;<i) SEQUENCE DESCRIPTION: SEQ ID HO: 311 

Pro Ser Lys Thr lie He Lys Thr Met- Ala Ly* He Am ?he Tyr Ala 

10 15 
Glu Gly Val s-r L*u Pro Arg He Arg Arg Arg He Val Gly Lys Trp 

He Ala Glu Val Cys Ser Arg Tyr Gly Lys Ala Val Gly Glu He Ser 

Tyr Leu Phe Cys A«p Asp Glu Tyr He Leu Lys Ala Asn Gin Glu Phe 

ao 55 60 

Leu Asp His Asp Tyr Tyr Thr Asp He He Thr Pne Aso Ser Cys Glu 

Ala Asp Thr Val Asn Gly Asp Leu Leu lie Ser Leu Asp Thr Val Arg 

85 90 95 

Ser Asn Ala Arg Ala Leu Asp Leu Arg Tyr Glu Asp Glu Leu His Arg 

100 105 no 

Val Tie He His Gly He Leu His Leu Cys Gly Leu Lys Asp Lys Ser 

115 120 !25 

Lys Lys Asp Glu Ala Gin Met Arg Ala Ala Glu Glu Lys Ala Lsu Val 

XJU 135 i4o 

Met Leu Arg Glu Thr He Gly Ser Glu Leu Ser Leu Leu Mis Thr 
±qt> I 50 155 

(2) INFORMATION FOR SEQ ID NO: 312 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3 95 ami nr» acids 
IB ) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix> FEATURE: 

(A) NAME/ KEY : mi ro_f eatur- 

(B) LOCATION 1. . .395 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 
Lys Ser Lys Thr Met Lys Val Lys Tyr Leu Met Leu Thr Leu Val Gly 
Ala lie Ala Leu Asn Ala Ser Ala Glr, Glu Asn Thr Val Pro Ala Thr 
Leu Pro Ala Lys Asn Val 

35 40 45 

Ser Asn Trp Phe Val Thr Leu Gin Gly Gly Val Ala Ala Gin Ph- Leu 
55 60 



Gly Gin Leu Pro M a Lys Asn Val Ala p ne Ala Arg Asn Lys Ala Gly 
Ser Asn Trp Phe Val Thr Leu Gin Gly Gly Val Ala Ala Gin Ph- Leu 
Asn Asp Asn Asn Asn Lys A^p Leu Met A S p Arg Leu Gly Ala He Gly 



7 0 75 



80 
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50 



55 



70 



75 



Ser 


Leu 


Ser 


Val 


Gly 


Lys 


Tyr 


His 


S~r 


Pro 


Phe 


Phe 


Ala 


Thr 


Arg 


Leu 










85 










90 










95 




Gin 


He 


Arn 


Gly 


Gl y 


Gin 


Ala 


His 


Thr 


Phe 


Leu 


Gly 


Lys 


Asn 


Gly Glu 








100 










105 










110 






Gin 


Glu 


He 


Arn 


Thr 


Asn 


Phe 


Gly Ala 


Ala 


His 


Phe 


ASp 


Phe 


Het 


Phe 






115 










120 










125 








Asp 


Val 


Vai 


Asn 


Tyr 


Phe 


Ala 


t?ro 


Tyr 


Arg 


Glu 


Asn 


Arg 


Phe 


Phe 


His 




130 










135 










140 










Leu 


He 


Pro 


Trp 


Val 


Gly Val 


Gly 


Tyr 


Gin 


His 


Lys 


Phe 


He 


Gly Ser 


145 










ISO 










155 










160 


Glu 


Trp 


Ser 


Lys 


Asp 


Asn 


Val 


Glu 


Ser 


Leu 


Thr 


Ala 


Asn 


Val 


Gly 


Val 










165 










170 










175 




Met 


Met 


Ala 


Phe 


Ary 


Leu 


Gly I.ys 


Arg 


Val 


Asp 


Phe 


Val 


He 


GlU 


Ala 








100 










185 










190 






Gin 


Ala 


Ala 


His 


Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 


Tyr 


Asn 


Ala 


Lys 






195 










200 










205 








Lys 


Thr 


Pro 


Val 


Phe 


Glu Asp 


Pro 


Ala 


Gly Arg, 


Tyr 


Tyr 


Asn 


Gly 


Phe 




210 










215 










220 










Gin 


Gly 


tl«t 


Ala 


Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


Gly 


Ala 


Val 


Gly 


225 










230 










235 










240 


Phe 


A-u 


Ala 


He 


Unh 


Pro 


Me: 


Asp Tyr 


Ala 


Leu 


He 


Asn 


Asp 


Leu 


Asn 










245 










250 










255 




GAy 


Gin 


He 


Asn 


Arg 


Leu 


Arq 


Ser 


Glu 


Vai 


Glu 


Glu 


Leu 


Ser 


Lys 


Arg 








260 










265 










2/0 






Pro 


Val 


Ser 


Cys 


Fro 


Glu 


Cy E 


Pro 


Glu 


Val 


Thr 


Pro 


Val 


Thr 


Lys 


Thr 






275 










280 










285 






Glu 


Asm 


He 


Leu 


Thr 


G) u 


Lys 


Ala 


Val 


Leu 


Phe 


Arg 


Phe 


Asp 


Ser 


His 




290 










295 










300 










Val 


Val 


Asp 


Lys 


Asp 


Gin 


Leu 


He 


Asn 


Leu 


Tyr 


Asp 


Val 


Ala 


Gin 


Phe 


305 










310 










315 










320 


Val 


Lys 


Glu 


Thr 


Asn 


Glu 


Pro 


He 


Thr 


Val 


Val 


Gly 


Tyr 


Ala 


Asp 


Pro 










325 










330 








335 




Thr 


Gly 


Asn 


Thr 


Gin 


Tyr Asn 


GlU 


Lys 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


Lys 








34U 










345 










350 






Ala 


Val 


Vnl 


Asp 


Val 


Leu 


Thr 


Gly 


Lys 


Tyr 


Gly 


Val 


Pro 


Ser 


Glu 


Leu 






355 










360 








365 








He 


Ser 


Val 


Glu 


Trp 


Lys 


Gly 


Asp 


Ser 


Thr 


Gln 


Pro 


Phe 


Ser 


Lys 


Lys 




370 










375 










380 










Ala 


Trp 


Asn 


Arq 


Val 


Val 


He 


Val 


Axg 


Ser 


Ly3 












385 










390 










395 












(2 > 


INFORMATION 


FOR 


SEQ 


ID HO: 313 

















10 



15 



20 



25 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 397 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



{ ii ) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pocphyrowionas glngivalis 

lix) FEATURE: 

(A] NAME/ KEY: misc_feature 

( B ) LOCATION 1 ... 3 87 



60 UD SEQUENCE DESCRIPTION : SEQ ID NO: 313 



65 



Tyr 


Lys 


Met 


Thr 


Tyr 


Arg 


lie 


net 


Lys 


Ala 


Lys 


Ser 


Leu 


Leu Leu 


Ala 


1 








5 










10 








15 




Leu 


Ala 


Gly 


Leu 
20 


Ala 


Cys 


Thr 


Phe 


Ser 
25 


Ala 


Thr 


Ala 


Gin 


Glu Ala 
30 


Thr 


Thr 


Gin 


Asn 
35 


Lys 


Ala 


Gly 


Het 


His 
40 


Thr 


Ala 


Phe 


Gin 


Arg 
45 


Asp Lys 


Ala 


Ser 


Asp 
50 


His 


Trp 


Phe 


He 


Asp 
55 


He 


Ala 


Gly 


Gly 


Ala 

60- 


Gly 


Met Ala 


Leu 


Ser 


Gly 


Trp 


Asn 


Asn 


Asp 


Val 


Asp 


Phe 


Val 


Asp 


Arg 


Leu 


Ser He 


Val 


65 










70 










75 








80 


Pro 


Thr 


The 


Gly 


He 
85 


Gly 


Lys 


Trp 


His 


Glu 
90 


Pro 


Tyr 


Phe 


Gly Thr 
95 


Arg 


Leu 


Gin 


Phe 


Thr 


G) y 


Phe 


Asp 


He 


Tyr 


Gly 


Phe 


Pro 


Gin 


Gly Ser Lys 








100 










105 










110 
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Glu 


Arg 


Asn 


His 


Asn 


Tyr 


Phe 


Gly 


Asn 


Ala 


His 


Leu 


Asp 


Phe 


Met 


Phe 






115 










120 










125 








Asp 


Leu 


Thr 


Asn 


Tyr 


Phe 


Gly Val 


Tyr 


Arg 


Pro 


Asn 


Arg 


Val 


Phe 


His 




130 










135 










140 










lis 


lie 


Pro 


Trp 


Ala 


Gly 


He 


Gly 


Phe 


Gly 


Tyr 


Lys 


Phe 


His 


Ser 


Glu 


145 










150 










155 










160 


Asn 


Ala 


Asn 


Gl y 


Glu 


Lys 


Val 


Gly 


Ser 


Lys 


Asp 


A-=D 


Met 


Thr Gly Thr 










165 










170 










175 




Val 


A^n 


Val 


Gly 


Leu 


Mel 


Leu 


'j * i 


Phe 


Arg 


Leu 


Ser 


Arg 


Val 


Val 


Asp 








100 










185 










190 






Phe 


Asn 


He 


Glu 


Gly Gin 


Ala 


Phe 


Al a 


Gly 


Lys 


Met 


Asn 


Phe 


He 


Gly 






19b 










200 










205 








Thr 


Lys 


Arg 


Gly 


Lys 


Ala 


Asp 


Phe 


Pro 


Val 


Uet 


Ala 


Thr 


Ala 


Gly 


Leu 




210 










215 










220 










Thr 


Phe 


Asn 


Leu 


Gly 


Lys 


Thr 


Glu 


Trp 


Thr 


Glu 


He 


Val 


Pro 


Met 


Asp 


225 










230 










235 










240 


Tyr 


Ala 


Leu 


Val 


Asn 


Asp 


Leu 


Asn 


Asn 


Gin 


Tie 


Asn 


Ser 


Leu 


Ara 


Gly 










245 










250 










255 




Gin 


Val 


Glu 


Glu 


Leu 


Ser 


Arg 


Arg 


Pro 


Val 


Ser 


Cys 


Pro 


Glu 


Cys 


Pro 








260 










265 










270 






Glu 


Pro 


The 


Gin 


Pro 


Thr 


Val 


Thr 


Arg 


Val 


Val 


Val 


Asp 


Asn 


Val 


Val 






275 










280 










2Q5 








Tyr 


Phe 


Arg 


He 


Asn 


Ser 


Ala 


Lys 


He 


Asp 


Arg 


Asn 


Gin 


Glu 


He 


Asn 




290 










295 










30C 










Val 


Tyr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Ala 


Lys 


Thr 


Aj?n 


Asn 


Ala 


Pro 


lie 


Lys 


305 










310 










315 










320 


Val 


Val 


Gly 


Tyr 


Ala 


Asp 


Glu 


Lys 


Thr 


Gly Thr 


Ala 


Ala 


Tyr 


Asn 


f'OZ 










325 










330 










335 




Lys 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


Lys 


Ala 


Val 


Ala 


Lys 


tlet 


Leu 


Glu 


Lys 








340 










34 S 










350 






Tyr 


Gly 


Val 


Ser 


Ala 


Asp 


Arg 


He 


Thr 


He 


Glu 


Trp 


Lys 


Gly 


Ser 


Ser 






355 










360 










365 








Glu 


Gin 


He 


Tyr 


Glu 


Glu 


Asn 


Ala 


Trp 


Asn 


Arg 


He 


Val 


Val 


Met 


Thr 




370 










37 5 








380 










Ala 


Ala 


Glu 




























385 
































(2) 


INFORMATION 


FOR 


SE0 


ID HO: 314 

















10 



15 



20 



25 



30 



35 



40 (i) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 195 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY : linear 

45 Jii) MOLECULE TYPE; protein 

(111) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
50 (A | ORGANISM: Pocphyromonas gingival is 

I ix ) FEATURE : 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . .195 

55 



60 



65 



70 



75 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 31 4 










Val 


He 


Gly 


He 


He 


Met 


Glu 


Phe 


Phe 


Met 


Leu 


Ph- 


He 


Ala 


Ala 


Val 


1 








5 










10 










IS 




Phe 


Val 


Asn 


Acn 


Val 


Val 


Leu 


Ser 


Gin 


Phe 


Leu 


Gly 


He 


Cy* 


Pro 


Phe 








20 










25 








30 






Leu 


Gly Val 


Ser 


Lys 


Lys 


Val 


A*p 


Thr 


Ser 


He 


Gly 


Met 


Gly Ala 


Ala 






35 










40 








45 








Val 


Thr 


Phe 


Val 


Leu 


Ala 


Leu 


Ala 


Thr 


Leu 


Val 


Thr 


Phe 


Leu 


He 


Gin 














55 










60 










Lys 


Phe 


Val 


Leu 


Asp 


Arg 


Phe 


Gly 


Leu 


Gly 


Phe 


Met 


Gin 


Thr 


He 


Ala 


65 










70 










75 










80 


Phe 


He 


Leu 


Val 


He 


Als 


Ala 


Leu 


Val 


Gin 


Met 


Val 


Glu 


He 


He 


Leu 










65 










90 










95 




Lys 


Lys 


Val 


Sei 


Pro 


Pro 


Leu 


Tyr 


Gin 


Ala 


Leu 


Gly 


Val 


Phe 


Leu 


Pro 








100 










105 








110 






Leu 


He 


Thr 


Thr 


Asn 


Cys 


Cys 


Val 


Leu 


Gly 


Val 


Ala 


He 


Leu 


Val 


He 






115 










120 








125 








Gin 


Lys 


Asp 


Tyr 


Thr 


Leu 


Leu 


Gin 


Ser 


Phe 


Val 


Tyr 


Ala 


He 


Ser 


Thr 




130 










135 










140 
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Ala lie Gly Phe Thr Leu Ala Met Val Thr Phe Ala Gly He Arg Glu 
145 150 155 160 

Gin Leu A.-p Met Thr Asn Leu Pro Lys Ala Met Lys Gly He pro Ser 
165 17 0 17 5 

O Ala Leu Leu Ala Ala Gly Tie Leu Ala Met Ala Phe Met Gly Phe S°r 
180 185 190 

Gly He Ala 

1S5 

10 (2) INFORMATION FX)R SEQ ID NO: 315 

(1) SEQUENCE CHARACTERISTICS: 

(A J LENGTH: 87 6 amino acids 
(E) TYPE: anano acid 
15 ( D) TOPOLOGY: linear 



20 



30 



40 



50 



60 



70 



Ui) MOLECULE TYPE ; protein 

till) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE; 

(A) ORGANISM: Porphyromonas gingival Is 



(ix) FEATURE: 
25 [A) NAME/ KEY : misc feature 

(B) LOCATION 1. . .87 6 



(XI J SEQUENCE DESCRIPTION: SCO, ID NO: 315 



Tyr Arg Ser Tyr Arg Gly He Gly Ser Gly Thr His Ser Pro Asn Leu 
15 10 is 

Lys Asn Arg Leu Lys Arg He Gly He Arg He Pro Asn Arg His Tvc 

20 25 30 

He His He Lys Pro He Lys Pro Lys Asn Lys Met Lys Gin Leu Asn 
OO 35 40 45 

lie He Ser Phe He He Ala Phe Leu Phe Leu Gly Thr Ser Ala Ser 

50 55 60 

Ala Gin Gin Ser Gly Gly Ser Val Thr Gly Thr Val Val Asp Lys Ser 
65 70 75 00 

Ser Lys Glu Pro He Ala Tyr Val Gin Val Phe Val Lys Gly Thr Thr 

85 go 95 

Leu Gly Thr Ser Thr Asp Ala Asn Gly Asn Tyr Ser He Lys Gly He 

100 105 no 

Pro Ser Gly Asn Gin Thr He Val Ala Arg Leu MeL Gly Tyr Ser Thr 

115 120 12b 

Cys Glu Glu Lys Val His He Glu Lys Gly Gly Ser Arg His Val Asp 

130 135 140 

Leu Tyr Leu Thr Glu Glu He Leu Ser Leu Asp Gly Val Val Val S-r 
145 150 155 iso 

Ala Asn Arg Asn Glu Thr Phe Arg Arg Gin Ala Pro Ser Leu Val Thr 

165 170 175 

Val Leu Ser Pro Glu Leu Phe Leu Lys Thr Asn Ser Thr Asn Leu Ser 

180 185 i9o 

Gin Gly Leu Lys Phe Gin Pro Gly Leu Arg Val Glu Asp Asn Cys Gin 
DO 195 200 205 

Asn Cys Gly Phe Asn Gin Val Arg He Asn Gly Leu Glu Gly Ala Tvr 

210 215 220 

Ser Gin He Leu He Asp Ser His Pro He Phe Ser Ser Leu Ala Glv 
225 23° 235 240 

Val Tyr Gly Leu Glu Gin Met Pro Ala Asn Met He Glu Arg Val Glu 

245 250 255 

Val He Arg Gly Gly Gly Ser Ala Leu Phe Gly Ser Asn Ala Val Gly 

260 265 270 

Gly Val He Asn Val He Thr Lys Glu Pro T.«u Arg Asn Ser Ala Glu 
bb 275 280 285 

He Ser His Ser Thr Met Thr Phe Asp His Ala Lys Gly Trp Gly Spr 

290 295 300 

Phe Gin Asn Thr Thr Gin Phe Asn Gly Ser Met Leu Thr Glu Asp Arg 
3uS 3X0 315 320 

Lys Ala Gly Val Met Val Phe Gly Gin His Asn Tyr Arg Pro Gly Gin 

325 330 335 

Asp He Asp Gly Asp Asn Phe Thr Glu Leu Pro Asn Leu Arg Asn Arg 

340 345 350 

Ser Leu Gly Phe Arg Ser Tyr Tyr Lys Thr Gly Leu Tyr Ser Lys Ala 
/0 355 360 365 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Tli t 


L=u 


Glu 


Tyr 


His 


Ser 


1 let 


Gin 


Glu 


Tyr 


Arg 


Arg 


Gly 


GU 


Asp 


Atg 














175 










36*-' 










Leu 


Asp 


Asn 


Pro 


Pre 


Phtr 


Glu 


Ala 


Gl n 


J 1 e 


Al a 


Glu 


Tyr 


Leu 


Gin 


Hir 












3 90 










395 










40C 


Tyr 


I 1 £■ 


Am 


Gly 


y 


Ser 


Phe 


Lys 


Phe 


Asp 


Gin 


Gly 


Phe 


Ser 


Gly 


Gly 










4 05 










410 










415 




Lys 


Asp 


Phe 


Phe 


Ser 


Lsu 


Tyr 


Ala 


Ser 


Ala 


Gl n 


As/p 


Va 1 


Gl n 


Arg 


Arg 








4 20 










425 










4 30 






Ser 


Tyr 


Tyr 


Gly 


Gl y 


Gly 


Asp 


Tyr 


Thr 


Glu 


Asn 


Leu 


Leu 


Asn 


Gly 


Ala 






4 35 










4 4 0 










445 








Val 


Gin 


Ser 


Gly 


Ser 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 


A-sn 


Asp 


Ala 


Phe 


Thr 




4 50 










155 










4 60 










Ala 


Leu 


Thr 


Ser 


Tyr 


Gly 


Thr 


Thr 


Lys 


Gly 


Phe 


Asp 


Leu 


Gin 


Gly 


Gly 


465 










470 










47 5 










480 


Gly 


Uet 


Tyr 


Arg 


His 


Thr 


Phe 


Gly 


Glu 


Asn 


Trp 


Asp 


Phe 


Thr 


Gly 


Gly 










485 








490 










4 95 




Leu 


Glu 


Tyr 


He 


Tyr 


Gly 


Gin 


Leu 


Asp 


Asp 


Arg 


Ser 


Gly 


Tyr 


Arg 


Pro 








500 










505 










510 






Ser 


Lys 


lie 


Asp 


Gin 


Asn 


Thr 


Ser 


Thr 


Phe 


Scr 


Gin 


Tyr 


A£ P 


Gin 


Leu 






51 5 










520 










525 








Glu 


Tyr 


Lys 


Thr 


Gl u 


Lys 


Leu 


Ser 


Ala 


Leu 


He 


Gly 


Ala 


Arg 


He 


Asp 




530 










535 










540 










Tyr 


Val 


Leu 


Leu 


Asn 


Gin 


Asp 


Gly 


Lys 


Arg 


Tyc 


lis 


Asp 


Pro 


Leu 


Phe 


545 










S50 










555 










560 


He 


Phe 


Ssr 


Pro 


Arg 


Al a 


Asn 


Val 


Arg 


Tyr 


Asn 


Pro 


Asn 


Lys 


Asn 


Leu 










565 










57 0 










575 




Ser 


Phe 


Arg 


Leu 


Ser 


Tyr 


Str 


Glu 


Gly 


Ph~ 


Arg 


Ala 


Pro 


Gin 


Tyr 


Ph* 








500 










585 










590 






Arp 


Glu 


Asp 


Leu 


His 


Val 


Glu 


Leu 


Ala 


Gly 


Gly 


Thr 


Pro 


He 


Ser 


Arg 






595 










600 










605 






Val 


Leu 


Ser 


Pro 


Asn 


Leu 


Lys 


Glu 


Glu 


Arg 


Ser 


Arg 


Ser 


He 


Ser 


Ala 




610 










615 










620 










Ser 


Ph- 


Asp 


Tyr 


Tyr 


His 


Arg 


Ala 


Asp 


Glu 


Trp 


Gin 


Phe 


Am 


He 


Met 


625 










630 










635 










64 0 


Gly 01 u 


Ala 


Phe 


Ser 


Thr 


Phe 


Tie 


Ser 


Asn. 


Gin 


Phe 


Lys 


Pro 


Ser 


Asp 










645 










650 










655 




Lys 


Val 


Glu 


Thr 


Thr 


Ser 


Asp 


Gly 


Lys 


Glu 


Trp 


He 


He 


Arg 


Thr 


He 








660 










665 










670 






Tyr 


Asn 


Asp 


Lys 


Asp 


Gly 


Val 


Ser 


Lys 


Val 


Tyr 


Gly 


Val 


Asn 


Leu 


Glu 






67 5 










680 










685 








Gly 


Arg 


lie 


A] a 


Tyr 


Asn 


Lys 


Ser 


Phe 


Asp 


Leu 


Gin 


Leu 


Gly 


Gly 


Thr 














695 










700 










Trp 


Gin 


Arg 


Ser 


Arg 


Tyr 


Gly 


Ser 


He 


Tyr 


Thr 


Ala 


Val 


Clu 


Ala 


Asp 


7 0S 










710 








715 










720 


Lye 


Thr 


Thr 


Gly 


Gin 


Ala 


Glu 


He 


Ser 


Val 


Lys 


Asp 


Tyr 


Val 


Arg 


Thr 










725 










730 










735 




Pro 


Asn 


Leu 


Tyr 


Gly 


Tyr 


Phe 


Val 


Ala 


The 


Val 


Arg 


Pro 


Thr 


Glu 


His 








740 










745 










750 






Phe 


Ala 


11 e 


Asn 


Leu 


Ser 


Gly 


Thr 


Phe 


Thr 


Gly 


Lys 


Met 


Asp 


Val 


Val 






755 










750 










7 65 








His 


Glu 


Ala 


Tyr 


Glu 


Gly 


Asp 


He 


Pro 


Ala 


Glu 


Hi? 


He 


Ala 


Pro 


Asp 




770 










775 










780 








Gly 


Ser 


Phe 


Asp 


Phe 


Glu 


Met 


Asn 


Gly 


Gin 


Gin 


Phe 


Lys 


Gly 


Leu 


Ala 


785 










790 








7 95 








800 


Glu 


Gly 


His 


Ala 


Lys 


Leu 


Val 


Lys 


Thr 


Pro 


Ala 


Phe 


Ala 


Asp 


He 


Asp 










805 










810 










815 




Leu 


Lys 


Lsu 


Ser 


Hi s 


Asp 


Phe 


His 


Leu 


Ala 


Ser 


Thr 


Met 


Thr 


Leu 


GLu 








820 










825 










830 






Leu 


Asn 


Ala 


Gly 


lie 


Gin 


Arn 


He 


Phe 


Asn 


Ser 


Tyr 


Gin 


Lys 


Asp 


Thr 






835 










840 










845 








Asp 


Lys 


Gly 


Pro 


Gly 


Arg 


Ala 


Ser 


Thr 


Tyr 


Val 


Tyr 


Gly 


Pro 


Met 


Gin 




850 










855 










860 










Pro 


Arg 


Arg 


lie 


Ph« 


Val 


Gly 


Thr 


Lys 


He 


Asn 


Phe 










B65 










870 








B7 5 













\2) INFORMATION FOR SEQ ID 110:316 



|i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 899 ammo acids 
70 ( B ) TYPE: amino acid 

tD) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

75 till) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



("1) ORT1T1JAL SOURCE: 

(Pil ORjAI.ISU: Pcrpiv/comona5 gingivali.= 

(l,:) FEATURE: 

[A) MAi 1r./ KEY : mi sc_ feature 
(Bl LOCATION 1 - . .899 

(::i> SEQUCUCE DESCKi PTIOt I : SEQ ID 110:316 



He Leu Asn His Leu Arg Lys Thr Met Tyr Lys Lys He He Ala Val 
1 5 10 15 

Ala Ala Leu Phe Cys Ala Ser He Gl a - He Leu Lys Gly Gin Ser Ser 
2C 25 30 

15 Asp Leu Tit i Pro Gin Asp Thr Tie Tyr Ser Pro Glu He Ser Tyr Ala 
35 «J0 45 

Lys Pro He His Lys Thr He Ala Ser He Glu Us Glu Gly Met Arg 

50 55 60 

Stc Phv A.?p Asp Phc Val Leu Arq Asn Leu Ser Gly Leu Ala Val Gly 
20 65 70 75 80 

Afp Glu Val Leu He Pro Gly Asp Ala Het Ser Ala Ala Val Asn Arg 

85 90 95 

He Het Arg Gin Gly Tyr Phe Ser Asn Val Arg He He Ala Asp Lys 
100 105 110 

25 Tyr Va] Gly Asrv Lys Val Tyr Leu Lys He He Val Tivr Glu Arg Pro 
115 120 125 

Arg He Ser Lys Val Thr Phe Ser Gly Val Lys Lys Ser Glu Arg Glu 

130 135 140 

Asp Leu Glu I let Lys lie Gly Leu Arg Glu Gly He Gin Het Thr Arg 
30 145 150 155 160 

Asn Asn Glu Asp Lys Val Arg Gin He Val Gin Lys Tyr Phe Ser Glu 

165 J70 175 

Lys Gly Tyr Arg Asp Ala Ser He Arg He Thr Gin Glu Pro Asp Leu 
160 185 190 

35 Ser Lys Asp Gly Phe Val Asn Val Leu lie Ser He Glu Lys Lys Ser 
195 200 205 

Lys Thr Lys Val Asn Glu He Tyr Phe Ser Gly Asn Lys Ala Leu Seu 

210 215 220 

Asn His Lys Leu Arg Met Ala Met Lys Asn Thr Asn Ala Lys Phe Ser 
40 225 230 23S 240 

Leu Arg Lys His He Arg Ser Ser Phe Leu Lye Leu Phe Ser Thr His 

245 250 255 

Lys Phe Val Glu Glu Ser Tyr Arg Glu Asp Leu Val Arg Leu He Glu 
260 265 270 

45 Lys Tyr Gin Glu Tyr Gly Tyr Arg Asp Ala Glu He Leu Thr Asp Ser 
275 280 285 

Val Val Lys Ala Pro Asp Gly Lys Arg Val Asp He Tyr Leu Asn He 

290 295 300 

Glu Glu Gly Gin Lys Tyr Tyr He Lys Asp Val Asn Phe Val Gly Asn 
50 305 310 315 320 

Ser Gin Tyr Pro Sor Glu Tyr Leu Glu Arg Val Leu Gly He Lys Ser 

325 330 335 

Gly Asp Val Tyr Asn Gin Arg Arg Leu Ala Lys Arg Leu Asn Glu Asp 
340 345 350 

55 Glu Asp Ala Val Gly Asn Leu Tyr Tyr Asn Asn Gly Tyr He Phe Ala 
355 360 365 

Trp Val Asp Pro Val Glu Thr Asn Val Val Gly Asp Ser Val Ser Leu 

370 375 380 

Asp He Arg He Ala Glu Gly Lys Gin Ala Asn He Asn Lys Val He 
60 385 390 395 400 

He Lys Gly Asn Thr Val Val Tyr Glu Asp Val Val Arg Arg Glu Leu 

405 410 415 

Tyr Thr Lys Pro Gly Gin Leu Phe Ser Arg Glu Asp lie He Asn Ser 
420 425 430 

65 He Arg Leu He Asn Gin Leu Gly Hie Phe Asp Ala Glu Lys Ser He 
435 440 445 

Pro Arg Pro He Pro Asn Pro Glu Thr Gly Thr Val Asp He Glu Tyr 

450 4S5 460 

Asp Leu Val Pro Arg Ser Ser Asp Gin Leu Glu Leu Ser Val Gly Trp 
70 465 470 47S~~ 480 

Ser Gin Ser Gly Leu Leu Phe Arg Gly Ala He Lys Phe Thr Asn Phe 

485 490 495 

Ser Val Gly Asn Leu Leu His Pro Ser Met Tyr Lys Lys Gly He He 
500 505 510 

75 Pro Gin Gly Asp Gly Gin Thr Leu Ser Leu Ser Ala Gin Thr Asn Gly 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



70 







51 5 










5C0 










52 Z, 










T l j £ 


Tvr 


n 


Gin 


T* *r 




Val 


Thr 


fr'l.e 


Ilet 


Asp 


Pro 


Trp 


Phe 


Gl" 




530 










535 










5-JO 










'Jl 1 * 


L*"S 


g 


Pro 


Asp 


Met 


Phe 


Se r 


Ph A 


Sec 


Ala 


Phe 


Tyc 


Ser 


L**s 


Th r 


54 *> 










550 










555 










560 


Th r 


Alu 


lie 








The 


T-- 


Asn 


r 


Asn 


Ala 


Gly Asn 


I i r 


Tyr 










565 










57 0 










57 5 




Asn 


Al a 


T* T 


II., , r 

1 


Asn 


Ser 


T -r 
1 > L 




Asn 


Asn 


T /r 


At n 


Sec 


Tyr 


T\r 

1 1 L 


Arn 








580 










58b 










590 








Met 




Am 


T 

i yr 


Thr 


Gl y 


Asp 




Tyc 


Thr 


Gin 


Ala 


Ser 


Asp 


Pro 






595 










600 










605 








A^D 


Arg 


Ser 


Leu 


Gl rt 


Met 


Leu 


Gly 


The 


Set 


He 


Gly 


Tyr 


Gly 


Lys 


Ara 




61 0 










615 










620 










L~u 


Tlir 


_ 

irp 


P ro 


AS 


A^n 


Trp 


Phe 


Gin 


He 


Tyr 


Thr 


Ser 


Leu 


Asn 


i j r 


625 










630 










635 












Thr 




Tyr 


Arg 


Leu 


Arg 


Asn 


Trp 


Ser 


Tyr 


Asn 


Thr 


Phe 


Gin 


Asn 


Phe 










64 5 










650 










655 




His 


His 




Ser 


Al a 


sn 


Asp 


Leu 


Asn 


Leu 


Glu 


Leu 


Arg 


Leu 


Ser 


tg 








660 










665 










670 






r 


s~ - 


lis 


Asp 


n 


Pro 


He 


Tyr 


Thr 


Arg 


Ser 


Gly 


Ser 


Asp 


Phe 


et 






67 5 










680 










685 








Va 1 


s«r 


Vol 


Al 

Ala 


Al a 


Th 


Leu 


Pro 


Tyr 


Ser 


Leu 


Trp 


Asp 


Asn 


His 


Asp 




690 










695 










7 00 










Tyc 


Ala 


Ser 


Gin 


Asn 


Leu 


Ser 


Val 


Ser 


Asp Arg 


Tyr Arg 


Phe 


Tie 


Glu 


705 










710 










715 










ton 


Tyr 


His 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


Arg 


Val 


Phe 


Thr 


Pco 


Leu 


Leu 


Asn 










725 










730 










735 




Pro 


Ala 


Thr 


Has 


Lys 


Tyr 


Thr 


Pro 


Val 


Leu 


M-t 


Scr Ara 


Val 


Glu 


Gly 








740 










745 










750 




Al<* 


Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 


Asn 


Lys 


Lys 


Ser 


Pro 


Phe 


Gly 


Tht 






755 










760 










765 








Phe 


Tyr 


Het 


Gly 


Gly 


Asp 


Gly Met 


Ser 


Ser 


Tyr 


Tyr Gly Gly Tyr 


Met 




770 










775 










7B0 










Asn 


Glu 


Thr 


lie 


Gly 


Leu 


Arg 


Gly Tyr 


Lys 


Asn 


Gly 


Ser 


He 


Ala 


Gly 


785 










790 










7 95 










800 


Asn 


Asn 


Tyr 


Asp 


Tyr 


Ala 


Tyr 


Ala 


Tyr 


Met 


Arg 


Leu 


Thr 


Met 


Glu 


Leu 










805 










810 










815 




Arg 


Phe 


Pro 


lie 


Leu 


Phe 


Glu 


Asn 


Ser 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 








820 










825 










830 






Phe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp Arg 


Ser 


He 


Asp 


Asn 


Tyr 


Asn 


Pro 






835 










840 










845 








Phe 


Asn 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly Val 


Gly 


Leu 


Arg 


Val 


Thr 


Leu 


Pro 




850 










855 










860 










Met 


Val 


Gly 


Met 


Leu 


Gly 


He 


Asp 


Trp 


Gly 


Tyr 


GJy 


Phe 


Af p 


Arg 


Pro 


865 










870 










875 










880 


Asp Asn 


Ser 


Leu 


Gin 


Arg 


Gly Gly Ser 


Asn 


Val 


His 


Phe 


Val 


Leu 


Gly 










885 










890 










895 




Gin 


Glu 


Phe 





























(2) INFORMATION FOR SEQ. ID NO: 317 



(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 177 amino acids 
55 (B) TYPE: amino acid 

(D> TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 

60 (iii) HYPOTHETICAL: YES 

(Vij ORIGINAL SOURCE : 

(A} ORGANISM: Porphyromonas gingival is 

65 fix) FEATURE: 

\A) NAME /KEY : miscf eature 
<BJ LOCATION 1 177 



txa) SEQUENCE DESCRIPTION: SEQ ID NO: 317 



Gin lib Asn Met Asn Gly Asp Met Lys Arg Phe Leu He Leu He Gly 
15 10 15 

Phe Ala Leu Ala Val Ala Phe Ser Gly Phe Ser Gin Lys Phe Ala Leu 
20 25 30 

75 Val A3p Met Glu Tyr He Leu Arg Asn He Pro Asp Tyr Glu Met Met 
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35 40 4 5 

Asn Glu Gin Leu Giu Gin Val Ser Lys Lys Trp Gin Aen Glu lie Glu 

SO 55 60 

Ala Leu Glu A- a Glu Ala Gin Ser Met Tyr Lys L,s Tyr Gin Ser Asp 
65 70 it go 

Leu Val Phe Leu Sor Al~ Ala Gin Lys Ly^ Thr Gin Glu Glu Ala lie 

?5 90 05 

Val Ly? Lys Glu Gin Gin Ala Ser Glu Leu Lys Arg Lys Tyr Phe Gly 

100 105 no 

Pro Glu Gly Glu Leu Tyr Lys Lys Arg Ser Asp Leu Met Lys Pro II- 

115 120 125 

Gin As P Glu He Trp Asn Ala He Lys Glu lie Ala Lys Arg Asn Asn 

130 135 140 

Tjr Gin M-L Val Leu Asp Arg Gly Thr Ser Gly Tie He Phe Ala Ser 
15 I 45 150 155 160 

Pro Ser Tie Asp He Ser Asp Leu Val Leu Ser Lys Met Gly Phe Ser 
165 170 175 

Lys 



10 



20 



25 



40 



45 



50 



55 



60 



65 



70 



(2) INFORMATION FOR SEQ ID HO: 318 

li) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 170 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

30 (iiil HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginqivalis 

35 ixx) FEATURE: 

(A) HAT IE/ KEY : misc feature 
(B> LOCATION 1...170 



Ui) SEQUEIICE DESCRIPTION : SEQ ID NO: 318 



Arg He 


Asn 


Lys 


Gin 


His 


Glu 


Met 


Lys 


Lys 


Phe 


Phe 


Leu 


Met 


Leu 


Leu 


1 






S 










10 










15 




HeL Ala 


Leu 


Pro 


Leu 


Ser 


Leu 


Leu 


Ala 


Gin 


Lys 


Val 


Ala 


Val 


Val 


Asn 


Thr Glu 




20 










25 










30 






Glu 


He 


He 


Ser 


Lys 


Met 


Pro 


Glu 


Gin 


Val 


Ala 


Ala 


Thr 


Lys 




35 










40 










45 






Gin Leu 


Acn 


Glu 


Lou 


Ala 


Glu 


Lys 


Tyr 


Arg 


Leu 


Asp 


Leu 


Lys 


Ser 


Met 


50 










55 










60 








Asp Asp 
65 


Glu 


Phe 


Ala 


Lys 
70 


Lys 


Thr 


Glu 


Glu 


Phe 
75 


Val 


Lys 


Glu 


Lys 


Asp 
30 


Ser Leu 


Leu 


Glu 


Asn 


He 


Arg 


Asn 


Arg 


Arg 


Gin 


Gin 


Glu 


Leu 


Gin 


Asp 


He Gin 






85 










90 










95 


Thr 


Arg 


Tyr 


Gin 


Gin 


Ser 


Tyr 


Gin 


Thr 


Met 


Gin 


Glu 


Asp 


Leu 


Gin Lys 




IOO 










105 










110 




Arg 


Gin 


Gin 


Gin 


Leu 


Phe 


Ala 


Pro 


He 


Gin 


Gin 


Lys 


Val 


Ala 




115 










120 










125 






Asp Ala 


He 


Lys 


Lys 


Val 


Gly 


Asp 


Glu 


Glu 


Asn 


Cys 


Ala 


Tyr 


He 


Met 


130 










135 










140 








Glu Ala 


Gly 


Met 


Mot 


Leu 


Tyr 


Thr 


Gly 


Ala 


Thr 


Ala 


He 


Asp 


Leu 


Thr 


145 








150 










155 








160 


Ala Lys 


Val 


Lys 


Ala 
165 


Lys 


Leu 


Gly 


He 


Lys 
170 













(2J INFORMATION FOR SEQ ID HO: 31 9 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ilJ MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 
75 (vl) ORIGINAL SOURCE: 
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(A) ORGAllISIt: Porphyroinona r gingivaiir 

I IK) FEATURE : 

(A) HA! IE/KEY; mizc feature 
5 (B) LOCATION 1...826 

SEQUENCE DESCRIPTION SEQ IP UO:31? 

lie llet Lys Glii Ala He Pro Arg L/s Asn Lyr Tyr He Lys l.eu A?n 
10 1 5 10 15 

Gly He T^ r Arg Leu Ser Phe He Lsu Leu Cys Cys Leu Leu Cys Ser 

20 25 30 

Gin Ala Ala Met Ala Gin Gly Val Arg Val Sor Gly Tyr Val Leu Asp 
35 40 45 

15 Arg Gly Glu Lye Pro He Pro Phe Ala Gly Val Lys Val Arg Gly Thr 
50 55 60 

Gly Thr Gly Ala Thr Thr Asn Leu Lys Gly Tyr Tyr Glu Phe Arg Met 
65 70 75 80 

Lys Ala Thr Thr Asp Ser He Thr He Glu Phe Ser Ser Met Gly Tyr 
20 85 90 95 

Gin Gly Val Ser Arg Ser Phe Pro Ser Leu Thr Lys Asp Thr Arg Leu 
100 105 110 

■ Asn Val Arg Lou Ala Glu Ala Glu Met Glu Leu Ser Ser Val Thr Val 
115 120 125 

25 Gin Ala Thr Lys Arg Arg Leu Asn Thr Met Glu Arg Val Asn Thr Arg 
no 135 140 

A*p Leu Arg VaJ Asn Ala Gly Pro Thr Gly Gly Val Glu Ser Leu He 
145 150 155 160 

S«*r Thr Tyr Ala Gly Val Thr Gin Asn Asn Glu Leu Ser Ser Gin Tyr 
30 165 170 175 

Ser Val Arg Gly Gly Ser Tyr Asp Glu Asn Met Val Tyr Val Asn Gly 

180 185 190 

Val Glu Val Tyr Arg Pro Leu Leu Val Arg Scr Ala Gin Gin Glu Gly 
195 200 205 

35 Leu Ser Phe Val Asn Pro Asp Leu Thr Gin Ser Val Gin Phe Ser Ala 
210 215 220 

Gly Gly Phe Thr Ala Asp Tyr Gly Asp Lys Met Ser Ser Val Leu Asp 
225 230 235 240 

He Arg Tyr Lys Gin Pro Gin Glu Lys Glu Gly Ala Val Leu Leu Gly 
40 245 250 255 

Met Leu Gin Ser Ser Ala Tyr Tyr Gly Scr Ser Ala Gly Ala Phe Ser 

2S0 265 270 

Gin He Thr Gly Val Arg Tyr Lys Ser Ala Lys Ser Leu Leu Gly Thr 
275 280 285 

45 Thr Asp Thr Lys Ala Glu Tyr Asp Pro Ho Tyr Ala Asp Gly Gin Thr 
290 295 300 

Phe Met Thr Tyr Arg Phe Ser Pro Lys Leu Ser Val Ser Phe Leu Gly 
305 310 315 320 

Asn lie Ser Gin Thr Arg Tyr Lys Phe Val Pro Gin Thr Arg Glu Thr 
50 325 330 335 

Ser Phe Gly The Leu Ser Asp Ala Lys Lys Leu Lys He Phe Phe Asp 

340 345 350 

Gly Gin Glu Gin Asp Arg Phe L&u Thr Tyr Phe Gly Ala Phe Ser Met 
355 360 365 

55 Asn Phe Val Pro Asp Asp Lys Gin Arg His Thr Val Thr Leu Ser Ala 
370 375 380 

Phe Asn Ser Asn Glu Arg Glu Thr Tyr Asp lie Gin Gly Glu Tyr Phe 
305 390 395 400 

Leu Asn Asp Val Gin Leu Gly Ala Asp Gly Thr Ala Ser Met Ala Ser 
60 405 410 415 

Gly Ser Glu Asn Ser Asn Gly Leu Gly Tie Gly Arg Asn His Glu His 

420 425 430 

Ala Arg Asn Arg Leu Ser Tyr Arg Val Leu Asn Met Gly Tyr Arg Gly 
435 440 445 

65 Glu Met Lys Leu Asn Glu Lys His Arg Leu Gin Ala Gly Val Ser Ala 
450 455 460 

Gin Met Glu Lys He Ala Asp His He Ser Glu Trp Glu Arg Arg Asp 
465 470 475 480 

Ser Val Gly Tyr Asn Leu Pro His Ser Glu Thr Val Leu Leu Met Tyr 
70 *^ 485 490 495 

Asn Asn Leu Tyr Ala Asp Thr Gin Met Arg Gly Thr Arg Leu Ser Ala 

500 505 510 

Phe Val Gin Asp Arg Phe Asn phe Ser Met Gly Gly Gly Thr Phe Ser 
515 520 525 

75 Leu lie Pro Gly He Arg Ala Ser Trp Trp Ser Phe Asn Lys Glu Leu 
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10 



15 



20 



25 



30 



35 



40 



45 



70 



75 





530 










5 Jb 










54 0 










Leu 


Val 


Ser 


Fro 


Arg 


T 1 


S*r 


Val 


*3ly 


Tyr 


Ser 


Pro 


Glu 


Ser 


As:* 


Pre 


515 










550 










555 










560 


Ala 


Leu 


Val 


Leu 


Arg 


Ala 


Ala 


Ala 


Gly 


Leu 


Tyr 


Tyr 


Gin 


Ala 


Pro 


Phe 










565 










bVO 










57 5 




Tyr 


Ly^ 


Glu 


Ltu 


Arg 


Gin 


Thr 


His 


Lys 


Asp 


Ala 


Glu 


Gly 


Asn 


Asn 


Val 








53 0 










585 










590 






\'al 


Val 


Leu 


Asn 


Glu 


Lys 


11- 


Arg 


Ser 


Gin 


Gly 


Ala 


Phe 


His 


He 


Leu 






595 










600 










605 








Al c 


Gil 


Ala 


Asp 


Tyr 


Thr 


Ph^ 


Glu 


net 


Gly 


Gly 


Arg 


Lys 


Tyr 


Lys 


Phe- 














615 










620 










Thr 


Ala 


Glu 


Ala 


Tyr 


Tyr 


Lys 


Scr 


Leu 


Phe 


A?n 


He 


Asn 


Pro 


Tyr 


lie 












630 










635 










64 0 


lie 


Glu 


Asn 


Vol 


Lye 


He 


Arg 


Tyr 


Leu 


Gly 


Glu 


Asn 


He 


Gly 


Ser 


Gly 










645 










650 










655 




Tyr 


Ala 


Ala 


Gly 


lie 


Asp 


L^u 


Lys 


Leu 


Phe 


Gly 


Glu 


Leu 


Val 


Pro 


Gly 








660 










665 










67 0 






Val 


Arp 


Ser 


Trp 


Leu 


Thr 


Ala 


Ser 


He 


He 


Lys 


Ala 


Arg 


Gin 


Lys 


Leu 






67 5 










680 










685 








Asp 


Gly Tyr 


Gly 


Ser 


Leu 


Pro 


Leu 


Met 


Asn 


Ala 


Pro 


Thr 


Tyr 


Asn 


Phe 




690 










6 95 










700 










Ser 


Phe 


Phe 


Leu 


Gin 


GlU 


Tyr 


Val 


Pro 


Gly 


Asn 


Lys Arg 


He 


Thr 


Ala 


705 










710 










715 










720 


Thr 


Leu 


Arg 


Ala 


Al3 


Leu 


Ser 


Gly 


Gly 


Leu 


Pro 


Gin 


Leu 


Asn 


Pro 


Ser 










725 










730 










735 




Lys 


Gly 


Leu 


Ser 


Ser 


Pro 


Ala 


Phe 


Thr 


Ala 


Pro 


Ala Tyr 


Lys 


Arg 


Val 








740 










745 










750 






Asp 


Leu 


Gly 


Val 


Met 


Tyr 


Lys 


Trp 


Leu 


Asp 


Pro 


Asp Asp 


Ser 


Phe 


Ala 






7 55 










760 










765 








Gly 


Arg 


Ser 


Lys 


Trp 


L*u 


Met 


Gly 


Val 


Lys 


Gly 


Ala 


Tyr 


He 


Gly Ala 




770 










775 










7B0 








A^p 


Leu 


Phe 


Asn 


Leu 


Phe 


Asp 


Met 


Thr 


Asn 


Val 


Awn 


Ser 


Tyr 


Tyr 


Trp 


785 










790 










795 










800 


Val 


Ser 


Asp 


Ala 


Tyr 


Gin 


Gin 


Gin 


Tyr 


Ala 


Val 


Pro 


Asn 


Tyr 


Leu 


Thr 










805 










810 










815 




Arg 


Arg 


Gin 


Phe 


Asn 


Leu 


Arg 


Lea 


Leu 


Val 


Glu 


Phe 
















820 










825 

















(2) INFORMATION FOR SEQ ID NO: 320 

{!) SEQUENCE CHARACTERISTICS; 

tA> LENGTH: 67 9 amino acids 
(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

Ui) molecule type: protein 

till) HYPOTHETICAL: YES 

50 <vij ORIGINAL. SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

fix) FEATURE: 

(A) NAME/KEY: miscf eature 
55 (B) LOCATION 1...679 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

Pro 

60 



65 



Pro He Pro 


Phe 


Leu 


Leu 


Asn 


Met 


Tyr 


Ser 


Gly 


His 


His 


Lys 


He 


His 


1 




5 










10 










15 




Tyr Pro Phe 


Leu 


He 


Leu 


Leu 


Val 


Cys 


Leu 


Ala 


Phe 


Ala 


Ala 


cys 


Lys 




20 










25 










30 






Ser Val Lys 


Leu 


Lys 


Asp 


Ala 


Glu 


Lys 


Ala 


His 


Asp 


Arg 


Gin 


Glu 


Tyr 


35 










40 










45 








Thr Lys Ala 


Ala 


Asp 


Met 


Tyr 


Asn 


Thr 


Leu 


Tyr 


Arg 


Arg 


Thr 


Arg 


Arg 


50 








55 










60 










Lys Gin Val 


Glu 


Met 


Lys 


Ala 


Tyr 


Thr 


Ala 


Phe 


Arg 


Ser 


Gly 


Glu 


Asn 


65 






70 










75 








80 


Tyr Arg Ala 


Ala 


Gly 


Arg 


Gin 


Ala 


Lys 


Ala 


Leu 


-Arg 


Gly 


Tyr 


Leu 


Asn 






85 










90 










95 




Ala Arg Arg 


Tyr 


Gly 


Tyr 


Pro 


Asp 


Ser 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


Ala 




100 










105 










110 






Gin Thr Tyr 


Gin 


Gin 


Gly 


Gly 


Asn 


Tyr 


Lys 


Glu 


Ala 


Glu 


Val 


Leu 


Phe 


115 










120 










12S 








Arg Gly Tyr 


Leu 


Glu 


Ala 


Tyr 


Pro 


Lys 


Ser 


Tyr 


Phe 


Ala 


Ala 


He 


Gly 
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1-10 13S 1-J0 

Uu Glu Gl / Cyr L.?u Pl\e Ala Arq Gin Gin Lys Gl u Tyr Pro The Arg 
14 5 150 15S 160 

Tyr Arg II* Arg Arg Ala A-La l 3Lu Trp Asn Scr Ala Arg Gl y A^p Fhe 
5 ' 165 1^0 115 

Gly Pro Ala Ti r Ala Pro A-«p Ala Ser Ala Leu Tyr Phe Thr Ser Ser 

180 J 85 190 

Arg Ser Lys- Asp Asp Gly Leu A*-p Arn Ser Ser lie- Thr Gly Leu Lys 
195 200 205 

10 Pro Asn Atp lie Tyr lie He Lys Ara As-p Ala Gin Giy Aig Trp Gly 
210 215 220 

Arq Pro Asp Ser Val Ser Gly Gly lis Am Thr Pro Trp Asp Glu Gly 
225 230 235 240 

Val Pro Thr lie Thr Pro Asp Gly Ser Thr lie Tyr Tyr Thr Leu Ala 
15 245 250 255 

Gin Gin Gly Ala Asp Tyr Asp Arg Thr Val Gin He Tyr Ser Ala Ala 

260 265 270 

Arg Ser Gly Glu Gly Gly Trp Ser Asn Gly S<=r Leu Vul Asp He Met 
275 260 285 

20 Arg Arp Ser Leu Arg Met Ala Ala His Pro Ser Het Ser Ala Ser Gly 
290 295 300 

Acp Tyr Leu Tyr Phe Val Ser A-n He Gly Gly Ser Tyr Gly Gly Lys 
305 320 315 320 

Asp He Tyr Arg Val Lys Val Ser Asp Arg Ser Tyr Gly Ser Pro Glu 
25 325 330 335 

Atn Leu Gly Pro Asp He Asn Thr Pro Gly Asp Glu Met Phe Pro Phe 

340 345 350 

He Asp Gly Arp Ser Ttir Lsu Phe Phe Ala Ser A.?p Gly His Ala Gly 
355 360 365 

30 Leu Gly Gly Leu Asp He Phe Lys Ala Thr Leu Asp Ser Thr Gly Gin 
370 375 380 

Trp His Val Val Met Gly Gin Pro Val Asn Ser Ser Ala Asp Asp 

385 390 395 400 

Phe Gly Leu Ala Val Glu Pro Lys Gly Lysr Asn Lys Glu Glu Ala Leu 
35 405 410 415 

Pro Asp Asn Gly Val Lys Gly Vol Phe cys Ser Asn Arg Gly Asp Ala 

420 425 430 

Arg Gly Trp Pro His Leu Phe His Phe Glu Lau Pro Ala He Tyr Thr 
435 440 445 

40 Glu He Gin Gly Tyr Val Met Asp Arg Glu Glu Asn Pro He Ala Gly 
450 455 460 

Ala Thr Val Arq He Val Gly Glu Arg Gly Pro Val Gly Gin Gly Phe 
465 470 475 480 

Val Thr Thr Arg A^p Asp Gly Ser Tyr Lys Met Ser Val Gin Gly Asp 
45 485 490 495 

Thr Arg Tyr Val Met Leu Ala Gly Ala Ser Gly Tyr Leu Asn Gin Tyr 

500 505 510 

Val Glu Leu Lys Thr Asp Thr Ala Lys Gin Ser Glu Thr Tyr Tyr Val 
515 520 525 

50 Acp Phe Phe Leu Ala Ser Arg Glu Lys Ala Glu Gly Leu Gin Asn He 
530 535 540 

Phe Tyr Aep Phe Asp Lys Ala Thr Leu Arg Pro Glu Ser Met Lys Ser 
54 5 550 555 560 

Leu Asp Glu Leu He Arg He Leu Thr Asp Asn Pro Asp He Arg He 
55 56S 570 575 

Glu Leu Gly Ser His Ala Asp Arg Lys Gly Pro Asp Ala Tyr Asn Leu 

580 585 590 

Gly Leu Ser Asp Arg Arg Ala Lys Ser Val Val Aap Tyr Leu Thr Ser 
595 600 605 

60 Arg Gly lie Ala Ala Asp Arg Leu Thr Trp Lys Gly Tyc Gly Lys Ser 
610 615 620 

Val Pro Lys Thr Val Thr Ala Lys He Ala Glu Arg His Asp Phe Leu 
625 630 635 640 

Lys Glu Gly Asp Val Leu Thr Glu Glu Phe Val Ala Pro Leu Thr Glu 
65 64b 650 $55 

Glu Gin Gin Ser Val Cys Asp Gin Leu Asn Arg Arg Thr Glu Phe Arg 

660 665 670 

Val He Glu Glu Glu Leu Arg 
67 5 



70 



12) II1FORIIATION FOR SEQ ID NO; 321 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 amino acids 
75 (B) TYPE: amino acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99729870 



243/490 

(D) TOPOLOGY: linear 

(lit MOLECULE TYPE: protein 

5 <iii) HYPOTHETICAL: YES 

<vi> ORIGINAL SOURCE; 

(A) ORGANISM: Porphyromonas gingival is 

10 (ix) FEATURE: 

(A) MAI IE/ KEY: mi^Ceature 
tR) LOCATION 1 . . .77 2 



15 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 



Pro Ala Gin Pro Lys Thr Tyr Cys He Arg Tyr Phe Arg Arg Glu Val 
Ser Pro Arg Axg Gin Thr Glu Arg Thr Leu Tyr Ala Asp Lys Met Arg 

nn 20 25 30 

ZU Arc * His He Val ^ Cys Cys Pro Ser Ala Ser lie Leu Leu Gin Glu 

35 40 45 

Tyr Ser Thr Thr Glu Leu Phe He His His Phe Asn He S~r He II- 
50 55 60 

nez "5 t hYS Lys Phe Phe Pne Ala Leu Leu SeA ' II* Gly He Ser Ala Gin 
Zt> 65 70 7b bo 

Ala Phe Ala Lys Thr Asp Asn Val Pro Thr Asp Ser Leu Arg Val His 

85 90 95 

Asn Leu Gin Thr Val Thr Val Tyr Ser Thr Arg Thr Ala Val Pro Leu 

100 105 no 

Lys Lys He Pro Ala Lys Met Glu Leu He Ser Ser Ara Asn He Ly« 

115 120 125 

Gin Ser Gly Phe Asn Asn Met Thr Asp He Leu Lys Thr Gin Ser Ser 

130 135 140 

Leu Asp Val He Gin Tyr Pro Gly Phe Ser Ser Asn He Gly He Arg 
145 150 155 160 

Gly Phe Lys Pro Ser Gly Lys Tyr Val Thr Val Leu Val Asn Gly He 

165 170 175 

Pro Ala Gly Thr Asp Asn He Ser Thr Leu Asn Thr Ser Asn He Glu 

180 185 190 

Gin He Glu He Leu Lys Gly Pro Phe Ser Ser He Tyr Gly Thr Asn 

195 200 205 

Ala Met Gly Gly Val Val Asn He He Thr His Lys Ser Lys Asp Lys 

210 215 220 

lie His Gly Asn Val Ser Leu Phe Gly Gly Ser Tyr Gin Thr Met Ala 
22S 230 235 240 

Gly Ser Phe Asn Lou Gly Gly Arg Phe Glu Asp He phe Ser Phe Asp 

245 250 255 

Leu Ser Leu Gly Leu Asp Lys Gin Asn Lys Asp Tyr Lys Thr Gly Ser 

260 265 270 

Asn Asn Phe Leu Ser Leu Ser Lys Leu Glu Glu Ala He Val Asp Val 

275 280 285 

Asn Ala Thr Lys Asn Lys Lys Met Lys Gly Ser Asp Tyr Thr Val Ala 

290 295 300 

Thr Gly Arg Leu Arg Phe Gly He Asp Phe Thr Pro Glu Trp Ser Leu 
305 310 315 320 

Asn Leu Tyr Gin Asn Val Phe Leu Gly Asp Ala He Pro Val Gly Gly 

325 330 335 

Ser He Trp Gly Val Tyr Gly Glu Ser Lys Lys Asn Leu Asn Arg Ser 

340 345 350 

Ser Thr Ser Phe Glu Leu Leu Gly Lys His Gly Cys His Thr Lou Gin 

355 360 365 

Phe Ser Pro Tyr Phe Asn He Glu Lys Ser Glu Asn Tyr Asn Asn Ala 

370 375 380 

Asp Pro Thr Gly Phe He Asn Tyr Lys Ser Asp Tyr Tyr Thr Tyr Gly 
385 390 395 40Q 

Ala Leu Leu Gin Asp Lys He Ser Phe Gly Gly Gin Asn He Val Leu 

405 410 415 

Gly Val Asp Ser Arg Asn Met Thr Met Glu Ser Glu Arg Phe Glu Gin 

420 425 _ 430 

Ala Gly Val Asn Thr Lys Pro Tyr Asn Pro Gly Tyr Ala Thr Asn Asn 

135 440 445 

He Gly Leu Phe Gly Gin Ala Asn Phe Tyr Leu Leu Asn Asp Ala Leu 

4 50 455 460 

Ser He Ser Ala Gly Ala Arg Ala Asp Phe Mot Phe Phe Asp Leu Lys 
465 47 0 4 75 4B0 
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50 



60 



65 



70 



75 



Ala 


Am 


Glu 


T/r 


Leu 


Asn 


Asa 


Glu 


Ala 


Lys 


Gli. 


Glu 


Thr 


His 


Am 


val 










485 










4 90 














11c 




Pro 


A^n 


Val 


Gly 


lie 


Lys 


Tyr 


Glu 


Phe 


Val 


Lys 


Giy 


Leu 


The 








500 










505 










510 






Ala 


His 


Gl/ 


Thr 


Phe 


Gly 


Ser 


Ala 


Phe 


Ser 


Ala 


Pro 


Asp 


Ala 


Phe 


Gin 






515 










S20 










525 








Lys 


Ala 


Gly 


Gin 


Tyr 


Val 


Gly 


Pro 


Phe 


Gly 


Thr 


Thr 


He 


Gly 


Asn 


Pro 




530 










535 










540 










Asp 


Leu 


Ly^ 


Pro 


Glu 


Lys 


Str 


Met 


Thr 


Trp 


Asp 


Phe 


Gly 


He 


Gly 


Tyr 












550 










555 










560 


Ser 


Afn 


Ala 


ArQ 


Cys 


Gly 


lie 


Gin 


Ala 


Ajt F 


Val 


Thr 


Leu 


Thr 


Tyr 


Phe 










565 










57 0 










57 5 




His 


Thr 


Asp 


His 


Lyt 


Asp 


Leu 


lie 


Leu 


Ser 


Ser 


Pro 


Asp 


Tyr 


Ala 


Asn 








580 










585 










590 






Am 


lie 


The 


Thr 


Tyr 


lie 


Asn 


Ala 


Asp 


Lys 


Ala 


Arg 


HeL 


Ser 


Gly He 






595 










600 








605 








Glu 


Ala 


Leu 


Leu 


Ser 


Tyr 


A-p 


Phe 


GLy 


Ser 


Leu 


Phe 


Ala 


Asn 


Lys 


Phe 




610 










615 










620 










Ser 


Leu 


Arq 


Ala 


Phe 


Ala 


Asn 


Ala 


Thr 


He 


Met 


Leu 


At'n 


Ser 


Glu 


HeL 


625 










630 










635 










640 


Lys 


Lys 


Ser 


Gin 


Thr 


Asp 


Ala 


Pro 


Trp 


Ser 


Glu 


Met 


Tyr 


Tyr 


Val 


Arg 










64b 










650 










655 




i.ys 


G3 n 


A?*n 


lie 


Thr 


Phe 


Gly 


He 


Glu 


Tyr 


Arq 


Gly 


Lys 


Glu Gly 


Leu 








660 










665 










67 0 






Glu 


Val 


M*t 


Leu 


Asn 


Gl y 


Arg 


Phe 


Met 


Gly 


Arg 


Arg 


He 


Gl u 


Gin 


A?n 






675 










680 








685 








Trp 


Tyr 


Ala 


Tyr 


Tyr 


Pro 


Glu 


Val 


Acg 


Pro 


Glu 


Leu 


Gin 


Gin 


Leu 


Leu 




690 










655 










700 










Ala 


Aid 


Glu 


Glu 


Pro 


Glu 


Leu 


Ala 


Ala 


Gin 


Gly 


Leu 


Leu 


Arg 


His 


Pro 


705 










710 










715 










720 


Gin 


AJ a 


Met 


Val 


Phe 


Asn 


Ala 


Ser 


Ala 


Tyr 


Tyr 


His 


Met 


Asn 


Lys 


Tyr 










725 










7 30 










735 




Leu 


Thr 


Phe 


Gly 


Val 


flsn 


Leu 


Asn 


Asn 


He 


Leu 


Asp 


Glu 


Leu 


Tyr 


Thr 








740 










745 










750 






Glu 


Lyr? 


Asp 


Gly 


Tyr 


His 


Met 


Pro 


Gly Arg 


Asn 


He 


Met 


Gly 


Lys 


Val 






755 










760 










765 








Met 


Val 


Asn 


Phe 




























770 































10 



15 



20 



25 



30 



35 



40 (2i INFORMATION FOR SEQ ID HO: 322 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 84 amino acids 

(B) TYPE: amino acid 
45 ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

tiii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE; 

<AI ORGANISM: Porphyromonas gingival!* 



lix) FEATURE: 
55 (A) MAHE/KEY: misc_f eature 

(Bl LOCATION 1...4 84 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 322 



Gly 


Arg 


Ala 


Ser 


Ser 


Pro 


Tyr 


Arg 


Gin 


Met 


Asp 


Gly 


He 


Leu 


Asn Asp 


1 








5 










10 










15 


Glu 


Tyr 


Arg 


Gin 


Ala 


Ser 


Met 


Asn 


Arg 


Phe 


Ser 


Asn 


His 


Trp 


Pro Cys 








20 










25 










30 




He 


Leu 


Val 


Gly 


Phe 


Val 


Leu 


Trp 


Phe 


Val 


Ser 


Ala 


Ser 


Arg 


Thr Val 






35 










40 










45 






Ala 


Gin 


Asn 


Ala 


Ser 


Glu 


Thr 


Thr 


Val 


Ser 


Tyr 


Asp 


Thr 


Asp 


Thr Ala 




50 










55 










60 








Val 


Leu 


Ser 


Glu 


Ala 


Asp 


Val 


Leu 


Arg 


He 


Ala 


Leu 


Ser 


Glu 


Asn Ala 


65 










70 










75 








60 


Thr 


Val 


Lys 


Val 


Ala 


Asp 


Met 


Asp 


Val 


Arg 


Lys 


Gin 


Glu 


Tyr 


Ala Arg 










85 










90 












Arg 


Ala 


Ala 


Arg 


Ala 


Asp 


Leu 


Phe 


Pro 


Lys 


Val 


Asp 


Leu 


Asn 


Gly Val 








100 










105 










HO 




Tyr 


Ser 


His 


Thr 


Leu 


Lys 


Lys 


Gin 


Val 


Leu 


Tyr 


He 


Asp Met 


Pro Gly 






115 










120 








125 
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65 



The 


Ser 


Ser 


Ser 


Glu 


Gly 


He 


Glu 


Met 


Gly 


Arg 


Thr 


His 


Asn 


Thr 


Gin 














13S 










140 










Gl^ 




v*i 


A;n 


Val 


Ser 


Met 


Pro 


Leu 


Val 


Ser 


Ala 


Gin 


Leu 


?rp 


Lys 


1-15 










150 










155 








160 


Ser 


He 


Ala 


Met 


Thr 


Gly 


Glu 


Gin 


Leu 


Asp 


t.eu 


Ala 


Leu 


Glu 


Lys 


Ala 










165 










170 










175 




Arg 


Ssr 


Ser 


Arg 


lie 


f\sp 


Leu 


Val 


Al a 


Glu 


Val 


Lys 


Lys 


Ala 


Tyr 


Leu 








180 










185 










190 






Ser 


Val 


Leu 


Leu 


Ala 


Glu 


Arp 


Ser 


T**r 


Gly 


Val 


Phe 


Lys 


Arg 


Ser 


Tyr 






1?S 










200 










205 










A^n 


Ala 


Leu 


Ala 


Asn 




- 


Asn 


He 


Ser 


rt-p 


Lys 


Phe 


Asp Arg 




210 










215 










220 










Gl" 


Leu 


V*l 


Ala 


Glu 


Tvr 


Asp 




lie 


Arg 


Ala 


Asn 


Val 


Gl ti 


Val 


Arg 












230 




















24 0 


A**n 


Tie 


Clu 


Pro 


Asn 




Leu 


Gin 


Ala 


31 n 


Asn 


Ser 


Val 


Ala 


Levi 


Ala 










245 










250 










255 




L*»u 


Trp 


Gin 


Leu 


Lys 


Val 


L~u 


Met. 


Ser 


Met 


Glu 


Val 


Glu 


Thr 


Pro 


He 








260 










265 










270 






Arg 


Lou 


Ser 


Gly 


Ser 


Leu 


Ser 


Asp 


1 j L 


Lys 


Glu 


Gin 


Val 


Tyr 


Thr 


Gly 






275 










2 80 










285 








Tyr 


Fh~ 


Ala 


Ala 




Thr 


L^u 


I le 


Ser 


Asn 


Asn 


Ser 


Ser 


Leu 


Arg 


Gin 




2 90 










295 










300 








Leu 


Asp 


He 


Gin 


Arg 


Arg 


Leu 


Ala 


val 


Ser 


Ala 


Asp 


Lys 


Leu 


Asn 


Lys 


305 










310 










315 










320 


Tyr 


Ser 


Phe 


Leu 


Pro 


Thr 


Leu 


Asn 


Lou 


Gly Gly 


Gin 


Tyr 


Thr 


Tyr 


Ser 










325 










330 










335 




Leu 


Acn 


Ser 


Am 


Asp 


He 


Lys 


Phe 


Trp 


Gly 


Glu 


Gly 


Gin 


Arg 


Trp 


Thr 








34 0 










345 










350 






Pro 


Phe 


Ser 


Thr 


He 


Ser 


Leu 


Ser 


Leu 


Tyr 


He 


Pro 


He 


Phe 


Asn 


Gly 






355 










360 










365 






Gly 


Lys 


Arg 


I.su 


Tyr 


Asn 


Val 


Lys 


Gin 


Ser 


Ala 


Leu 


Ser 


Tie 


Arg 


Gln 




370 










375 










380 








He 


Asp 


Leu 


Gin 


Arg 


Arg 


His 


He 


Glu 


Gin 


Ser 


lie 


Arg 


Met 


Gly 


He 


385 










390 










395 










4 00 


Lys 


Asn 


Gin 


Asn 


Asp 


Arg 


Leu 


Arg 


Thr 


Cys 


Met 


Gin 


Arg 


Phe 


Val 


Ala 










405 










410 










415 




Ser 


LJlU 


Glu 


Ala 


Val 


Arg 


Ser 


Ala 


Glu 


Lys 


Gly 


Tyr 


Gin 


He 


Ala 


Glu 








420 










425 










430 






Lys 


Arg 


Tyr 


Gin 


Thr 


Gly 


Glu 


Gly 


Thr 


Leu 


Val 


Glu 


Leu 


Asn 


Asp 


Ala 






435 










440 










44b 








Asp Val 


Ala 


Leu 


Leu 


Gin 


Ala 


Arg 


Leu 


Asn 


Tyr 


Asn 


Gin 


Ala 


He 


Phe 




4 50 










455 










460 










Asp 


Phe 


Met 


Thr 


Ala 


Lys 


Ala 


Glu 


Leu 


Arp 


Lys 


Met 


Asn 


Gly Met 


Gly 


465 










470 










475 










460 


lie 


Pro 


Glu 


Gin 



























10 



15 



20 



25 



30 



35 



40 



45 



(2) INFORMATION FOR SEQ ID HO: 323 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 0 amino acids 
IB) TYPE: amino acid 
(D> TOPOLOGY: linear 

55 (11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 
60 (A) ORGANISM: Porphyromonas gingivalis 

( IX ) FEATURE : 

(A) NAME/ KEY: mi scf eat u re 
[B> LOCATION 1...54 0 



|xi) SEQUENCE DESCRIPTION: SEQ ID HO: 32 3 



Phe His Asn Phe Asp Phe Leu Asn Gly He Lys Leu Phe Ser Met Ala 

1 5 10 - IS 

70 Asn Asn Thr Leu Leu Ala Lys Thr Arg Arg Tyr Val Cys Leu Val Gly 

20 25 30 

Phe Cys Trp Leu Met Ala Met Met His Leu Ser Gly Gin Glu Val Thr 

35 40 45 

Met Trp Gly Asp Ser His Gly Val Ala Pro Asn Gin Val Arg Arg Thr 

75 50 55 60 
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Leu Val Lys Val Ala Lsu S~r Glu Ser I.pu Pro fro Gly Ala Lys Gin 
65 70 75 BO 

lie Arg lie Gly The Ser Leu Pro Lys Glu Thr Glu Glu Lyr Val Thr 
85 90 95 

5 Ala Leu Tyr Lsu Leu Val Ser Asp Spr Leu Ala Val Arg Asp Leu Pro 
100 105 110 

Asp Tyr Lys Gly Arg Val Ser Tyr Asp Sei Phe Pro He Ser Lys Glu 

115 120 12^ 

Asp Arg Thr Thr Ala Leu Ser Ala A:rp S-r Val Ala Gly Acq Arg Phe 
10 130 135 140 

Phe Tyr Leu Ala Ala Asp He Gly Pro Val Ala Ser Phe Ser Arg Ser 
145 150 155 160 

Asp Thr Leu Thr Ala Arg Val Glu Glu Val Ala Val Asp Gly Arg Pro 
165 170 175 

15 Leu Pro Leu Lys Glu Leu Ser Pro Ala Ser Arg Arg Leu Tyr Arg Gly 
190 165 190 

Tyr Glu Ala Leu Phe Val Pro Gly Asp Gly Gly Ser Arg Asn Tyr Arg 

195 200 205 

lie Pro Ala He Leu Lys Thr Ala Asn Gly Thr Lsu He Ala Met Ala 
20 210 215 220 

Asp Arq Arg Lys Tyr Asn Gin Thr Asp Leu Pro Glu Asp He Asp He 
225 230 235 240 

Val Met Arg Arg Ser Thr Asp Gly Gly Lys Set Trp Ser Asp Pro Arg 
245 250 255 

25 He He Val Gin Gly Glu Gly Arg Asn His Gly Phe Gly Asp Val Ala 
260 265 270 

Leu Val Gin Thr Gin Ala Gly Lys Leu Leu Met He Phe Val Gly Gly 

275 280 285 

Val Gly Leu Trp Gin Ser Thr Pro Asp Arq Pro Gin Arg Thr Tyr He 
30 290 295 300 

Sec Glu Ser Arg Asp Glu Gly Leu Thr Trp Ser Pro Pro Arg Arp He 
305 310 315 320 

Thr Hjls Phe He Phe Gly Lys Asp Cys Ala Asp Pro Gly Arg Ser Arg 
32S 330 335 

35 Trp Leu Ala Ser Phe Cys Ala Ser Gly Gin Gly Leu Val Lsu Pro Ser 
340 345 350 

Gly Arg He Thr Phe Val Ala Ala He Arg Glu Ser Gly Gin Glu Tyr 

355 360 365 

Val Leu Asn Asn Tyr Val Leu Tyr Ser Asp Asp Glu Gly Asp Thr Trp 
40 370 375 380 

Gin Leu Ser Asp Cys Ala Tyr Arg Arg Gly Asp Glu Ala Lys Leu Ser 
385 390 395 400 

I.ru Met Pro Asp Gly Arg Val Leu Met Ser Tie- Arg Asn Gin Gly Arg 
405 410 415 

45 Gin Glu Ser Arg Gin Arg Phe Phe Ala Leu Ser Ser Asp Asp Gly Leu 
420 425 430 

Thr Trp Glu Arg Ala Lys Gin Phe Glu Gly He His Asp Pro Gly Cys 

435 440 445 

Asn Gly Ala I let Leu Gin Val Lys Arg Asn Gly Arg Asp Gin Val Leu 
50 4S0 455 460 

His Ser Leu Pro Leu Gly Pro Asp Gly Arg Arg Asp Gly Ala Val Tyr 
465 470 475 480 

Leu Phe Asp His Val Ser Gly Arg Trp Ser Ala Pro Val Val Val Asn 
465 490 495 

55 ' Ser Gly Ser Ser Ala Tyr Ser Asp Met Thr Leu Leu Ala Asp Gly Thr 
500 505 510 

He Gly Tyr Phe Val Glu Glu Gly Asp Glu He Ser Lsu Val Phe He 

515 520 525 

Arg Phe Val Leu Asp Asp Leu Phe A3p Val Arg Gin 
60 530 535 540 

(2) I H FQ R14AT I OH FOR SEQ ID NO: 324 

ti) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH : 2 93 amino acids 

|B) TYPE: amino acid 
(01 TOPOLOGY i linear 



70 



75 



iii) MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 
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10 



15 



20 



25 



30 



35 



40 



45 



U-) FEATURE : 

(^) MAI IE/ KEY : miscjeature 
fP) LOCATION 1...293 

<.-.!> SEQUENCE DESCRIPTION: SEQ ID HO: 324 



Ly* 


Ser 


Pro 


Ser 


Asp 


Ser 


Ala 


Leu 


Pro 


Ser 


Val 


Trp 


Ser 


Ala 


Asn Thr 


1 








5 










IO 










15 


lie 


Ser 


Cily 


Leu 
20 


Val 


Gly 


Gly 


Lys 


Arg 

2b 


He 


Thr 


Leu 


Leu 


He 
30 


Leu Tyr 


Ser 


Met 


Ala 


lie 


Arg 


Ser 


Ser 


Asn 


He 


Glu 


S*r 


He 


Gin 


Cys 


Phe Val 






35 










40 










45 




llet 


Lys 


Lys 


Glu 


Lys 


Leu 


Trp 


He 


Ala 


He 


Val 


Ala 


Gly 


Leu 


Ala Phe 




50 










55 










60 




Val 


Leu 


Gly 


Lsu 


Tyr 


Ala 


Leu 


Gly Arg 


Ser 


Val 


Ala 


Gln 


Leu 


Arg Arg 


65 










70 










75 








8 0 


Ser 


Gin 


Pre 


Ser 


Val 


Thr 


Val 


Thr 


Gly 


Met 


Ala 


Glu 


Arg 


Asn 


Phe L-'s: 










05 










90 








95 


Ser 


Arp 


Leu 


I le 


Val 


Trp 


Thr 


Ala 


Ser 


Tyr 


Gin 


Leu 


Gin 


Met 


Met Asp 








100 










105 










110 


Leu 


GlU 


Ser 
115 


Ala 


Tyr 


Lys 


Ala 


Leu 

120 


Lys 


Glu 


Lys 


Gin 


lie 
125 


Leu 


Val Ala 


Asp 


Tyr 


Leu 


Lys 


Asn 


Lys 


Gin 


Leu 


Pro 


Asp 


Ser 


Ser 


Tyr 


He 


Phe Ser 




130 










135 










140 






Ser 


Val 


Ala 


He 


Ser 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Arg Gin 


145 










150 










155 






1 60 


Glu 


Gin 




Val 


Arg 
165 


Thr 


Phe 


Ala 


Gly 


Tyr 
170 


Leu 


Leu 


Ser 


Gin 


Thr Val 
175 


Thr 


Val 


Thr 


Ser 


Gin 


Asp 


He 


Glu 


His 


Val 


Glu 


Lys 


Ile 


Ser 


Arg Asp 


lis 


Thr 




180 










185 










190 


Glu 


Lsu 


He 


Asn 


Gin 


Gly Val 


Glu 


lie 


Thr 


Ser 


Asp 


Arg Pro 






195 










200 










205 


Ala 


Tyr 


Tyr 


Tyr 


Thr 


Lys 


Leu 


Asn 


Asp 


Leu 


Lys 


Val 


Glu 


Met 


Leu Arq 




210 










215 










220 






Asn 


Ala 


Ser 


Glu 


Asp 


Ala 


Phe 


Asn 


Arg 


Ala 


Ser 


Val 


He 


Ala 


Glu Gly 
240 


225 


Gly 








230 










235 








Ser 


Ser 


Ser 


Val 


Gly 


Lys 


Met 


Leu 


Ser 


Ser 


Ser 


Met 


Gly 


Val Phe 


Gin 








245 










250 








255 


He 


Val 


Gly 


Leu 


Asn 


Ser 


Asn 


Glu 


Asp 


Tyr 


Ser 


Trp 


Gly 


Gly Ser 


Phe 






260 










265 










270 


Asn 


Thr 


Ser 


Ser 


Lys 


Met 


Lys 


Thr 


Ala 


Ser 


He 


Thr 


Val 


Lys Ala 






275 










280 










285 




Ser 


Phe 


Ala 


Leu 


Lys 
























290 



























(2) INFORMATION FOR SEQ ID HO: 325 



11) SEQUENCE CHARACTERISTICS: 
e _ CA) LENGTH: 280 amino acids 

50 (B) TYPE: amino acid 

t D ) TOPOLOGY : 1 ine a r 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas gingivalis 



60 
65 

G1 V 
1 

Leu 

70 Val 

Gin 

He 
75 65 



tlx) FEATURE: 

(A) NAME/ KEY: misc_feature 
[B> LOCATION 1...2B0 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:325 



Lys 


Leu 


Gin 


Met 


Lys 


Lys 


Thr 


He 


Ala 


He 


He 


Ala 


Ser 


Ala 


Leu 


Gly Ala 


Val 


Gly 


Cys 


Lys 


10 
Lys 


Asn 


Ala 


Asp 


Thr 






20 










25 








30 


Ser 


Glu 


Lys 


Asp 


Ser 


He 


Ala 


Leu 


Ser 


n-r 


Gly 


He 


Leu 




35 










40 








45 




Asp 


Phe 


Ala 


Asn 


Gin 


Phe 


Glu 


Met 


Ser 


Arg 


Leu 


Gin 


Gly 


50 










55 








60 




Asp 


Ser 


Val 


Ala 


Phe 


Leu 


Asp Gly 


Phe 


Lys 


Tyr 


Gly 


He 



75 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



O v3 Lo S It H w tu» ^ ^ ^ " 



WO 99/29870 PCT/AU98/01023 

248/ 490 



45 



50 



55 



60 



65 



70 



75 



Thr 


Arg 


Phe 


Ser 


Tyr 
85 


Arn 


Leu 


Gly 


Ala 




Tyr 


Ala 


Ser 


Atn 


He 
95 


Ala 


Arg 


Gin 


L-u 


Ala 
100 


His 


A^p 


Set 


He 


Asp 
105 


He 


A.»p 


Lys 


Phe 


Tyr 
110 


Ala 


Ala 


net 


Arg 


Ala 
115 


Ala 


Leu 


Leu 


l.y? 


Asp 
120 


Thr 


val 


Set 


He 


Ala 
125 


Met 


Lys 


Pre 


Ala 


Asp 
130 


Ala 


Gin 


Ala 


Phe 


Met 
13S 


Gin 


Arg 




Gin 


Ala 
140 


Lys 


Ly* 


Gin 


Arg 


Glu 


Asa 


Arn 


Met 


Lyj> 


Gin 


Phe 


Gly 


Gin 


Asr. 


lie 


Glu 


Lys 


Gly 


Arn 


Glu 


145 










150 










1 55 










160 


Tyr 


He 


Asp 


Thr 


Phe 


Lys 


Lys 


Glu 


Asp 


Gly 


Val 


Thr 


Val 


Thr 


Thr 


Thr 










165 








17') 










ns 




Gly 


Leu 


Ala 


Tyr 


Lys 


Thr 


Leu 


Gin 


Glu 


Gly 


Thr 


Gly Ala 


Thr 


Pro 


Ser 








180 










185 










190 






Leu 


Aid 


Asp 
195 


Thr 


Vol 


Arg 


Val 


Lys 
200 


Tyr 


Val 


Gly 


Thr 


Leu 
205 


Val 


Asp 


Gly 


Lys 


Glu 
210 


Phe 


Asp 


Lys 


Asn 


Glu 

215 


Glu 


Gly 


He 


Giu 


Phe 
220 


Ala 


Val 


Thr 


Gly 


Val 


11* 


hys 


Cli- 


Trp 


Thr 


Glu 


Ilet 


Leu 


Gin 


Leu 


Met 


Lys 


Val 


Gly 


Gin 


225 










230 










235 










240 


Lys 


Val 


Arg 


val 


Val 


He 


Pro 


Gl n 


Glu 


Leu 


Ala 


Tyr 


Gly Glu 


Thr 












245 










250 










25S 




Arn 


Tyr 


Thr 


He 
260 


Glu 


Pro 


Phe 


Ser 


Thr 
265 


Leu 


Thr 


Phe 


Glu 


Met 
270 


Glu 


Leu 


He 


Gly 


He 
2"?S 


Lys 


Pro 


Gly 


Lys 


Lys 
280 



















10 



15 



20 



25 



(2) III FORI 1 AT I0W FOR SEQ ID HO: 326 

30 <H SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 amino acids 
\B) TYPE: amino acid 
ID) TOPOLOGY: linear 

35 (ill MOLECULE TYPE: protein 

(ill) HYPOTHETICAL; YES 

(Vi) ORIGINAL SOURCE: 
40 (AJ ORGANISM: Porphyrons nas gingivalis 



Ux( FEATURE: 

<A» NAME /KEY : misc_feature 
(B| LOCATION 1...803 

(Xi) SEQUENCE DESCRIPTION: S£Q ID 110:326 



Cys 


Arg 


Lys 


Gin 


Lys 


Leu 


Ser 


Asn 


Pro 


L/s 


Arg 


Ser 


Arg 


Tyr 


Pro cyst 


1 








5 










le 










15 


Arg 


Arg 


He 


Ser 


Phe 


Arg 


Gin 


Ser 


Asp 


Lys 


Asn 


Pro 


Met 


Lys 


Val Leu 








20 










25 










30 




Arg 


Gin 


Val 


Phe 


Lsu 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Leu 


Thr 


Gly Ala Cys 






35 










40 










45 






Ser 


Thr 


Thr 


Lys 


Asn 


Leu 


Pro 


Glu 


Gly 


Glu 


Gin 


Leu 


Tyr 


He 


Gly Met 




50 










55 










60 








Gly 


Lys 


Thr 


Gin 


He 


Leu 


Arq 


Gin 


Asp 


Lys 


Ser 


His 


Ala 


Gly Gin Gin 


55 










70 










75 








80 


Ala 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 


Pro 


Asn 


Gly Ala 










85 










90 










95 


He 


Phe 


Gly 


Ser 


Ala 


Ser 


Ala 


Ser 


Leu 


Pro 


Lys 


He 


Pro 


Phe 


Gly Leu 








100 










105 








110 




Trp 


Leu 


Tyr 


Asn 


Ser 


Phe 


Val 


Gly 


Asp 


Ser 


Thr 


Val 


He 


Ser 


Lys Trp 






115 










120 










125 






lie 


Phe 


Asp 


Lys 


Phe 


Ala 


Ala 


Lys 


Pro 


Val 


Phe 


He 


Ser 


Gin 


Val Lys 




130 










135 










140 








Ser 


Asp 


Ser 


Arg 


Ala 


Lys 


Val 


Ala 


Thr 


Asn 


He 


Leu 


Arg 


Glu 


His Gly 


145 










150 










155 








160 


Tyr 


Phe 


Asp 


Ala 


Lys 


Val 


Lys 


Ser 


Ser 


Val 


Thr 


Thr 


Leu 


Lys 


Lys Asp 










165 










170 










175 


Ser 


Leu 


Lys 


Ala 


Lys 


He 


Ser 


Tyr 


Thr 


Val 


Asp 


Met 


Ala 


Ser 


Pro Tyr 








180 










185 










190 




His 


Tyr 


Asp 


Ser 


He 


He 


Pro 


Leu 


Pro 


He 


Ser 


Thr 


Phe 


Pro 


Asp Ser 






195 










200 










205 






He 


Leu 


Ala 


Tyr 


Arg 


Gin 


Thr 


Pro 


Ser 


Leu 


He 


Arg 


Lys 


Gly Asp Gin 




210 










215 










220 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Phe 






Ale 


Lye 


Leu 


His 


Glu 


Glu 


Arg 


Gin 


Thr 


He 


Ser 


Ala 


Leu 












230 










235 












Leu 


Arg 


A-p 


Asn. 


Gly 


Tyr 


Tyr 


Tyr 


Phe 


Arg 


Pro 


Gin 


Arp 


He 


He 


Tyr 




















250 










255 




Glu 


Ala 


Asp 


Thr 


Leu 


Leu 


Val 


Arg 


Gly 


Ala 


Val 


Cys 


Leu 


Arg 


Ala 


Lys 








260 










265 










270 






Leu 


S-r 


Glu 


Arp 


Thr 


Pro 


Pro 


Gin 


Ala 


Met 


Arg 


Pro 


Trp 


Arg 


He 


Gly 






275 










280 










285 








Lys 


Arg 


Thr 


Ala 


Val 


Leu 


Leu 


Gly 


Met 


Asn 


Gly 


Glu 


Ser 


Pro 


Thr 


Asp 




2 90 










295 










3 CO 










Ser 


Leu 


Glu 


Val 


Glu 


Asp 


tlet 


Lys 


Val 


Leu 


Tyr 


Tyr 


Arg 


Lys 


net 


Pro 


30S 










310 










315 










320 


Val 


Arg 


Pro 


Lys 


He 


Leu 


Ala 


l.ya 


Arg 


Phe 


Arq 


Phe 


Phe 


Ser 


Gly 


Asn 










325 










330 










335 




Leu 


Tyr 


Arg 


Gl n 


Lys 


Asp 


Asp 


Glu 


Thr 


Thr 


Arg 


Lys 


Ser 


Lou 


Ala 


Arg 








3*10 










345 










350 






Leu 


Gly 


Ala 


Phe 


Ser 


Val 


He 


Asp 


L-u 


Asn 


Phe 


Leu 


Gin 


Arg 


Asp 


Ser 






355 










360 










365 








lie 


Set 


Gly 


Lsu 


Leu 


Asp 


Val 


Arg 


Leu 


Leu 


Thr 


Thr 


Leu 


Aep Lys 


Pro 




J7C> 










37 5 










300 










Trp 


Asp 


Ala 


Ser 


Leu 


Glu 


Thr 


Leu 


Phe 


Thr 


Ser 


Lys 


Ser 


Asn 


Asp 


Phe 


385 










390 










395 










400 


lie 


Gly 


Pro 


Gly 


Leu 


Asn 


Phe 


Ala 


Leu 


Ala 


Arg 


Arg 


Asn 


Val 


Phe 


Gly 










405 










410 










415 




Gly 


Gly 


Glu 


Asn 


Leu 


Ser 


Trp 


Asn 


He 


Gly Gly 


Ser 


Tyr 


Glu 


Trp 


Glu 








420 










425 










430 






Thr 


Gly 


Ann 


Arg 


Pro 


Glu 


Asn 


Ser 


Ser 


Asn 


Arg 


Leu 


He 


Asp 


He 


Asn 






435 










440 










445 








Ser 


Tyr 


Asn 


Met 


Asn 


Thr 


Ala 


Val 


Asn 


Leu 


Ser 


Phe 


Pro 


Ser 


lie 


Val 




450 










455 










4G0 










Phe 


Pro 


Gly 


Leu 


Leu 


Asp 


Lys 


Tyr 


Tyr 


Tyr 


Tyr 


Pro 


Thr 


Thr 


Thr 


Thr 


465 










470 










475 










480 


Phe 


Gin 


Ala 


Ser 


Ala 


Thr 


Ala 


Leu 


Asn 


Arg 


Ala 


His 


Tyr 


Phe 


Ser 


Met 










485 










490 










4 95 




Tyr 


Ser 


Phe 


Gly 


Phe 


Ser 


Thr 


Thr 


Tyr 


Glu 


Phe 


Gin 


Pro 


Ser 


Lys 


Glu 








500 










505 










510 






His 


Arg 


His 


Ala 


He 


Phe 


Pro 


Leu 


Lys 


Leu 


Asn 


Tyr 


Asn 


Leu 


Leu 


Gly 






515 










520 










525 








His 


Gin 


Thr 


Glu 


Thr 


Phe 


Gin 


Ala 


He 


Thr 


Ala 


A3n 


Asn 


Pro 


Pro 


Leu 




530 










535 










540 










Leu 


Leu 


Ser 


Leu 


Gin 


Ser 


Gin 


Phe 


Leu 


Ala 


Gin 


Met 


Gly 


Tyr 


He 


Tyr 


545 










550 










555 










S60 


Thr 


Phe 


Asn 


Lys 


Ser 


Val 


Ser 


Glu 


Lys 


Ser 


Pro 


His 


His 


Leu 


Trp 


Met 










565 










570 










57 5 




Gin 


Phe 


Gly 


Leu 


Ser 


Glu 


Ala 


Gly 


Asn 


Leu 


Leu 


Asn, 


Leu 


He 


Tyr 


Leu 








580 










585 










590 




Ala 


Ala 


Gly 


Lys 


Li- 


Tyr 


Ser 


Asp 


Thr 


Lys 


Asn 


Phe 


Val 


Gly Val 


Pro 






595 










600 










605 








Phe 


Ser 


Gin 


Phe 


lle 


Lys 


Ala 


Thr 


Gly 


Glu 


Leu 


Arg 


Tyr 


Ser 


Tyr 


Thr 




610 










615 










620 










lie 


Asp 


Arg 


Asn 


Gin 


Ser 


Leu 


Ala 


Thr 


Arg 


Phe 


Gly Thr 


Gly Val 


He 


625 










630 










635 










640 


Tyr 


Ser 


Tyr 


Gly 


Asn 


Met 


Arg 


Val 


Ala 


Pro 


Tyr 


Ser 


Glu 


Gin 


Phe 


Tyr 










645 










650 










655 




Val 


Gly 


Gly 


Ala 


Asn 


Ser 


He 


Arg 


Ala 


Phe 


Thr 


Val 


Arg 


Ser 


He 


Gly 








660 










665 










67 0 






Pro 


Gly 


Arg 


Phe 


Asn 


Pro 


Asp 


Ser 


Asp 


Asn 


Gin 


Tyr 


Ser 


Tyr 


Leu 


Asp 






675 










660 










665 








Gin 


val 


Gly 


Glu 


Phe 


Lys 


Leu 


Glu 


Ala 


Asn 


Val 


Glu 


Tyr 


Arg 


Gly 


Lys 




690 










695 










700 










Leu 


Phe 


Gly 


Asp 


Leu 


Hi3 


Ala 


Ala 


Val 


Phe 


Leu Asp Ala 


Gly Asn 


Val 


705 










710 










715 










720 


Trp 


Leu 


Leu 


Arg 


Glu 


Asp 


Ser 


Ser 


Arg 


Pro 


Gly Gly Ala 


Leu 


Ser 


Glu 










725 










730 










735 




Val 


Gly 


Ser 


Val 


Ser 


Asn 


Phe 


Leu 


Asn 


Ser 


He 


Ala 


Leu 


Gly Thr 


Gly 








740 










745 










750 




Val 


Gly 


Leu 


Arg 


Tyr 


Asp 


Leu 


Ala 


Phe 


Leu 


Val 


Val Arg 


Val 


Asp 


Val 






755 










760 










765 








Gly 


Phe 


Gly 


Leu 


His 


Leu 


Pro 


Tyr 


Asn 


Thr 


Gly 


Lys 


Lys 


Gly Tyr 


Tyr 




770 










775 










"780 








Asn 


He 


Pro 


Arg 


Phe 


Lys 


Asp 


Ala 


He 


Gly 


Phe 


Hrs 


Leu 


Ala 


Val 


Giy 


785 










790 










7 95 










Q00 


Tyr 


Pre 


Phe 
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IZ) IUFORllATlOtl FOR SEQ ID NO: 327 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 783 amino acids 

(B) TYPE: amino icid 
(P) TOPOLOGi : linear 

(ii) MOLECULE TYFE; prot-in 

Ixxi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Po rphyrowon as gingival is 

(i>.) FEATURE: 

(A| NAME/ KEY: mj.sc_f eature 
iB) LOCATION 1 . „ .7 83 

IXi) SEQUENCE DESCRIPTION: SEQ ID r TO: 327 

Ssr Leu Ser Pro Tyr He Arg Phe Pro Met Ser Ser His Ser Val Arg 

15 10 15 

Tyr Leu He Gly He Ala Gly Cys Leu Leu Leu Met Leu Ala Ser Ser 

20 25 30 

Cys- Ser Val Thr Arq Tyr Val Pro Asp Gly Ser Arg Leu Leu Asp Arg 

35 40 45 

Val Thr He Ala Ser Glu Thr Gly Ser He Ala Leu Pro Glu Asp Tie 

50 55 60 

Arg Asp Tyr Thr Leu Gin Gin Pro Asn Tyr Arq Leu Phe Gly Met Thr 
65 70 75 80 

Arg Trp Leu Leu Arg Val Tyr Ser Ser Ser Asn Pro Asn Scr Asn Ser 

85 90 95 

Trp Trp Asn Arg Ser Leu Arg Lys Met Gly Glu Pro Pro Val Leu He 

100 105 110 

Asp Ser Val Leu Thr A^p Arg Thr Ala Asn Arg Leu Ala Lys Ala Mat 

115 120 125 

Ala Gly Asp Gly Phe Leu Asp Ala Thr Ala Arg Ala Val Val Asp Thr 

130 135 140 

Gly Leu Tyr Lys Lys Ala Arg He Thr Tyr Leu He Gin Pro Gly Ser 
145 150 155 160 

Arg Tyr Tyr He Arg Asn Met Ala Leu Asp Val Lys Asn Pro Leu Leu 

165 170 175 

Pro Pro Val Ala Leu Gly Asn Ser Leu Pro Ser Ala Tyr Lys Val Gly 

180 185 190 

He Ser Glu Gly Ser Pro Leu Ser Pro 11= Val Leu Asp Glu Glu Arg 

195 2O0 205 

Lys Ala He Ala Arg His Met Arg Asn Asn Gly Phe Tcp Lys Phe Ser 

210 215 220 

Ma Glu Asp Val Tyr Tyr Glu Ala Asp Thr Thr Val Ser Gly Gly Ser 
225 230 235 240 

Gly Thr Lys Ser Ala Asp Leu Lys Leu Val Val Asn Gly He Gly Arg 

245 2S0 255 

Tyr Pro Tyr Arg He Gly Arg Val Phe Phe His Ala Asp Tyr Asp Pro 

260 265 270 

Leu Glu Ser Asp Phe Arg Val Gin Glu Leu Pro Arg He Asp Ser He 

275 280 285 

Ser Arg Gly Asp Tyr Thr Val Tyr Tyr Gly Ser Arg Gly Arg Tyr He 

290 295 300 

Arg Ala Ser Ala Leu Thr Arg Ser Val Ser Val Thr Pro Gly Ala Phe 
305 310 315 320 

Phe Cys Glu Asp Asp Val Glu Arg Ser Tyr He Lys Leu Asn Ala Leu 

325 330 335 

Pro lie Val Arg Asn Val Asn He Arg Phe Val Glu His Asn Gly Lys 

340 345 350 

Asp Glu He Ala Leu Ala Asp Ser Ser Arg Leu Val Aep Cys Tyr He 

355 360 365 

Leu Thr Val Pro Ala Lys Ser Lye Ser Phe Glu Ala Glu Val Leu Gly 

370 375 380 

Thr Asn Ser Ala Gly Asp Phe Gly Ala Ala Leu Ser Leu Gly Phe Thr 
3B5 390 395 400 

Asp Arg Asn Leu Phe Arg Gly Ala Glu Met Phe Asn He Lys Leu Lys 

405 410 415 

Gly Ala Tyr Glu Ala He Arg Lys Gly Ser His Ser Phe Met Glu Tyr 

-420 425 430 

Gly Val Glu Ser Ser Leu Arg Phe Pro Arg Leu Leu Phe Pro Phe He 
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435 440 
Ser Asp Glu Thr Arq Arg Arg Leu Arg Ala Ser Thr j Trp Lys He 

'150 455 460 

Gly Tyr Asn Tyi Gin Thr Arg Pro Glu Phe A^p Arg Val Tie L-u Ser 
465 4 70 .|75 480 

Ala Gin Leu Asn Tyr Ser Trp Gin Thr Tyr Leu His Asn Arg Leu Arg 

485 490 495 

His Thr II- Arg Leu Leu Asp Val Asp Tyr Leu Hir Leu Pro Tyr II- 

500 505 510 

At-p Pro Asp Phe Ala Gin Ser Leu Pro Pro Thr Thr Ala Leu T-r Asn 

51b 520 525 

Tyr Thr Glu Gin Phe lie Leu Gly Ser Ala Tyr lie Leu Am Tyr Thr 

530 535 54Q 

Thr Ala Sor Ser Met Glu Arg Thr Val Ser Am Pro Fhe Thr Ala Arg 
15 54S 550 55S 560 

Phe Ser lie Gin Thr Ala Gly Aen Leu Leu Gin Ala He Ser Tyr Leu 

56 5 570 575 

Thr Asp Ser Pro Lys Asp Glu His Gly Leu Tyr Lys Met Phe Gly L-u 
„ n 580 585 590 

ZU His Tyr Ala Gin Phe Val Lys Leu Asp Leu Acp Leu Ala Lys Thr Val 
5 ?5 600 605 

Leu Leu Glu Lys Asp Asn Thr Leu Ala Leu His Leu Gly Phe Gly L~u 

"0 615 620 

Ala Phe Pro Tyr Gly Asn Ala Arg His He Pro Phe Glu Leu Arg Tyr* 
25 625 630 - 6 35 640 

Phe Ala Gly Gly Ser Asn Ser Val Arg Gly Trp Ser Val Arg Thr Leu 

645 650 655 

Gly Pro Gly Ser Met Lys Met Thr Pro Asp Lys Thr Phe Phe Asp Gin 

66 ° 665 670 

Hot Gly Asp He Arg Leu Asp Leu Asn Val Glu Tyt Aru Thr Lys L-u 

675 680 6BS 

Phe Trp Lys Phe Arg Ala Ala Ala Phe Val Arp Ala Gly Asn Val Trp 

690 695 700 

j[ h f Ile Glu T Y r G l" Asn Gin Glu Asp Gly Leu Phe Arg Phe Asp 

35 705 710 715 720 

Arg Phe Tyr Lys Glu Ile Ala Leu Ala Tyr Gly Leu Gly Leu Arg Leu 

125 730 735 

Anp Phe Asp Tyr Phe Leu Val Arg Leu Asp Ala Gly Leu Lys Ala Tyr 

7 40 745 750 

Asp Pro Gin Gin Thr Gly Arg Tyr Lys Trp Ala Ile Thr Arg Pro Asn 

755 760 765 

Leu Ser Sor Asn Phe Ala Trp His He Ala Val Gly Tyr Pro Phe 
770 775 78o 

45 (2> INFORMATION FOR SEQ ID NO: 328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 amino acids 
<B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

Uil) HYPOTHETICAL: YES 

<vi» ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas glngivalis 

<lx) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION 1. . .875 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 328 



40 



55 



60 



65 



70 



75 



50 



Lys 


Leu 


Leu 


Cy3 Leu 


Met 


Arg 


Lys 


Arg 


He 


Leu 


Gin 


Leu 


Ala 


5 








10 










IS 




Leu 


Leu 


Leu Ala 


Leu 


Gly 


Ser 


Ser 


Leu 


Ala 


Ile 


Ala 


20 








25 










30 






Val 


Thr 


Gly 


Lys .Val 
40 


lie 


Asp 


Sei. 


GJ u 


Thr 
45 


Ser 


Glu 


Pro 


Val 


Ser 


Val 


Ser Thr 


Gly 


Gin 


Gly 


Ala 


Ser 


Leu 


Arq GlV 


Asp 






55 








60 










Met 


Asp 
70 


Gly Gly 


Phe 


Arg 


Phe 

75 


Glu 


Val 


Pro 


Ala 


Lys 
80 


Thr 


Phe 


Arg 


Cys Val 


Gly 


Tyr 


Ala 


Thr 


Val 


Thr 


Arg 


Ser 
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8 5 ?0 ?5 

He Gly Arcj Gly Ssr Gin Glu Asp Leu Gly Thr He Leu Leu Asp Pro 

100 105 110 

Gin Ala He Gly Leu Asp Glu He Gin Val lit Ala Ser Val Val Pro 

IIS 120 125 

Lyc Arp Arg Met The Pro Val Pro Val Ser Asn He Arg Val Ala Asp 

130 135 140 

He Gin Ala Ala Ser Leu Asn Val Glu Fhc Pro Glu Leu V»l Lys Ser 
145 ISO 155 160 

Thr Pro Ser Thr Tyr Thr Thr Lys Gly Ser Gly Gly Fhe Gly Asp Gl / 

165 170 175 

Ara Thr Asn Val Arg Gly Phe Asp Thr Tyr Asn Phe Gly Val Leu He 

180 185 190 

Atn Gly Val Pro Val Asn Gly Met Glu Asp Gly Lys Val Tyr Trp Ser 

19b 200 205 

Asn Trp Ser Gly Leu Met Asn Gin Ala Ser Thr He Gin He Gin Arg 

210 215 220 

Giy Leu Gly Ala Ser Lys Leu Gly He Ser Ser Val Gly Gly Thr Met 
225 230 235 240 

Ai n He He Thr Lys Thr Thr Asp Ala Asn Thr Gly Gly Ser Ala Tyr 

245 250 255 

Val Gly llec Gly Asn Asp Gly Leu His Lys Glu Ser Phe Ser He Ser 

260 265 270 

Thr Gly Met Asn Asp Gly Trp Ala He Thr He Ala Gly Ser His Met 

275 280 285 

Thr Gly Leu Gly Tyr Val Lys Gly Leu Lys Gly Arg Ala Phe Ser Tyr 

290 295 300 

Phe Phe Asn Val Ser Lys Lys Phe Asn Glu Arg His Thr Leu Ser Leu 
305 310 315 320 

Thr Gly Phe Gly Ala Pro Gin Trp His Asn Gin Arg Ser Ser Lys Tyr 

325 330 33b 

ser Val Ala Asp Tyr Asp Lys Tyr Gly He Arg His Asn Gin Ser Phe 

340. 345 350 

Gly Tyr Leu Arg Gly Glu Leu Thr Pro Thr Ala Tyr, Ala Tyr Asn The 

355 360 365 

Tyr His Lys Pro Gin Phe Ser Leu Asn His Phe Trp Lys Met Asp Glu 

370 375 380 

Asn Thr Ser Leu Tyr Thr Ala Unfc Tyr Ala Ser Leu Ala Thr Gly Gly 
385 390 395 400 

Gly Arg Arg Ala Tyr Gly Lys Asn Ser Lys Trp Val Leu He Asn Tyr 

405 410 415 

Asn Thr Gly Gin Pro Tyr Glu Gin Thr Lys Val Thr Pro Asp Gly Leu 

420 425 430 

He Asp Tyr Asp Ala Val Leu Ala Ala Asn Ala Ala Ala Ser Asn Gly 

435 440 445 

Ser Glu Ala He Phe Ala Leu Gly Ser Asn Ser His Lys Trp Phe Gly 

450 455 460 

Leu Leu Ser Ser Phe Lys Lys Lys Leu Asn Ser Ser Leu Thr Leu Thr 
465 470 475 480 

Ala Gly Tyr Asp Gly Arg Tyr Tyr Arg Gly Asp His Tyr Asp Lys He 

485 490 495 

Thr Asp Leu Leu Gly Gly Ser Tyr Tyr He Glu Asp Pro Lys Thr Lys 

500 505 510 

Leu Ala Tyr His Ala Glu Gly Gin Gin Leu Lys Val Gly Asp He Val 

515 520 525 

Asn Arg Asp Tyr Thr Gly Glu He Met Trp His Gly Leu Phe Ala Gin 

530 53S 540 

Met Glu His Ser Ser Glu Trp He Asp Ala Phe Val Ser Gly Ser He 
545 550 555 560 

Asn Tyr Glu Leu Tyr Arg Asn His Asn Tyr Gly Gly Ser Lys Ser Thr 

565 570 575 

Gly Tyr Leu Pro Gly Val Ser Pro Trp Lys Ser Phe Leu Pro Trp Ser 

580 5B5 590 

Gly Lys Ala Gly Leu Ser Tyr Lys Phe Ala Gin Gly His Asn Val Phe 

595 600 605 

Ala Asn Gly Gly Phe Phe Thr Arg Ala Pro Leu Phe Gly Asn He Tyr 

610 615 620 

Ala Ala Gly Ala He He Pro Asn Asp Lys Ala Asn Met Glu Lys Val 
625 630 63S-- 640 

Leu Thr Gly Glu Val Gly Tyr Gly Phe Thr Asn His Lys Asn Phe Glu 

645 650 655 

Phe Asn He Asn Gly Tyr Tyr Thr Lys Trp Met Asp Arg Val Thr Ser 

660 665 670 

Lys Arg He Gly Asn Glu Tyr Val Tyr Leu Asn Gly Val Asp Ala Val 
675 680 685 
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45 



50 



55 



60 



65 



70 



75 



H 1 5 


Cys 


Gl/ 


Val 


Glu 


Al a 


Gl u 


Val 


Ser 


Tyr 


Arg 


Pro 


Ho 


Arg 


Gin 


He 




690 




















7 00 








Arp 


Leu 


Arg 


Gl y 


Ilet 


Phe 


Ser 


Lsj 


Gly 


Asp 


Trp 


Thr 


Trp 


Gin 


Asn 


Asn 


705 










710 










715 










Val 


Ser 


Tyr 


Thr 


Ser 


Tyr 


Asp 


Glu 


Ala 


Gly 


Asn 


Glu 


Thr 


Gly 


Gin 


Arp 


He 


















730 










735 




Thr 


Tyr 


He 


hyp 


Gly 


Leu 


His 


Val 


Gly 


Asp 


Ala 


Ala 


Gin 


Met 


Thr 








7-10 










74 5 










750 






Ala 


Ala 


Val 


Ser 


Ala 


Asp 


lie 


Glu 


Leu 


Phe 


Lys 


Gly 


Phe 


His 


Val 


He 


Gly 




755 










760 








7 65 






Lys 


Tyr 


Asn 


Phe 


Leu 


Gly 


Lys 


Asn 


Tyr 


Ala 


Gly 


Phe 


Asn 


Pro 


Ala 




770 










775 










780 








Thr 


Arg 


Asn 


Ala 


Gin 


Gin 


Tyr 


Glu 


Ala 


Asp 


Gly 


Lys 


Glu 


He 


Val 


Glu 


785 










7 90 










795 








800 


Ser 


Tip 


Lys 


Leu 


Pro 


Asp 


Val 


Gly 


Leu 


Phe 


Asp 


Leu 


Ser 


Ala 


Ser 


Tyr 










805 










610 










815 


Asn 


Phe 


Lys 


Leu 


Gly 


Ser 


Leu 


Ser 


Thr 


Thr 


Phe 


Tyr 


Phs 


Asn 


Met 


Asp 








820 










825 










830 




Acn 


Val 


Ala 


Asp 


Lys 


Arg 


Tyr 


Val 


Ser 


Asp 


Ala 


Asp 


Asp 


Asn 


He 


He 






835 










840 








645 








Uly 


Lys 


Lys 


His 


Asp 


Glu 


Ala 


Ser 


Ala 


Leu 


Val 


Trp 


Tyr 


Gly 


Phe 


Gly 




850 










855 










860 




Arg 


Thr 


Trp 


Ser 


Thr 


Gly 


lis 


Arg 


Val 


Asn 


Phe 












865 










870 








875 












(2) 


INFORMATION 


FOR 


SEQ 


ID NO:329 

















10 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 60 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

Hi) IIOLECULE TYPE: protein 

35 (Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

40 <ix> FEATURE : 

(A) NAME /KEY : miscjeature 
<B) LOCATION 1. . .460 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:329 



Asn 


Arg 


Cys 


Asn 


Pro 


Ser 


Arg 


Gin 


Tyr 


Phe 


Hi* 


Leu 


Ser Gin Asn Asp 


1 








5 










10 










15 




Val 


Gly Arg 


Ser 


Leu 


Arg 


Ala 


Phe 


Leu 


lie 


His 


Leu 


S-r 


Met 


Lys 


Phe 








20 










25 










30 




Ser 


He 


Arg 


Leu 


Phe 


Leu 


Cys 


He 


He 


Phe 


Leu 


Leu 


Ser 


Ala 


Phe 


He 






35 










40 










45 






Leu 


Pro 
50 


Ala 


Lou 


Gly Gin 


Lys 
55 


Ser 


Lys 


Gin 


Val 


Gin 
60 


Arg 


Leu 


Glu 


Lys 


Gin Arg Lys 
65 


Glu 


Ala 


Leu 

70 


Lys 


Ala 


He 


Glu 


Lys 
75 


Thr 


Asp Arg Gl u 


Leu 
80 


Arg 


Asn 


Thr 


Lys 


Lys Asp 


Lys 


Gin 


Asp 


Lys 


Gin 


Lys 


His 


Leu 


Asn 


Leu 










65 










SO 










25 




Leu 


Asn 


Lys 


Gin 


Val 


Ala 


Gin 


Arg 


Lys 


Gin 


Met 


Val 


Gin 


Leu 


Leu 


Asp 


Asn 


Glu 




100 










105 










110 




Val 


Lys 


Glu 


Leu 


Gin 


Ser 


Asp 


He 


Asp 


Ser 


Met 


Thr 


Glv Val 






115 










120 










125 








Cys 


His 


Gin 


Leu 


Ser 


Val 


Glu 


Glu 


Lys 


Ala 


Arg 


Ser 


Asp 


Glu 


Tyr 


Ala 




130 










135 










140 






Gin 
145 


Ala 


Leu 


Gin 


Ser 


Met 
150 


Gin 


Lys 


Arg 


Lys 


Arg 

155 


Ser 


Leu 


Asp Arg 


He 
160 


Leu 


Phe 


He 


Ser 


Ser 


Ala 


Lys 


Ser 


Phe 


Asp 


Glu 


Gly 


Met 


Arg 


Arg 


Met 


Arg 








165 










170 










175 




Phe 


Leu 


Glu 


Gin 


Tyr 


Ala 


Ser 


Ala 


Tyr 


Lys 


Leu 


Ala 


Ser 


Val 


Arg 








180 










185 










190 




Leu 


Arg Asp 


Thr 


Arg 


Ser 


Lys 


Leu 


Glu 


Thr 


Glu- Arg 


Ala 


Thr 


Val 


Glu 






195 










200 










205 






Asp Ala 


Lys 


Lys 


Glu 


Lys 


Gly 


His 


Leu 


Leu 


Val 


He 


Arg 


Glu 


Glu 


Glu 




210 










215 










220 






Lys 


Lys 


Lys 


Leu 


Glu Gly 


Gin 


Gin 


Ala 


Glu 


Gin 


Arg 


Arg 


Gin 


Val 


Gin 


225 










230 










235 






240 
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40 



50 



55 



60 



65 



70 



75 



Ala 


Leu 


Gl/ 


Ala 


L/s 
245 


Gin 


Lyr 


Af P 


Leu 


Glu 
250 


Ala 


Gin 


Lej Arg 


Lys 


Gin 


1,'j'S 


Lyr 


g: r> 


Ala 

260 


Glu 


Ala 


L~u 


Asn 


Arg 


Lys 


lie 


Glu 


L y s Gin 
270 


He 


Ala 


LY* 


Glu 


lie 
27 5 


Glu 


Ala 


Ale 


Glu 


Arg 
280 


Arg 


Al * 


Arg 


Glu 


Glu Aro 

295 


Glu 


Axg 


Leu 


Ala 


Arg 


Glu 


Ala 


Lys 


Ala 

295 


Lys 


Gly 


Lys 


Pro 


Val 
300 


Pro Ala 


Glu 


Pro 


Glu 


Arg 


Lys 


Ala 


Glu 


Thr 


Lys 


Gly 


Gly 


Tyr 


Ala 


Met 


Asp Ala 


Ser 


Glu 


305 










310 










315 








320 


Arg 


Ala 


Leu 


Ser 


Gly 


Ser 


Phe 


Ala 


Gin 


A^a 


Lyc 


Gly 


Arg Leu 


Pro Gly 










325 










330 








335 




Pro 


Val 


Aty 


Gly 
340 


Arg 


Tyr 


Arg 


lie 


Val 
345 


Ser 


Asp 


Phe 


Gly Val 
350 


His 


Gin 


Hie 


s~r 


Glu 


Leu 


Lys 


Lys 


Val 


Gin 


Vil 


Asn 


Aen 


Gly 


Gly lie 


Asp 


Tie 






355 










360 








365 






Ala 


Val 


Ala 


Thr 


Gly 


Scr 


Asp 


Ala 


Thr 


Ser 


Val 


Phe 


A^p Gly Val 


Val 




37 0 










37 5 










3B0 








Ssr 


spr 


Val 


Phe 


Val 


lie 


Pro 


Gly 


Tyr 


Asn 


Ser 


Ala 


Val Met 


Val 


Arg 


385 










390 










395 








40Q 


His 


Clo- 


hsn 


Tyr 


lie 
405 


Thr 


Val 


Tyr 


Ala 


Asn 
43 0 


Leu 


Ser 


Lys Val 


Tyr 
415 


Val 


A^n 


ser 


Cly 


Thr 

420 


Arg 


Val 


Lys 


Thr 


Gly 
425 


Gin 


Ala 


Leu 


Gly Arg 
430 


Ala 


Tyr 


Thr 


Asp 


Pro 
435 


Ser 


Asn 


Asn 


GJ n 


Thr 
440 


lie 


lie 


His 


Phe 


Glu lie 
445 


Trp 


Lys 


Glu 


Arg 
450 


Ser 


Lys 


Gin 


Asn 


Pro 
455 


Arg 


Leu 


Tr P 


Leu 


Arg 
460 









10 



15 



20 



25 



30 (2i INFORMATION FOR SEQ ID NO: 330 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 2 amino acids 

(B) TYPE: amino acid 
35 ( D) TOPOLOGY: linear 

(iij MOLECULE TYPE: protein 

(iiii HYPOTHETICAL: YES 



tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyrorftonas g^ngivalis 



FEATURE: 

45 (A> NAME /KEY: ml3c_f eature 

IB) LOCATION 1. . .342 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 330 



Ser 


Phe 


Tyr 


Gin 


Glu 


He 


Asp 


Arg 


Leu 


Met 


Lyr 


Lys 


Tyr 


Leu 


Leu 


Tyr 


1 








5 










10 










IS 




Ala 


Ser 


Leu 


Leu 


Thr 


Ser 


Val 


Leu 


Leu 


Phe 


Ser 


Cys 


Ser 


Lys 


Asn 


Asn 








20 










25 










30 






Pro 


Asn 


Glu 


Pro 


Val 


Glu 


Asp 


Arg 


Ser 


He 


Glu 


He 


Ser 


He 


Arg Val 






35 










40 










45 








Asp 


Asp 


Phe 


Thr 


Lys 


Thr 


Gly 


Glu 


Ala 


Val 


Arg 


Tyr 


Glu 


Arg 


Asn 


Gin 




50 










55 










60 








Gly 


Ser 


Ala 


Ala 


Glu 


Arg 


Leu 


He 


Thr 


Asn 


Leu 


Tyr 


Leu 


Leu 


Leu 


Phe 


65 










70 










75 










80 


Asp 


Gin 


Ser 


Gly 


Ala 


Asn 


Pro 


Ala 


Lys 


Tyr 


Tyr 


He 


Thr 


Gly Asn 


Thr 










85 










90 










95 




Phe 


Thr 


Gly 


Gly 


Thr 


Trp 


Levi 


Pro 


Asp 


Asp 


Met 


Lys 


Val 


Lys 


Leu 


Asp 








100 










105 










110 






Met 


Thr 


Gin 


Ser 


Glu 


Ala 


Gly 


Glu 


Arg 


Lys 


Val 


Tyr 


Val 


Val 


Ala 


Asn 






115 










120 










125 








Val 


Asp 


Asn 


Ala 


Val 


Lys 


Thr 


Ala 


Leu 


Asp 


Ala 


Val 


Ala 


Asn 


Glu 


Ser 




130 










135 










140 










Asp 


Leu 


Gin 


Thr 


Val 


Lys 


Arg 


Thr 


Thr 


Ala 


Met 


Pro 


Trp 


Ser 


Thr 


Asp 


145 










150 










155 










160 


lie 


Ala 


Ser 


Pro 


Phe 


Leu 


Met 


Ser 


Gly 


Asn 


Lys 


Thr 


His 


Asp 


Phe 


Leu 










165 










170 










175 




Ala 


Asn 


Axg 


Leu 


Leu 


Asp 


Asn 


Val 


Pro 


Leu 


Val 


Arg 


Ala 


He 


Ala 


Lys 








100 










185 










190 






Val 


Glu 


Leu 


Asn 


He 


Ser 


Leu 


Set 


Glu 


Lys 


Phe 


Gin 


He 


Val 


Pro 


He 






195 










200 








205 
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lis VgI A^n Gly £i«r Leu Sec Glu Phe Lys Fne Arg Tyr Va] A^n Phe 

210 215 220 

Asp Lys Glu Thr r Val Val Lys Pro Thr The Lys Pro A^p Asn Leu 
7. ?. c . 230 235 24 0 

5 He Ser Ser Ala A^n Gly Val Trp Pro Gin He Thr Asp Trp Thr Val 
245 250 250 

Trp Gly Ala Ser Leu Asn Thr Ser Pro Ala Pro Asp Ala Gly Thr Gly 

26C 265 270 

Tyr Thr Levi Asp Ala Asn Gly Lys V<.1 Thr Ala Leu Arg He Val Thr 
10 2"?5 280 285 

Tyr Leu Asn Glu Arg Asp Ser Lys Gly Ala Thr Val Glu Val Ala Leu 

290 295 300 

Pro Arg Val Asrp Asp Gly Thr Leu Pro Pro Pro Glu Phe Gly Pro Glu 
305 310 315 320 

15 Leu Tyr Arq Leu Pro Leu Pro Asp Lys lie Leu Arg Asn Has Trp Tyr 
325 330 335 

Lys Tyr Glu Val Glu He 
340 

20 [2) INFORMATION FOR SEQ ID NO: 331 

(11 SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 329 amino acids 

(B) TYPE; amino acid 
25 (D) TOPOLOGY: linear 

(ill HOLECULE TYPE J protein 

(ill) HYPOTHETICAL: YES 



30 



40 



45 



50 



55 



60 



65 



70 



75 



(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas glngivalis 



fix) FEATURE: 
35 |A> NAME/KEY: misc_f eature 

(B> LOCATION 1...32 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 



Asn 


Leu 


Arg 


Thr 


Gin 


Pro 


Phe 


Phe 


Leu 


Val 


Asp 


Leu 


Gin 


Ser Met 


He 


1 








5 










10 








15 




Arg 


Thr 


lie 


Leu 


Ser 


Arg 


Tyr 


Val 


Ser 


Ser 


Asn 


Phe 


Trp 


Ser Arg 


Gly 








20 










25 










30 




Ala 


Thr 


Phe 


Phe 


Phe 


Thr 


He 


Phe 


Pro 


Ala 


Phe 


He 


Leu 


Ala Ala 


Thr 






35 










40 










45 






Ala 


Leu 


Pro 


Ala 


cys 


Gly Gly 


Gly 


Thr 


Ala 


Ser 


Gly Ser Asp Arg 


Thr 




50 










55 










60 








Leu 


Ala 


Val 


Thr 


He 


Glu 


Pro 


Gin 


Lys 


Tyr 


Phe 


He 


Glu 


Ser He 


Al- 


65 










70 










75 








80 


Asp 


Lys 


Ser 


Val 


Gin 


Val 


Val 


Ala 


Leu 


Val 


Pro 


Ala 


Gly 


Ser Asn 


Pro 










85 










90 








95 




Glu 


Glu 


Tyr 


Asp 


Pro 


Ser 


Pro 


Thr 


Val 


Met. 


Lys 


Arg 


Leu 


Ser Glu 


Ala 








100 










105 










110 




Asp 


Ala 


Tyr 


Phe 


Tyr 


He 


Gly 


Gly 


Leu 


Gly Phe 


Glu 


Gin 


Arg Asn 


Leu 






115 










120 










125 






Ala 


Ala 


He 


Arg 


Asp 


As:n 


Asn 


Pro 


Lys 


Leu 


Pro 


Leu 


Phe 


Glu Met 


Gly 




130 










135 










140 








Lys 


Ala 


Leu 


Ala 


Asp 


Ala 


Gly 


Ser 


Ala 


Asp 


Leu 


His 


Gly 


Ser Cys 


Thr 


14 5 










150 










155 








160 


Asp 


His 


Ser 


HiiT 


Thr 


Asp 


Leu 


His 


Ala 


His 


Asp 


Pro 


His 


Tyr Trp 


Ser 










165 










170 








175 




Scr 


Val 


Val 


Gly 


Ala 


Lys 


Ala 


Leu 


Ser 


Arg 


Ala 


Ala 


Tyr 


Asp Ala 


Leu 








180 










185 










190 




Val 


Glu 


Leu 


Tyr 


Pro 


Asn 


Glu 


Lys 


Asp 


Lys 


Trp 


Asp Lys 


Gly His 


Asp 






195 










200 










20S 






Arg 


Leu 


Asn 


Gly 


Arg 


He 


Asp 


Ser 


Val 


Lys 


Arg 


Leu 


Val 


Asp Thr 


Met 




210 










215 










220 








Phe 


Ala 


Asn 


Gly 


Lys 


Ala 


Asp 


Lys 


Ala 


Phe 


Val 


He Tyr 


Hie Pro 


Ser 


225 










230 










235 








240 


Leu 


Ser 


Phe 


Phe 


Ala 


Gin 


Glu 


Phe 


Gly 


Leu 


Arg 


Gin 


He 


Val He 


Glu 










245 










250 








255 




Glu 


Asp 


Gly 


Lys 


Glu 


Pro 


Thr 


Ala 


Ala 


Hie 


Leu 


Arg Arq Val He 


Asp 








260 










265 










270 




Gin 


Ala 


Arg 


Ala 


Asp 


Gly Val 


Arg 


He 


Val 


Phe 


He 


Gin 


Pro Glu 


Phe 






275 










280 










285 







SUBSTITUTE SHEET (Rule 26) (RG/AU) 



WO 99/29870 



256/ 490 



PCT/AU98/0I023 



Glu Tin An Gin Pi- Glu hsp lie Aia Arg jIu lie Gly Ala Arg Pro 

2 Q 5 3C0 
Va) Arg II" Asn Fro Leu Ai.y Ser Ser Trp 31 u -31 u Glu lie Leu Hl-« 
305 310 31^ 320 

5 He Ala Ai<g Ala Leu Ala His Glu Arg 
32b 

(2) II J FORI (ATI OH FOR SEQ ID NO: 332 

10 (i» SEOUEIICE CHARACTERISTICS: 

(A) LENGTH: 87 0 amino acids 
{ B) TYPE: amino acid 
( D) TOPOLOGY j linear 

15 (ill MOLECULE TYPE: protein 

(Lxi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Porphyrotronas gingival!:? 

{ tr. ) FEATURE : 

(A) IIAME/KEY: misc feature 
(BJ LOCATION 1. . .878 



25 



(xi) SEOUEIICE DESCRIPTION: SEQ ID [JO: 332 



Aid Aep Scr lie Arg Tyr Pro Leu Tyr Phs Phe Gly Arg Asn Arg Lys 
1 5 10 15 

30 Lys Cys Phe Arg Glu Pro He Pro Thr Leu Tyr Asn Lys Asn I let He 
20 25 30 

Gly Lys Lys He Phe Phe He Leu Leu Ala Leu He Ala Phe Sor Gly 

35 40 45 

Leu Asn Ala Ala Thr Asp Thr Glu Phe Ly^ Tyr Pro Thr Asp Ala Asn 
35 50 55 60 

He He Gly His Val Lys Asp Ser Lys Thr Gly Glu His L*u Val Gly 
65 70 75 80 

lie Thr He Ala He Lys Gly Thr Thr Ph& Gly Thr Ser Thr Asp Ala 
85 90 95 

40 Thr Gly His Tyr Tyr Leu Arg Asn Leu Arg Pro Gly Glu He Thr Leu 
100 105 110 

He Met Arg Gly Met Gly Tyr Lys Ser Gin Glu Arg Val Val Arg Val 

115 120 125 

Glu Lys Asp Lys Thr He Glu Val Asn Phe Glu Ala Glu Glu Asp Ala 
45 130 135 140 

lie Asn Leu Asp Glu Val Val He Ser Ala Asn Arg Glu Leu Thr Leu 
14S 150 15S 160 

Arg Arg Leu Ala Pro Thr Leu Val Asn Val Leu Asn Glu Lys Val Phe 
165 170 175 

50 Ser Gin Val Asn Ala Ser Asn Leu Ala Gin Gly Leu Ser Phe Gin Pro 
180 1B5, 190 

Gly Val Arg Val Glu Asn Asn Cys Gin Asn Cys Gly Phe Asn Gin Val 

195 200 205 

Arg He Asn Gly Leu Asp Gly Arg Tyr Ala Gin lie Leu He Asp Ser 
55 210 21S 220 

Arg Pro He Met Ser Ala Leu Ala Gly Val Tyr Gly Leu Glu Gin He 
225 230 235 240 

Pro Ala Asn Met He Glu Arg Val Glu Val Val Arg Gly Gly Gly Ser 
245 250 255 

60 Ala Leu Tyr Gly Ser Ser Ala He Ala Gly Val Val Asn He He Thr 
260 265 270 

Lys Glu Pro Ser His Asn Ser Phe Thr Phe Asn Glu Ser Leu Ser Phe 

275 280 285 

Thr Gly Phe Ser Lys Leu Asp Asn Asn Thr Asn Phe Asn Ala Ser He 
65 290 295 300 

Val Ser Asp Asp Acn Arg Ala Gly Ala Met Val Phe Gly Gin Ala Arg 
305 310 315 320 

Tyr Arg Asn His Trp Asp Ala Asn Asn Asp Gly Tyr Ser Glu Leu Gly 
325 330 - 335 

70 Lys He Asp Ala Arg Ser Leu Gly Ala His Ser Tyr Leu Arg Leu Ser 
340 345 350 

Asp Tyr Ser Lys Leu Thr Gly Glu Phe His Thr He Ser Glu Phe Arg 

355 360 365 

Arg Gly Gly Asp Arg He Asp Leu Pro Pro His Val Val Gly Val Ala 
75 370 375 380 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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GlU 


Gl n 


Thr 


Asp 


His 


Ser 


Val 


Phe 


Ser 


ciy 


Asn 


Leu 


Lys 


Tyr 


Asp 


Lei) 


385 










390 










395 










400 


Phe 


Ser 


Ser 


Asn 


Tyr 


Lys 


His 


His 


Phe 


Gin 


Ala 


Tyr Thr 


Ser 


Gly 


Gin 










4 05 










410 










415 




lie 


Val 


Asn 


Arg 


Lys 


Ser Tyr 


Tyr 


Gly 


Gly 


He 


Gly Glu 


He 


Asp 


Val 








4 20 










425 










430 








Gly 


His 


Pro 


Gly Gly 


Thr 


Glu 


Gly Tyr 


Pro 


lie 


Pro 


Gin 


Asp 


Gin 






43 j 










440 










445 








Tyr 




sn 


sn 


Tyr 


Gly 


Val 


Thr 


Lys 


Gly 


Lys 


Thr 


Tyr 


Met 


Gly 


Gly 




450 










455 










460 










He 


Gin 


T,, r 
1 > r 


Scr 


Tyr 


Arp 


Leu 


Asp 


Lys 


Phe 


Leu 


Leu 


Met 


Pro 


Ser 


Gin 


4 65 










470 










4 75 










400 


Leu 




Phe 




Ala 


Glu 


Tyr 


Thr 


Arg 


Asp 


Glu 


Leu 


Asn 


Asp 


Val 


Met 










405 










4 90 










495 




Pro 


He 


Leu 




Trp 


Gin 


Thr 


Gly Glu 


Asp 


Ala 


Asn 


Gly 


Asn 


Thr 


He 








•■nn 










505 










510 






Pro 


1 


Tyr 


p 


Glu 


Leu 


Asp 


Gin 


Asn 


He 


Asn 


Asn 


Tyr 


Ser 


Leu 


Ph? 






515 










52 0 










525 










Gin 


Asn 


Glu 


Trp 


Ly3 


Asn 


Asp 


Arg 


Trp 


Ser 


He 


Leu 


Val 


Gly Ala 




S30 










535 










540 










Arg 




A<* 




His 


Ser 


Glu 


Val 


Lys Asp 


Met 


He 


Leu 


Ser 


Pro 


Arg 


545 










550 










555 










560 


Thr 


Thr 


eu 




Phe 


Asn 


Val 


Asn 


Pro 


Asp 


He 


Asn 


Leu Arg 


Ala 


Thi- 










565 










570 










575 




ryr 


Ala 


Lys 




Phe 


Arg 


Ala 


Pro 


Gin 


Val 


Phe 


Asp 


Glu 


Asp 


Leu 


ols 


















585 










590 






Val 


Gly 


a . 


Vo 1 


Gly Gl Y 


Glu 


Ala 


Gin 


Lys 


Val 


Phe Asn Asp 


Pre 


Asn 






59b 










600 










605 








Leu 


Ly^ 


Pro 




He 


Ser 


His 


Ala 


Phe 


Se£ 


Leu 


Ser 


Ala 


Asp 


llet 


Tyr 




610 










615 










620 






His 


Arg 




j-y 


Asn 


Val 


Gin 


Thr 


Asn 


Phe 


Leu 


Val 


Glu 


Gly 


Phe 


Tyr 


625 










630 










635 








64 0 


Thr 


rg 


eu 


eu 


Asp Val 


Phe 


Thr 


Asn 


Glu 


Glu 


Gin 


Pro 


Asp 


Gin 


His 










645 










650 








655 




Asp 


Gly 


He 


Lys 


Arg 


Tyr 


Thr 


Arg 


lit 


Aerv Gly 


Ser 


Gly Ala 


Lys 


Val 








660 










665 










670 






Phe 


Gly 


Leu 


Asn 


Leu 


Glu 


Gly Lys Val Ala 


Tyr 


Lys 


Ser 


Phe 


Gin 


Leu 






675 










680 










60S 








Gin 


Ala 


Gly 


Leu 


Thr 


Leu 


Ala 


Ser 


Asn 


Lys 


Tyr 


Asp Glu Ala 


Gin 


Glu 




690 










695 










7 00 










Trp 


Gly 


Leu 


Asn 


Thr 


Val 


Lys 


Asp 


Thr 


Asn 


Gly 


Ala 


Phe 


Val 


Thr 


Glu 


705 










710 










715 










720 


Ala 


Am 


Ala 


Asn 


Gly Gin 


Gin 


Glu 


Tyr 


Lys 


Asn 


Glu 


Ser 


Met 


Thr 


Asp 










72S 










730 










735 


Thr 


Gin 


He 


Thr 


Arg 


Thr 


Pro 


Ser 


Val 


Tyr 


Gly 


Tyr 


Phe 


Thr 


Leu 


Ala 








740 










745 










750 






Tyr 


Asn 


Pro 


Ala 


His 


Ser 


Trp 


Asn 


He 


Ala 


Leu 


The 


Gly 


Ala 


Tyr 


Thr 






755 










/60 










765 






Cly 


Gin 


Met 


Tyc 


Val 


Pro 


His 


Ala 


He 


Glu 


Tyr 


Gly Val 


Lys 


Ser 


Ala 




770 










775 










780 










Glu 


Leu 


Asp 


He 


Met 


Gin 


Asn 


Asn 


Pro 


Glu 


He 


Thr 


Asp 


Glu 


Thr 


Gly 


785 










790 










7 95 








800 


Lys 


Ala 


Pro 


Arg 


He 


Asp 


Glu 


Leu 


Lys 


Lys 


Thr 


Pro 


Ala 


Phe 


Phe 


Asp 










805 










810 










815 


Leu 


Gly 


Leu 


Lys 


Val 


Gly 


Tyr 


Asp 


Phe 


His 


Val 


Phe 


Bin 


Ala 


Thr 


Glu 








820 










825 










830 






Val 


Gin 


Leu 


Tyr Val 


Gly Met 


Asn 


Asn 


He 


Phe 


Asn 


Ser 


Phe 


Gin 


Lys 






635 










840 










845 






Asp 


Phe 


Asp 


Aeg 


Gly Ala 


Ala 


Arg 


Asp 


Ser 


Gly 


Tyr 


He 


Tyr 


Gly 


Pro 




050 










855 










860 








Thr 


Gin 


Pro 


Arg 


Thr 


Gly 


Tyr 


Met 


Gly 


Leu 


Val 


Val 


Lys 


Phe 







865 870 87L 



(2) INFORMATION FOR SEQ ID NO; 333 

<i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 206 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



WO 99/29870 



PCT/AU98/01023 



258/490 



10 



15 



20 



25 



30 



35 



55 



60 



65 



70 



75 



(A) ORGANISM: Porphyroroonaf gingivalis 

fix) FEATURE: 

{A) NAME/KEY: misc_f eature 
(B> LOCATION 1...206 

(Kt) SEQUENCE DESCRIPTION SEQ ID NO:333 



Lys 


Gin 


He 


Val 


Met 


Thr 


Val 


Lys 


Arg 


Ala 


Val 


Arg 


He 


Ala 


Leu 


Leu 


1 








5 










10 










15 




Thr 


Leu 


He 


Gly 
20 


He 


Leu 


Phe 


Ser 


Ser 
25 


Pro 


Ser 


Leu 


Val 


Arg 
30 


Ala 


Gin 


Ser 


Leu 


Phe 
3S 


Ser 


Thr 


GlU 


Ills 


Val 
40 


Leu 


Gin 


Leu 


Tyr 


Asn 
45 


Lys 


He 


Leu 


Tyr 


Gly 


Glu 


Ser 


Ala 


Ala 


Asp 


Thr 


Val 


Ala 


Glu 


Lys 


The 


Ala 


Gly Glu 




50 










55 










60 










Ssr 


Ala 


Phe 


Pro 


Phe 


He 


Asp 


Lys 


Leu 


He 


Asn 


Leu 


Gly Arg 


Thr 


Phe 


65 




















75 










80 


Leu 


Gly 


Lys 


Pro 


Tyr 
85 


Arg 


Tyr 


Arg 


Gly 


Pro 
90 


Ser 


Pro 


Trp 


Pro 


Met 
95 


Asp 


Cys 


Ser 


Gly 


Tyr 
100 


Val 


Ser 


Tyr 


Leu 


Tyr 
105 


Ser 


Lys 


Phe 


Asp 


He 
110 


Lys- 


Leu 


Pro Arg 


Gly 


Ala 


Ala 


Ala 


Gin 


Ser 


Gin 


Tyr 


Thr 


Asn 


Pro 


He 


Glu 


Arg 






115 










LZ0 










125 








GlU 


Asp 
130 


Val 


Arg 


Pro 


Gly 


Asp 
135 


Leu 


Leu 


Phe 


Phe 


Lys 
140 


Gly 


Arg 


Asn 


Ala 


Arg 


Ser 


Asn 


Arg 


He 


Gly 


His 


Val 


Ala 


Leu 


Val 


Val 


Ser 


Val 


Asp 


Glu 


14b 










150 










155 










160 


Asp Asp 


lis 


Thr 


Met 


Met 


His 


Ssr 


Arg 


Asn 


Ser 


Arg 


Gly 


He 


Val 


He 










165 










170 










175 




Glu 


Lys 


Leu 


Asn 


Axg 


Ssr 


Ala 


Tyr 


Phe 


Ser 


Arq 


Arg 


Leu 


Val 


Ser Tyr 








ieo 










185 










190 






Gly 


Arg 


Val 
195 


Pro 


Gly 


Ala 


Lys 


Arg 
200 


Val 


He 


Pro 


Arg 


Lys 
205 


Ser 







(2) INFORMATION FOR SEQ ID NO; 334 



(if SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE; protein 

45 (iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE; 

(A) ORGANISM: Porphyromona. 1 ' gingivalis 

50 (ix) FEATURE: 

(A) NAME/ KEY: mi sc_f eature 
(0) LOCATION 1, . .467 



txi) SEQUENCE DESCRIPTION: SEQ ID HO: 334 



Lys 


Gly Thr 


Trp 


Asn 


Arg 


Lys 


Asn 


Arg 


Arg 


Glu 


Glu 


Met 


Lys Arg 


Thr 


1 








5 










10 








15 




He 


Leu 


Leu 


Thr 


Ala 


Leu 


Thr 


Val 


Leu 


Ser 


Ser 


Leu 


Ser 


Leu Leu 


Arg 








20 










25 










30 




Ala 


Gin 


Asn 


Glu 


Ser 


Glu 


Ala 


Ser 


Thr 


Asn 


Pro 


Met 


Ser 


Gly Leu 


Ser 






35 










40 










45 




J.eu 


Glu 


Asp 


Cys 


He 


Arg 


He 


Ala 


Lys 


Glu 


Arg 


Asn 


Leu 


Asn Leu 


Arg 




50 










55 










60 






Ara 


Gin 


Glu 


He 


GlU 


Gin 


Glu 


Asn 


Arg 


He 


He 


Ser 


Leu 


Asp Ala 


Ala 












70 










75 








80 


Arg 


His 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asn 


Ala 


Gly 


He 


Gly His Asn Tyr 










85 










90 








95 




Ser 


Phe 


Gly Arg 


Ser 


Lys 


A3p 


Lys 


Thr 


Gly 


Val 


Thr 


Val 


Asp Axg 


Ser 








100 










105 










110 




Ser 


Met 


Asn 


Thr 


Asn 


Leu 


Ser 


He 


Gly 


Ala 


Ssr 


Val 


Glu 


Val Phe 


Ser 






115 










120 








125 






Gly 


Thr 


Arg 


Arg 


Leu 


His 


Asp 


Leu 


Lys 


Gin 


Gin 


Lys 


Tyr 


Asn Val 


Glu 




130 










135 










140 








Asp Gly 


He 


Ala 


Arg 


Leu 


Gin 


Lys 


Ala 


Arg 


Glu 


Asp 


Leu 


Ser Leu 


Gin 


145 










150 










155 








160 
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lie Ala Ala Leu Tyr Tie Asn Leu Leu Phe Arg Gin Glu Met Thr Arq 

165 170 175 

Thr Ala Glu Thr Gin Leu Ala Leu Xle Arg Glu Gin Arg Asn Arg Thr 
180 1B5 " 190 

5 Ala Glu Met Val Arg Val Gly Lys Trp Ala Glu Gly Lys Leu Leu Asp 
195 200 205 

lie Asn Ala Gin Met Ala Lys Asp Glu Gin Leu Leu Val Gin Tyr Arg 

210 215 Z2Q 

Ser Glu Glu Glu Leu Ala Arg Leu Asp Leu Gly Gin Ala Leu Glu Leu 
10 225 230 235 240 

Glu His Pro Glu Ser He Ala Val Lys Ala Pro Asp Thr Asp Val Leu 

245 250 255 

Val Ala Glu Arg Leu Gly Ser Leu Leu Ala Pro Glu Glu He Tyr Arg 
260 265 270 

15 Thr Ala Leu Gly Leu Lys Pro Ala Leu His Ser Ser Glu Leu Gin He 
275 280 285 

Ala Ser Ala Arg Glu Gly Leu Ala Ser Ala Arq Ala Ala Tyr Phe Pro 

290 295 300 

Thr Leu Ser Leu Ser Ala Gly Tyr Ser Asn Gly Tyr Phe Arg Asp Leu 
20 305 310 315 320 

Gly Lys Glu Tyr Ala Ala He Asn Pro Ser Phe Ser Glu Gin Trp Lys 

325 330 335 

Asn Asn Gly Ser Tyr Ser lie Gly Leu Ser Leu Asn He Pro He Phe 
340 345 350 

25 Ser Ala Met Gin Thr Gin Asp Arg Val Arg Ser Ser Arg Leu Gin He 
35b 360 36b 

Arg Ser Ser Glu Leu Arg Leu Val Glu Glu Lys Lys Ala Leu Tyr Lys 

370 375 380 

Glu He Arg Gin Ala Tyr Ser Asn Ala Val Ala Ala Asp Lys Ala He 
30 38b 390 395 400 

Ala Ala Ala Glu Asn Ser Lys Ala Ala Thr Le\i Lys Ala Tyr Glu Tyr 

405 410 415 

Ala Arg Asp Ser Phe Glu Ala Gly Arg Leu Sor Ala Tyr Glu Tyr Ala 
420 425 430 

35 Glu Ala Lys Thr Lys Tyr Ala Leu Ser Gin Val Glu Glu Leu Arg Ala 
435 440 445 

Lys Tyr Asp Phe lie Tyr Lys Ala Lys Val Leu Asp Phe Tyr Gin Gly 

450 455 460 

Lys Asp Phe 
40 4 65 

[2) INFORMATION FOR SEQ ID NO: 335 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH; 4 SI amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



50 



55 



60 



65 



70 



75 



(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvij ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonas gingival! s 

(ix) FEATURE : 

(A) NAME/ KEY: mis cofeature 

(B ) LOCATION 1-..451 





<xi> 


SEQUENCE DESCRIPTION: SEQ ID NO: 335 








Lys 


Thr 


Ser 


Tyr 


Arq 


Asn 


Asn 


Met 


Arg 


Phe 


Gin 


HJs 


Tyr 


Leu He 


Cys 


1 








5 










10 








15 




Thr 


Ala 


Ala 


Val 


Ala 


Ala 


Leu 


Ala 


Ala 


Asn 


Pro 


Leu 


Thr Gly Gin 


Ser 








20 










25 










30 




Asn 


Met 


Thr 


Leu 


Glu 


Glu 


Cys 


He 


Asp 


Tyr 


Ala 


Arg 


Arg 


His Ser 


Ser 






35 










40 










45 






Ala 


Val 


Ala 


Leu 


Ser 


Ala 


Ala 


Glu 


Leu 


Glu 


Gin 


Ser 


Lys 


Ala Asp 


Tyr 




50 










55 










60 








Leu 


Gin 


Ala 


Val 


Gly 


Asn 


Phe 


Leu 


Pro 


Arg 


Val 


Ser 


Ala 


Gly Thr 


Gly 


65 










70 










7S 








80 


Ala 


Ser 


Trp 


Asn 


Phe 


Gly 


Arg 


Gly 


Leu 


Asp 


Ala 


Glu 


Thr 


Asn Thr 


Tyr 










85 










90 








95 




Thr 


Asp 


He 


Asn 


Ser 


Phe 


Asn 


Asn 


Ser 


Tyr 


Ser 


He 


His 


Ala Thr 


Met 








100 










105 










110 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 



Thr 


Leu 


the 


Asp 


Gly 


Leu 


Gin 


Ser Val 


Tyr 


Arg 


L=u 


Arg 


Met 


Ala 


His 






115 










120 








125 








Ala 


Arg 


Arg 


Glu 


Ala 


Ser 


Arg 


Leu Ser 


Val 


Arq 


Glu 


Gin 


Gin 


Glu 


Leu 




130 










13S 








140 










Ala 


Ala 


Leu 


Gly 


Thr 


Thr 


Glu 


Ala Tyr 


Tyr 


Asp 


Leu 


Val 


Tyr 


Ala 


Arg 


145 










150 








155 










160 


Gin 


Met 


Gin 


Glu 


Leu 


Ala 


Met 


Gin Lys 


Tyr 


Glu 


Glu 


Ser 


Ser 


Arg 


Leu 










165 








no 










17 5 




His 


Arq 


Gin 


Thr 


Ala 


Arg Met 


Glu Glu 


Leu Gly 


Met 


Lye 


Ser 


Arg 
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(2) INFORMATION FOR SEQ ID NO; 336 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 962 amino acids 
50 (B) TYPE: amino acid 

|D> TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

55 (ill) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gingivalis 

60 <ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(8) LOCATION 1...962 



(xi> SEQUENCE DESCRIPTION: SEQ 10 UO:336 
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(2) INFORMATION FOR SEQ ID HO: 337 

(l) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH : 702 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

<ix) FEATURE: 

(A) NAME/ KEY: mlsc_f eature 
(DJ LOCATION 1. . ,702 



(XiJ SEQUENCE DESCRIPTION: SEQ 10 HO: 337 
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505 










S10 






Pro 


Pro 


Glu 


Asn 


Leu 


Phs 


Glu 


Asp 


Gly Thr 


Gin 


Thr 


Tie 


lie 


Gly Tyr 






515 










520 










525 








Pro 


Pro 


Thr 


Clu 


Thr 


He 


Ser 


Phe 


His 


Lys 


Lys 


Glu 


Asn 


Asn 


Thr 


Tyr 




530 










53 b 










54 0 








Pro 


He 


Val 


Trp 


Ser Asp 


Thr 


Asn 


Thr 


The 


Lys 


Gin 


Asp 


Leu 


Glu 


Thr 


545 










550 










555 










560 


Ser 


Arg 


Met 


He 


Ser 


Pro 


Ser 


Phe 


Glu 


Leu 


Ala 


Ser 


Gin Leu Gly Ala 










565 










570 










57 5 




Thr 


Leu 


Pro 


Met 


Pro 


Tyr 


Leu 


Glu 


Tyr 


Trp 


Pro 


Gly 


Thr 


Ser 


Tyr 


Leu 








580 










585 










590 






Leu 


Asp 


Tyr 


Sex 


Gly Asn 


Tyr 


Asn 


Asn 


Lys 


Arg 


Tyr 


Ala 


Leu 


Phe 


Asn 






595 










600 










605 








Cys 


Ala 


Phe 


Tyr Trp Glu 


Lys 


Arg 


Lys 


Val 


Asn 


Asn 


Glu 


Glu 


He 


Lys 




610 










615 










620 










Phe 


Asp 


Asp 


Trp Arg 


Leu 


Pro 


Thr 


Glu 


Ala 


Glu 


He 


Lys 


Lsu 


He 


Asp 


625 










630 










635 










640 


Lys 


Leu 


Gin 


His 


Asn 


Glu 


Gin 


Ser 


Ala 


val 


Gin 


Ala 


He 


Met 


Thr 


Gly 










645 










650 










655 


Asn 


Tyr 


Tyr 


Trp 


Asp 


Ser 


Tyr 


Ser 


Ala 


Asn 


Gly 


Ser 


Tyr 


Lys 


Met 


Gin 








660 










665 










67 0 






Gly 


Gly 


Gly 


Gly Gin Gly Asn 


Ser 


Ser 


Lys 


Ala 


Tyr 


Val 


Arg 


Cys 


Val 






67b 










600 










685 








Arg 


Asp 


Val 


Lys 


Lys 


Pro 


He 


Arg 


Asp 


Lys 


Lys 


Ser 


Gly Lys 








690 










695 










700 











(2) INFORMATION FOR SEQ ID HO: 338 
(i) SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



WO 99/29870 



PCT/AU98/01023 



264/ 490 



(A) LENGTH. 1312 amino acids 
tB) TYPE: amxrvo acid 
t D) TOPOLOGY: linear 

5 til] MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 
10 1A] ORGANISM: Porphyi omonas ginglvalis 

(ix) FEATURE: 

(A» NAME/ KEY*, rnxs cofeature 
tB) LOCATION 1 1312 

15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 





(Ki) 


SEQUENCE DESCRIPTION: SEQ ID NO: 338 










Lys 


Arg 


Val 


Asp 


Ser 


Tyr 


Glu 


Cys 


Arg 


Gin 


Lys 


Ala 


Cys 


Lys 


Cys 


Ala 


1 








5 










10 










15 




lie 


Cys 


Val 


He 


Gin 


Lys 


Phe 


Thr 


Asn 


Val 


Lys 


Leu 


Asn 


Asp 


Met 


Arg 








20 










25 










30 






Lys 


lie 




Ser 


Phe 


Leu 


Met 


Met 


Cys 


Ser 


Leu 


His 


Leu 


Gly 


Leu 


Gin 






35 










40 










45 








Ser 


Gin 


Thr 


Trp 


His 


Gly Asp 


Pro 


Asp 


Ser 


Val 


Ala 


Ala 


Leu 


Pro 


Ser 




50 










55 










60 










lie 


Gly 


lie 


Gin 


Glu 


Ser 


Ser 


Cys 


Thr 


Arg 


lie 


Thr 


Phe 


Glu 


Val 


Val 


65 










TO 










75 










80 


Phs 


Pro 


Gly 


Phe 


Tyr 


Ser 


Val 


Glu 


Lys 


Arg 


Glu 


Gly 


Asn 


Gin 


Val 


Phe 










85 










90 










95 




Gin 


Arg 


He 


Ser 


Met 


Pro 


Gly 


Cys 


Gly 


Ser 


Phe 


Gly 


Asn 


Leu 


Gly 


Glu 








100 










105 










110 






Ala 


Glu 


Leu 


Pro 


Val 


Leu 


Lys 


Lys 


Met 


He 


Ala 


Val 


Pro 


Glu 


Phe 


Ser 






115 










1?0 










125 








Thr 


Ala 


Asn 


Val 


Ala 


Val 


Lys 


He 


Lys 


Glu 


Thr 


Glu 


Thr 


Phe 


Asp 


Asn 




130 










135 










140 










Tyr 


Asn 


He 


Tyr 


Pro 


Asn 


Pro 


Thr 


Tyr 


Val 


Val 


GlU 


Glu 


Leu 


Pro 


Glu 


145 










150 










155 










160 


Gly 


Gly Thr 


Tyr 


Leu 


Val 


Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn Asp 


Tyr 


Tyr 










165 










170 










175 




Ser 


Gin 


Asn 


Val 


Ser 


Leu 


Pro 


Ser 


Thr 


His 


Tyr 


Val 


Tyr 


Ser 


Gin 


Asp 








ieo 










185 










190 






Gly 


Tyr 


Phe 


Arg 


Ser 


Gin 


Arg 


Phe 


He 


Glu 


Val 


Thr 


Leu 


Tyr 


Pro 


Phe 






195 










200 










205 








Arg 


Tyr 


Asn 


Pro 


Val 


Arg 


Gin 


Glu 


He 


Leu 


Phe 


Ala 


Lys 


Lys 


He 


Glu 




210 










215 










220 










Val 


Thr 


He 


Thr 


Phe 


Asp Asn 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 


Asn 


Thr 


225 










230 










235 










240 


Gly 


He 


Phe 


Asn 


Lys 


Val 


Ala 


Ser 


Ser 


Ala 


Phe 


He 


Asn Tyr 


Glu 


Ala 










245 










250 










255 




Asp 


Gly Lys 


Ser 


Ala 


He 


Glu 


Asn 


Asp 


Met 


Val 


Phe 


Ser Arg 


Gly 


Thr 








260 










265 










270 




Thr 


Thr Tyr 


He 


Ser 


Gly Asn 


Val 


Ala 


Ser 


Asn 


Leu 


Pro 


Gin 


Asn 


Cys 






275 










280 










285 






Asp 


Tyr 


Leu 


Val 


He 


Tyr 


Asp 


Asp 


Met 


Phe 


Asn 


Val 


Asn 


Gin 


Gin 


Pro 




290 










295 










300 










His 


Asp 


Glu 


He 


Lys 


Arg 


Leu 


Cys 


Glu 


HI a 


Arg 


Ala 


Phe Tyr 


Asn 


Gly 


305 










310 










315 










320 


Phe 


Asp 


Val 


Ala 


Ala 


Val 


Ser 


He 


Lys 


Asp 


Val 


Leu 


Asn 


Ser 


Phe 


Pro 










325 










330 










335 




Ser 


Asn 


Ala 


Thr 


Ser 


Tyr 


He 


Asn 


Glu 


Thr 


Lys 


Leu 


Lys 


Asn 


Phe 


lie 








340 










345 










350 






Arg 


S-r 


Val 


Tyr 


Asn 


Gin 


Ser 


Asn 


Ala 


Lys 


Arg 


Thr 


Leu 


Asp 


Gly 


Lys 






355 










360 










365 








Leu 


Gly Tyr 


Vol 


Leu 


Leu 


He 


Gly 


Lys 


Pro 


Leu 


Ser 


Lys 


Tyr 


Leu 


Al* 




370 










375 










380 










Asp 


Thr 


Asp 


Asn 


Thr 


Lys 


Val 


Pro 


Thr 


Ser 


Phe 


He 


His 


Asn 


Val 


Ser 


385 










390 










395 










4 00 


Leu 


lis 


Pro 


Ser 


His 


Pro 


Thr 


Phe 


Gly 


Ser 


He 


cys 


Ala 


Ser 


Asp 


Tyr 










405 










410 










415 




Phe 


Phe 


Ser 


Cys 


Val 


Ser 


Pro 


Leu 


Asp 


Thr 


Val 


Gly" 


Asp 


Leu 


Phe 


He 








420 










425 










430 






Gly 


Arg 


Phe 


Ser 


Val 


Thr 


Asn 


Ala 


His 


Glu 


Leu 


His 


Asn 


Leu 


He 


Glu 






435 










440 










445 








Lys 


Thr 


He 


Asn 


Lys 


Glu 


He 


Ser 


Tyr 


Asn 


Pro 


He 


Ala 


His 


Lys 


Asn 




450 










455 










460 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



lie 




Tyr 


Ala 


Glu 


Gly 


l.ys 




Cys Asp 


Ala 


Pro 


He 


Leu 


Arg 




4 65 










4 70 








475 










480 


Phe 




Lys 


Glu 


Tie 


Ala 


Ser 


o y 


Tyr Thr 


Val 


Asn 


Ser 


He 


Leu 


Lys 










4 85 








4 90 










495 






n 


Gl n 


Val 


Ser 


Ala 


He 


As 

P 


Ser He 


Phe 


Asp Cys 


Leu 


Asn 


Asn 








500 










505 








510 






Gly 




His 


His 


Phe 


Tyr 


Phe 


Asn 


Thr His 


Gly 


Met 


Pro 


Th£ 


Val 


* r P 




51 5 










520 








525 








Gly 


lie 


Giy 


Gl n 


uiy 


Leu 


Asp 


Val 


Asn Thr 


Leu 


Thr 


Ala 


Arg 


Leu 


sn 




630 










535 








540 










A=n 


Thr 


Ser 


Ser 


Gin 


G] y Leu 




Thr Ser 


Leu 


Ser 


Cys 


Ser 


Ser 


Ala 


545 










550 








555 










560 


Val 


Ala 


Arp 


Ser 


Thr 


lie Arg 


Se 
er 


Leu Gly Glu 


Val 


Leu 


Thr 


Thr 


Tyr 










565 








57 0 










575 




Ala 


Pro 


Asn 


Lys 


Gly 


Phe 


Ser 


Ala 


Phe Leu 


Gly 


Gly Ser Arg Ala 


Thr 








580 










565 








590 






Gin 


Tyr 


Ala 


Val 


ryr 


Leu 


Glu 


Gly 


Pro Cys 


Pro 


Pro 


Ser 


a u 


Phe 


Tyr 






595 










600 








605 








Glu 


Tyr 


Leu 


Pro 


Tyr 


Ser 


Leu 


Tyr 


His Asn 


Leu 


Ser 


Thr 


Val 


Val 


Gly 




610 










615 








620 










Gl u 


Met 


Leu 


Leu 


Ser 


Ser 


Tie 


lie 


Asn Thr 


Asn 


Ser 


Val 


Asp 


Thr 


Tyr 


62S 










630 








635 










640 


Ser 


Lys 


Phe 


Asn 


Phe 


Asn 


Leu 


Leu 


Gly Asp 


Pro 


Ala 


Leu 


Asn 


He 


Met 


















650 










655 




Ala 


His 


Gly 


Met 


Glu 


Val 


Ser 


Asn 


Cys He 


Thr 


Leu 


Pro 


Asn 


Asn 


Thr 








660 










665 








67 O 






lie 


lie 


Stic 


Ser 


Pro 


He 


Thr 


He 


Lys Asn 


Gly 


Gly 


Cys 


Leu 


Lys 


He 






6 /5 










630 








685 








Pro 


Glu 


Lys 


Gly 


Val 


Leu 


His 


Phe 


Thr Asn 


Asn 


Gly 


Ser 


He 


Gin 


Val 




690 










695 








700 










Met 


Ser 


Gly 


Gly 


Th c 


Leu 


Glu 


He 


Gly Asn 


Gin 


Ala 


Lys 


He 


Ser 


Gly 


705 










710 








715 










720 


Glu 


Thr 


Gly Ala 


Asn 


Pro 


Thr 


Pho 


He Thr 


Val 


Tyr 


Gly Asp Gly Leu 










725 








730 










735 




Aid 


lie Asn Lye 


Gin 


Val 


Glu 


He 


Asp Asn 


He 


Asp 


Arg 


Leu 


Asn 


Leu 








740 










745 








750 








Ser 


Thr 


His 


Ser 


Val 


Met 




Lys Phe 


His 


Phe 


Asp 


Ser 


Val 


Lys 






755 










760 








765 








Phe 


Asn 


Ser 


Ala 


Pro 


Leu 


Tyr 


Thr 


Thr Asn 


Cys 


He 


Val 


Glu 


He 


Ser 




770 










775 








730 










Asn 


Cys 


Clu 


Phe 


Thr 


Asn Arg 


er 


Asp He 


He 


Ser 


Lys 


Asn 


Cys 


Asp 


785 










790 








795 










800 


Leu 


Ser 


Val 


Glu 


Asn 


Ser 


Met 


Phe 


Ser Ser 


Ser 


Gly He 


Thr 


Val 


Phe 










805 








810 










815 




Lys 


Pro 


Met 


Ala 


Thr 


Ser 


Ser 


He 


Thr Gly Lou 


Ser 


Thr Lys 


Ala Lys 






020 










825 








830 






lie 


Thr 


Asp 


Asn 


Thr 


Phe 


Phe 


Ala 


Thr Gly Asn. 


Phe 


Ala 


Tyr 


His 


He 






635 










840 








845 








Thr 


Asn 


Thr 


Pro 


Gly 


Leu 


Thr 


Ala 


Thr Ser 


Aen 


Ala 


Ala 


He 


Lys 


Leu 




850 










855 








860 










Asp 


Asn 


lie 


Pro 


Glu 


Tyr 


Tyr 


He 


Ser Gly 


Asn 


Lys 


He 


Val 


Asn 


Cys 


665 










87 0 








87 5 










860 


Asp 


Glu 


Ala 


Leu 


Val 


Leu 


Asn 


Asn 


Ser Gly Asn 


Arg 


Thr 


Asn 


Arg 


Leu 










885 








890 










895 




His 


Asn 


IlG 


Thr 


Arg 


Asn 


Val 


He 


Lys Asn 


Cys 


Arg 


He 


Gly 


Ser 


Thr 








900 










905 








910 






Leu 


Tyr Asn 


Ser 


Tyr Gly lie 


Tyr 


Asn Arg 


Asn 


Lys 


He 


Ser 


Asn 


Asn 






915 










920 








925 








His 


lie 


Gly 


Val 


Arg 


Leu 


Leu 


Asn 


Asa Ser 


Cys 


Phe 


Tyr 


Phe 


Asp 


Asn 




930 










935 








940 










Ala 


Pro 


Val 


lie 


Asn 


Glu 


Glu 


Asp 


Lys Gin 


Thr 


Phe 


He 


Ser 


Asn 


Arg 


945 










950 








955 










960 


Thr Trp Gin 


Leu 


Tyr 


Ser 


Ser 


Asn 


Gly Thr 


Phe 


Pro 


Leu 


Asn 


Phe 


His 










965 








97 0 










975 




Tyr 


Asn 


Ser 


Leu 


Gin 


Gly Gly Asp Thr Asp T>"»r 


Trp 


He 


Tyr Asn 


Asp 








980 










985 








990 






Thr 


Tyr 


Thr 


Ar*n 


Arg 


Tyr 


He 


Asp 


Val Ser 


Asn 


Asn 


His 


Trp 


Gly Asn 






995 










1000 






1005 






Asn 


Asp 


Leu 


Phe 


Asp 


Pro 


Asn 


Gin 


Val Phe 


Asn 


Thr 


Pro 


Asp 


Leu 


Phe 




1010 








1015 






1020 






lie 


Trp 


He 


Pro 


Phe 


Trp 


Asp 


Gly 


Leu Pro 


Asn 


Gly Arg 


Ser 


Gly Asn 


1025 








1030 






103b 










Ser 


Ser 


Ala 


Glu 


Ala 


Val 


Glu 


Phe 


Gin Thr 


Ala 


Leu Asp Cys 


He 


Gly 










1045 






10! 


50 








1055 


Asn 


Ser 


Asp 


Tyr 


Leu 


Ser 


Ala 


Lys 


Val Ala 


Leu 


Lys 


Met 


Met 


Val 


Glu 



1040 
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1060 1065 1070 

Thr Tyr Pro Glu Ser Asp Phe Ala He Ala Ala Leu Lys Glu Leu Phe 

1075 1080 1085 

Arg He Glu Lys Met Ser Gly Asn Asp Tyr Glu Gly Leu Lys Asp Tyr 
5 1090 1095 1100 

Phe Arg Ser Asn Pro Thr He He Ser Sex Gin Asn Leu Phe Pro Thr 
1105 1110 1115 U20 

Ala Asp Phe Leu Ser Ala Arg Cys Asp He Val Cys Glu Asn Tyr Gin 
1125 1130 1135 

10 Ser Ala He Asp Trp Tyr Glu Asn Arg Leu Asn Ser Glu He Sor Tyr 
1140 1145 1150 

Gin Asp Ser Val Phe Ala Val He Asp Leu Gly Asp He Tyr Trp Asn. 

1155 1160 1165 

Met Gin Leu Asp Ser Leu Arg Gly Thr Gly He Asp Leu Asn He Leu 
15 1170 1175 1180 

Ser Cys Glu Gin Arg Lys Ser Leu Glu Ser His Gin A.m Val Lys Asn 
1135 1190 1195 1200 

Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly Thr Leu Leu Pro Pro 
1205 1210 1215 

20 Leu Glu Cys Asn Lys Ser Ser Leu Asp Lys Ser Lys He lie Ser He 
1220 1225 1230 

Scr Pro Asn Pro Ala Lys Ala Val Val Thr He He Tyr Tyr Thr Asp 

1235 1240 1245 

Asn Pro Ser Cys Ser Val He Lys He Tyr Gly He Asn Gly Ala Scr 
25 1250 1255 1260 

Ala Asp He Thr Gly Leu Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser 
1265 1270 127b 1280 

He Gin Fhe Asn Thr Ser Asn Phe Asp Pro Gly the Tyr Leu Val Thr 
126S 1290 1295 

30 Leu Asn Val Asp Gin Lye He He Asp Thr Glu Lys Leu Arg He Lys 
1300 1305 1310 

(2) INFORMATION FOR SEQ ID NO; 339 

35 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

40 (ill MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 
45 (A) ORGANISM: Porphyromonas gingival is 

(Ik) FEATURE: 

(A) NAME/ KEY: misc_featur* 
IB) LOCATION 1. . . 938 



50 



55 



60 



65 



70 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 



Ser 


Glu 


Asn 


Tyr 


Arg 


Tyr 


Gly 


Lys 


He 


Thr 


Asn 


Gin 


He 


Met 


Ala 


lie 


1 








5 










10 










15 




Tfet 


Met 


Lys 


Ser 


lis 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 








20 










25 










30 






Trp 


Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


GlU 


He 


Ser 


Gly 


Met 


Asn 






35 










40 










45 








Ala 


Ser 


Cys 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


Ho 


Leu 


Tyr 


Glu 




50 










55 










60 








Ser 


Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


He 


Asp 


Ala 


65 










70 








75 








80 


Asp 


Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 










as 










90 










95 




Pro 


Tyr 


Gly 


His 


Asn 


Gly 


Leu 


Cys 


Thr 


Tyr 


Ser 


His 


He 


Arg 


Ser 


Gly 








100 










105 










110 






He 


Ser 


Thr 


Ala 


Gly 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Asn 


lie 


Glu 


Gly 


Ale 






115 










120 










125 






Lys 


Arg 


Val 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


GlU 




130 










135 










14*0 










His 


T V r 


Ala 


Val 


Mel. 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 


145 










150 










155 








160 


Val 


Leu 


Leu 


Phe 


Asp 


Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val 










165 










170 










175 




Trp 


Arg 


Arg 


Arg 


lie 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


lie 


Ala 
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180 

Trp Arq His Tyr 
195 

Asp Asp Val Thr 
5 210 

Asp Phe Thr Val 
225 

Asn Tyr Pro Glu 

10 Gin Leu Ser Gly 

260 

lie Lys Asp Val 
275 

Arq Asp Asn Pro 
15 290 

Glu Ser lie Glu 
305 

Asp Ala lie Leu 

20 Trp Leu Val He 
340 

Leu Asn Ala Tyr 
355 

Leu Ser Ala Ser 
25 370 

Tyr Leu He Thr 
385 

Val Ser Thr Gin 

30 Ala Ser Thr Thr 
420 

Glu Thr Met Thr 
435 

Asn Leu Pro Glu 
35 450 

Thr Acp lie Tyr 
465 

Pro Ala Ser Glu 

40 Glu Asn Asn Lys 
500 

Pro Asp Lys Thr 
515 

He Tyr Ala Asn 
45 530 

Leu Glu Tyr lie 
545 

Val Glu Tyr Cys 

50 Ser Val Cys Asp 
580 

Tyr Glu Gly Phe 
595 

Asp Ala Asp Gly 
55 610 

Tyr Gly His Asp 
625 

Met He Gly Val 

60 Glu Gly Ala Lys 

660 

Tyr Ser Ala Glu 
67 5 

Val Glu Asp Phe 
65 690 

Asn Gly Ala Trp 
705 

Tyr He Ala Trp 

70 Leu Asp Asp He 
740 

Val The Asp Phe 

755 

Lys Trp Asn Tyr 
75 770 



185 



Lys 


Val 


Thr 




Ser 


His 














Val 


Tyr 






He 








215 








T 1 =i 

tie 


Asn 


He 


Gl y 


Gin 


As 1 > 




230 








Asp 


Tyr 


Gin 


Pro 


Glu 


Gly 


245 










250 


Tyr 


Asn 


He 


Tyr 


Ala 


Asn 










265 




Ser 


He 


Leu 


Gl u 


Tyr 


Val 








2B0 






Leu 


Gin 


Val 


Glu 


Tyr 


Cys 






295 








Ser 


Ser 


Thr 


Val 


Cys 


Gly 




310 










Tyr 


Glu 


Asn 


Phe 


Glu 


Asn 


325 










330 


Asp 


Ala 


Asp 


Gly 


Asp 


Gly 










3-15 




Asp 


Ala 


Phe 


Pro 


Gly 


Mis 








360 






Tyr 


Val 


Pro 


Gly 


He 


Gly 






37 5 








Pro 


Lys 


Val 


Glu 


Gly 


Ala 




390 










Asp 


Ala 


Asn 


Trp 


Ala 


Ala 


4 05 










410 


Gly 






Val 


Gly 


Asp 










425 




Ala 


Lys 


Pro 


Thr 


Gly 


Ala 








44,0 






Gly 


Thr 


Lys 


Tyr 


He 


Ala 






455 








Phe 


Leu 


Lys 


Leu 


Asp 


Asp 




470 










Pro 


Glu 


Pro 


Val 


Thr 


Asp 


485 










490 


Gly 


Arg 


Leu 


Lys 


Trp 


Asn 










505 




Asp 


Asp 


Lys 


Asp 


Pro 


Leu 








520 






Gly 


Ser 


Leu 


Leu 


Val 


His 






535 








Asp 


Glu 


Thr 


Tyr 


Ser 


Ser 




Th° 










Val 




Al a 


Val 


Tyr 




565 










570 


Lys 


Leu 


He 


Tyr 


Asp 


Ser 










585 




Glu 


Ala 


Gly 


Ser 


He 


Pro 








600 






Asp 


Asn 


Val 


Asn 


Trp 


Asp 






615 








Ser 


Glu 


Lys 


Cys 


He 


Ala 




630 










Leu 


Thr 


Pro 


Asp 


Asn 


Tyr 


645 










650 


Leu 


Val 


Lys 


Tyr 


Trp 


Val 










665 




His 


Tyr 


Ala 


Val 


Met 


Val 








6B0 






Val 


Leu 


Leu 


Phe 


Glu 


Glu 






695 








Tyr 


Glu 


Arg 


Thr 


He 


Thr 




710 










Arg 


His 


Tyr 


Asp 


Cys 


Thr 


725 










730 


Thr 


Val 


Tyr 


Arg 


Ser 


Thr 










745 




Val 


Val 


Ser 


Leu 


He 


Glu 








760 






Pro 


Asn 


Gly 


Tyr 


Glu 


Pro 



775 



267/ 490 



190 



Thr 


Gl u 


Phe 


Leu 


Lys 


Leu 






205 








Gly 




Glu 


Pro 


Ala 


Thr 




220 










Val 


Gly 


Arg 


Leu 


Thr 


Trp 


235 










240 


Lys 


Gly 


Asn 


Glu 


Glu 


Leu 










255 




tsly 


Thr 


Leu 


Leu 


Ala 


Gin 








270 






Asp 


Ser 


Thr 


Tyr 


Ser 


Leu 






285 








Val 


Thr 


Ala 


Val 


Tyi 


Asp 




300 










Thr 


Leu 


His 


Tyr 


Ala 


Thr 


315 










320 


Gly 


Pro 


Val 


Pro Asn Gly 










335 




Phe 


Ser 


Trp Gly His 


Tyr 








350 






Asn 


Gly 


Gly 


His 


Cys 


Ser 






365 








Pro 


Val 


Thr 


Pro 


Asp 


Asn 




380 










Lys 


Arg 


Val 


Lys 


Tyr 


Trp 


395 










4 00 


Glu 


His 


Tyr 


Ala 


Val 


Met 










415 




Phe 


Val 


He 


Leu 


Phe 


Glu 








430 






Trp 


Tyr 


Glu 


Arg 


Thr 


lie 






445 








Trp 


Arg 


His 


Tyr 


Asn 


Cys 




460 










He 


Thr 


Val 


Phe 


Gly 


Thr 


47 5 










480 


Phe 


Val 


Val 


Ser 


Leu 


He 










4 95 




Tyr 


Pro 


Asn 


Gly Tyr 


Glu 








510 






Gin 


Leu 


Ala 


Gly Tyr 


Asn 






525 








He 


Gin 


Asp 


Pro 


Thr 


Val 




54 0 










Arg 


Asp 


Asp 


Gin 


Val 


Glu 


555 










560 


Asp 


Asn 


He 


Glu 


Ser 


Gin 










57 5 




Gin 


Ser 


Asp 


He 


He 


Leu 








590 






Glu 


Gly 


Trp 


Leu 


Leu 


He 






€05 








Tyr 


Tyr 


Pro 


Trp 


Thr 


Met 




620 










Ser 


Pro 


Ser 


Tyr 


Leu 


Pro 


635 










640 


Leu 


Val 


Thr 


Pro 


Arg 


Leu 










655 




Ser 


Ala 


Gin 


Asp 


Ala 


Val 








67 0 






Ser 


Thr 


Thr 


Gly 


Thr 


Ala 






605 








Thr 


Met 


Thr 


Ala 


Lys 


Ala 




700 










Leu 


Pro 


Ala 


Gly 


Thr 


Lys 


715 










720 


Asp 


Met 


Phe 


Phe 


Leu 


Leu 










735 




Glu 


Thr 


Val 


Pro 


Glu 


Pro 








7S0 






Asn 


Asn 


Lys Gly Arg 


Leu 






765 








Asp 


Lys 


Thr 


Asp 


Asp 


Lys 




780 
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Lys 


Pro 


Leu 


Gin 


Leu 


Thr 


Gly 


Tyr 


Asn 


He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 


785 










790 










795 








800 


Leu 


Val 


His 


He 


Gin 


Asp 


Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


Asp 


Glu 


Thr 










605 










810 








815 




Tyr 


Ser 


Ser 


Arg 

820 


Asp 


Gly 


Gin 


Val 


Glu 
825 


Met 


Glu 


Tyr 


Cys 


Val 
830 


Thr 


Ala 


Val 


Tyr 


Asn 


Asp Asn 


He 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Asp 


Lys 


Leu 


Asn 






835 










B40 










845 








Tyr 


Thr 

aso 


lie 


Thr 


Ser 


IrOU 


Asp 

855 


Asn 


He 


Gin 


Ser 


Acp 
860 


Thr 


Ser 


Leu 


Lys 


lie 


Tyr 


Pro 


Asn 


Pro 


Ala 


Ser 


Tyr Val 


Val 


Arg 


He Glu Gly 


Leu 


Ser 


865 










870 










875 










880 


Arg 


Ser 


Lys 


Ser 


Thr 
885 


He 


Glu 


Leu 


Tyr 


Asn 
B90 


Ala 


Leu 


Gly 


lie 


cys 
895 


He 


Leu 


Arg 


GlU 


Glu 
900 


The 


His 


Ser 


Glu 


Lys 
905 


Thr 


Glu 


He 


Asp 


Val 
910 


Ser 


Arg 


Leu 


Asn 


Asp 


Gly Val 


Tyr 


Leu 


He 


Lys 


Val 


Val 


Gly Gly Asn 


Lye 


Thr 






915 










320 










925 








The 


Thr 
930 


Glu 


Lys 


Val 


Glu 


11 3 

935 


Lys 


Arg 


Pro 















(2 1 IliroHIlATIOM FOR SEQ ID NO: 34 0 

(i! SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 606 amino acids 
(Q I TYPE; amino acid 
(D| TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : YES 

tvii ORIGINAL SOURCE: 

(A) ORGANISM; Porphyromonss gingivalls 

(ix) FEATURE: 

(A) NAME/ KEY: (aisc_f eature 
( 8 ) LOCATION 1 . . . 606 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 34 0 



He 


He 


Phe 


Cys 


Thr 


He 


His 


His 


Ser 


Glu 


Leu 


Glu 


He 


Met 


Asn 


Ser 


1 








5 










10 










15 




He 


I1et 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


Ala 


Leu 


Ser 


Val 








20 










25 










30 






Ser 


Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


GlU 


Thr 


Lys 


Arg 


Pro 






35 










40 










45 








Asp Thr 


Leu 


Arg 


Arg 


Glu 


Leu 


Thr 


He 


Val 


Asn 


Asp 


Gin 


Thr 


Val 


Glu 




50 










55 










60 










Met 


Glu 


His 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu 


Pro 


65 










70 










75 








80 


Arg 


Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg Thr 


Phe 


Gly 


Phe 










85 










90 










95 




Val 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 


Asn 


He 


Leu 








100 










105 










110 






Pro 


Thr 


Glu 


Gly 


His 


Met 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 


lit 


Gly He 






115 










120 










125 








Gly 


His 


Thr 


Lou 


Asn 


Gin 


Arg 


HeL 


Asp 


Ala 


Gly 


Tyr 


Arg 


Leu 


He 


Asp 




130 










135 








140 








Ala 


Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arg 


Gly Met 


Lys 


145 










150 










155 










160 


Ser 


Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp 


Arg 


Axg 










165 










1V0 










175 




Met 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val 


Leu 








180 










185 










190 






Ala 


Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Bh» 


-Aen 


-Asn 


Tyr 


Gly 


Arg 






195 










200 










205 




Gly Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 




210 










215 










22Q. 










Thr 


Pro 


Gin 


Met 


Asp 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu Gly 


225 










230 










235 








2-10 


Ala 


Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


lie 


Asp 


Tyr Arg 


Phe 


Phe 


Arg 










245 










250 










255 




Ser 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Met 


Lys 


Ala 


Leu 


Thr 


Glu 


His 








260 










265 








270 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



Thr 


Pro 


Glu 


Leu 


Asn 


Va 1 


Thr 


Met 


Ser Asn 


Glu Leu 


Ser 


Asp 


Asp 


He 






27 t> 










200 






285 








Lys 


Levi 


Gly 


Val 


Glu 


Val 


Arg 


Thr 


Gly Gly 


Leu Phe 


Phe 


Ala 


Lys 


Asn 


290 










295 






300 










i'er 


Glu 


Het 


lie 


Gin 


Thr 


Gly 


Val 


Leu Ser 


Glu Thr 


Asp 


Arg 


Asn 


Leu 


305 










310 








315 








320 


Tyr 


Tyr 


Val 


Glu 


Gly 


Ala 


Pro 


Thr 


He Gly 


Phe Val 


Gly 


Asp 


Ser 


Asp 








325 








330 








335 




Asn 


Met 


Gin 


Trp 


Asn 


lie 


Gin 


Ala 


Gly Val 


Gly He 


Ser 


Ser 


His 


Phe 








340 










345 






350 






Gly 


Ala 


Lys 


Gly 


Arg 


Leu 


Phe 


Phe 


Trp Pro 


Lys Leu 


Asp 


Ala 


Ser 


Leu 






355 










360 






365 








Ser 


lie 


Phe 


Pro 


ser 


Trp 


Arg 


Val 


Tyr Ala 


Lys Ala 


Phe 


Gly Gly 


Val 




370 










37 5 






380 










He 


Arg 


Asn 


Gly 


Leu 


Ala 


Asp 


Val 


Met Gin 


Glu Glu 


Het 


Pro 


Tyr 


Leu 


385 










390 








395 








400 


Met 


Pro 


Asn 


Thr 


He 


Val 


Leu 


Pro 


Ser Arg 


Asn Ala 


Leu 


Thr 


Ala 


Gin 










405 








410 








415 




Leu 


Gly 


Val 


Lys 


Gly 


Asn 


He 


Ala 


Asp Val 


Val Arg 


Met 


Glu 


Val 


Tyr 








420 










425 






430 






Gly 


Asp 


Phe 


Ser 


Lys 


Leu 


Thr 


Gly 


Val Pro 


Phe Tyr 


Thr 


Pro 


Thr 


Leu 






435 










440 






445 








Pro 


Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr Gin 


Tyr Asn 


Val 


Ser 


Phe 


Leu 




4 50 










4 55 






460 










Pro 


lie 


Tyr 


Ala 


Asp 


Gly 


Ser 


Arg 


Trp Arg 


Ala Gly Gly 


Lys 


I.eu 


Glu 












4-70 








475 








480 


Tyr 


Ser 


Tyr 


Arg 


Asp 


Hot 


L-u 


Arg 


Phe Leu 


Val Asp 


Ala 


Ser 


Tyr 


Gly 










485 








490 








4 95 




Lys 


Tip 


Asn 


Leu 


Asp 


Gly 


Gly 


Leu 


Val Ala 


Ser Met 


Gin 


Pro 




Leu 








500 










505 






510 






lis 


Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His Pro 


lis Ala 


Pro 


Leu 


Asp 


Val 






515 










520 






525 








Arg 


Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Aen 


Gly Arg 


Tyr Arg 


Tyr 


Ser 


Phe 


Gly 




530 










535 






540 










Ser 


Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


He Gly 


Aen Val 


His 


Leu 


Leu 


Ser 


545 










550 








555 








560 


Ala 


Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys Asn 


Leu Ser 


Leu 


Tyr 


Leu 


Lys 










S65 








57 0 








575 




He 


Asp 


Asn 


Het 


Leu 


Ala 


Glu 


Thr 


Thr Glu 


Leu He 


Gly 


Tyr 


Tyr 


Pro 








580 










585 






590 






Met 


Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly Phe 


Ser Trp 


Thr 


Phe 










595 










600 






605 









(2) INFORMATION FOR SEQ XD NO: 341 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

tlx) FEATURE: 

(A) NAME/ KEY: mi.se feature 
60 IB) LOCATION 1...357 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 

65 



70 



75 



Asn 


Ser 


His 


Ala 


Thr 


Val 


Ser 


Val 


He 


Cys 


Ser 


Met 


Met 


Glu 


Lys 


Cys 


1 








5 










10 










15 




He 


Phe 


Ala 


His 


Tyr 


Pro 


His 


Asn 


Leu 


Val 


Phe 


Met 


He 


Arg 


Lys 


His 








20 










25 










30 




Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Ser 


Leu 


Val 


Phe 


Ser 


Ala 


Gly 


Ala 






35 










40 










45 






Gin 


Gin 


Glu 


Lys 


Gin 


Val 


Phe 


His 


Phe 


Leu 


Asn 


Leu 


Pro 


Ala 


Thr 


Ala 




50 










55 










60 










Gin 


Ala 


Leu 


Ala 


Ala 


Gly Gly 


Lys 


Ala 


He 


Thr 


He 


Val 


Asp Asp 


Asn 


65 










70 










75 










80 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Lou 


Leu Gly Tyr 


Glu 


Ser 


Gly 










85 










90 










95 
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10 



15 



20 



25 



30 



35 



55 



60 



65 



70 



75 



Gly Arg 


Ala 


Phe 


Leu 


Ser 


Tyr 


Leu 


Tyr 


Tyr 


Iter. 


Ser 


Gly 


ser 


His 


Met 








100 










105 










110 






Gly Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val 


Gly 


Glu 


Arg 


Gly Met 


Trp 


Gly 






115 










120 










125 








Val 


Gly 


Met 


Arg 


Phe 


Leu 


Asn Tyr 


Gly 


Ser 


Met 


Gin 


Gly 


Tyr 


Asp 


Gin 




130 










135 








140 












Ala 


He 


Ala 


Thr 


Gly Ser 


Phe 


Ser 


Ala 


Ser 


Asp 


He 


Ala 


Val 


Gin 


1-15 










150 










155 










160 


Gly 


Phe 


Tyr 


Ser 


Hif* 


Glu 


Leu 


Ser 


Asn 


His 


Phe 


Arg 


Gly Gly Val 


Ser 










165 










170 










175 




L & u 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


He 


Glu 


Thr 


Tyr 


Ser 


Ser 


Phe Gly Leu 








180 










185 










190 






Gly 


Val 


Asp 


Val 


Gly lie 


Ser 


Tyr 


Tyr 


ASp 


Asp 


Asp 


Lys 


Gly 


Tyr 


Ser 






195 










200 










205 








Ala 


Ser 


Ala 


Leu 


Phe 


Lys 


Asn 


Val 


Gly 


Ala 


Gin 


Leu 


Lys 


Gly Tyr Asn 




210 










215 










220 








Glu 


Glu 


Arg 


GlU 


Pro 


Leu 


Asp Trp Asp 


Phe 


Gin 


Leu 


Gly 


Phe 


Ser 


Arg 


225 










230 










235 










240 


Ser 


Phe 


He 


A5n 


Ala 
245 


Pro 


Phe 


Arg 


Leu 


His 

250 


He 


Thr 


Leu 


Phe 


Asn 
255 


Leu 


Asn 


Pro 


His 


Tyr 
260 


Phe 


Lys 


Arg 


Leu 


Val 
265 


Pro 


Arg 


Asp 


Leu 


Ser 
270 


Lys 


Met 


Gin 


Lys 


Phe 


Leu 


Arg 


His 


Phe 


Ser 


He 


Gly Alo 


Glu 


Phe 


Thr 


Pro 


Ser 






275 










280 










285 








Glu 


Arg 


Phe 


Trp 


Val 


Gly 


Leu 


Gly Tyr Thr 


Pro 


Gin 


He 


Ala 


Gin 


Asp 




290 










295 










300 










Phe 


Glu 


Vol 


Glu 


Gly 


Gly 


Asn 


Lys 


Trp 


Gly 


Gly 


Leu 


Ser 


Ala 


Gly 


Val 


305 










310 










315 








320 


Gly 


Phe 


Thr 


Ser 


Gly 


Val 


Val 


Arg 


Val 


Gly Val 


Ser 


Ala 


Ala 


Thr 


Tyr 










325 










330 










335 




His 


Pro 


Ala 


Ala 
340 


Leu 


Ser 


Phe 


Met 


Cys 
345 


Ser 


Val 


Gly 


He 


Arg 
350 


Leu 


Asp 


Asp 


Lys 


Ser 
355 


He 


Phe 

























(2i INFORMATION FOR SEQ ID NO: 34 2 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH : 337 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (Hi) HYPOTHETICAL: YES , 

(vx) ORIGINAL SOURCE: 

<A) ORGAlUSl-l: Porphyromonas gingivalis 

50 (ix) FEATURE: 

(A) MAME/KEY: misc_feature 
(D) LOCATION 1...337 



Ui) SEQUENCE DESCRIPTION; SEQ ID HO: 342 



Pro 


Gin 


Pro 


Val Gly Leu 


Lys 


Glu 


He 


Thr 


He 


Lys 


Pro Hot 


Cys 


Leu 


1 






5 










10 








15 




GlU 


Pro 


He 


He Ala 


Pro 


He 


Ser 


Ser 


Glu 


Leu 


Leu 


Glu Gin 


Glu 


Leu 








20 








25 








30 






Thr 


Ala 


Asp 


Arg Phe 


Leu 


Arg 


Met 


Thr 


Asn 


Lys 


Ala 


Gly Asn 


Glu 


He 






35 








40 








45 






Tyr Val 


Phe 


Thr Ala 


Glu 


Glu 


Ala 


Pro 


His 


Cys 


Met 


Lys Glu 


Val 


Gly 




SO 








55 










SO 




Arg 


Leu 


Arg 


Glu Glu 


Ala 


Phe 


Arg 


His 


Tyr 


Gly 


Gly 


Gly Thr 


Gly 


Lys 


65 








70 










75 








80 


Ala 


He 


Asp 


He Asp 


Glu 


Phe 


Asp 


Thr 


Met 


Pro 


Gly 


Ser Tyr 


Lys 


Gin 








65 










90 








9b 




l.ou 


He 


Val 


Trp Asp 


Pro 


Gin 


Asn 


Lys 


Ala 


lie 


Leu 


Gly Gly 


Tyr 


Ary 








100 








105 








110 






Phe 


He 


Tyr 


Gly Arg 


Asp 


Val 


Ala 


Phe 


Asp 


Thr 


Asp 


Gly Lys 


Pro 


Leu 






115 








120 










125 






Leu 


Ala 


Thr 


Ala Glu 


Met 


Phe 


Arg 


Phe 


Sec 


Asp 


Ala 


Phe Leu 


His Asp 




130 








135 








140 








Tyr 


Leu 


Pro 


Tyr Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val Ser 


Leu 


Gin 


145 








150 








155 








160 
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Tyr 


Gin 


Ser 


Thr 


Arg 


Met 


Gly Thr 


Lys 


Ala 


He 


Phe 


Val 


Leu 


Asp 


Asn 










165 










170 










175 




Leu 


Trp 


Asp 


Gly 


Tie 


Gly Ala 


Leu 


Thr 


Val 


Val 


Asn 


Pro 


Glu 


Ala 


Leu 








180 










185 










190 






Tyr 


Phe 


Tyr 
195 


Gly 


Lys 


Val 


Thr 


Met 
200 


Tyr 


Lys 


Asp 


Tyr 


Asp 
205 


Arg 


Arg 


Ala 


Arg 


Asn 


Leu 


Tie 


Leu 


Tyr 


Phe 


Leu Arg 


Lys 


His 


Phe 


Ser Asp 


Pro 


Glu 




210 










215 










220 










Gly 


Leu 


Val 


Lys 


Pro 


He 


His 


Pro 


Leu 


Pro 


He 


Glu 


He 


Ser 


Ala 


Glu 


225 










230 










235 










240 


Asp 


Glu 


Alo 


Leu 


Phe 


Ser 


Ser 


Ser 


Asp 


Phe 


Asp Thr Asn Tyr 


Lys 


Thr 










245 










250 










255 




Leu 


Asn 


lie 


Glu 


Val 


Arg 


Lys 


Leu 


Gly He 


Asn 


He 


Pro 


Pro 


Leu 


Val 








260 










265 










270 






Ser 


Ala 


Tyr 


Tie 


Ala 


Leu 


Ser 


Pro 


Glu 


Met 


Arg 


Val 


Phe 


Gly 


Thr 


Ala 






275 










280 








285 








Val 


Asn 
290 


Glu 


Ser 


Phe 


Gly 


Glu 
295 


Val 


Glu 


Glu 


Thr 


Gly 
300 


He 


Phe 


Tie 


Ala 


Val 


Gly 


Lys 


He 


Leu 


Glu 


Glu 


Lys 


Lys 


Gin 


Arg 


His 


He 


Glu 


Ser 


Phe 


305 










310 










315 










320 


lie 


Leu 


Ser 


Arg 


Asn 


Glu 




Lys 


Gly Leu Asp 


Ser 


Ser 


Asn 


Gly 


Arg 










325 










330 










335 




Ser 

































(2/ IN FORI IATION FOR SEQ ID HO; 343 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

( B) TYPfcl: amino acid 
<D> TOPOLOGY: linear 

(ill MOLECULE TYPE; protein 

Uii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE; 

(A] NAME/ KEY : miscfeature 

(B) LOCATION 1. . .566 



Cxi) SEQUENCE DESCRIPT ION: SEQ ID NO: 34 3 



Tyr 


Asp 


Gly Ala 


Arg 


Leu 


Val 


Tyr 


Thr 


Leu 


Phe 


Arg 


Asn Arg Asn Asp 


1 






b 










10 








15 


He 


His 


Pro Met 


Lys 


Thr 


He 


Val 


Arg 


Tyr 


Ser 


Arq 


Leu Pro 


Val Ala 






20 










25 








30 




Leu 


Phe 


Phe Cys 


Leu 


Leu 


Gly 


Ala 


Val 


His 


Leu 


Ser 


Val Glu 


Ala Gin 






35 








40 










45 




Met 


Leu 


Asn Thr 


Pro 


Phe 


Glu 


Leu 


Ser 


Asp 


Gin 


He 


Val Leu 


Ser Pro 




50 








55 








60 






Thr 


Glu 


Arg Gin 


Tyr 


Arg 


Glu 


He 


Cys 


Val 


Gin 


Thr 


Lys Glu 


Ly3 Arq 


65 








70 










75 






B0 


Gly Ala 


Asp Leu 


Phe 


Pro 


Leu 


Ser 


Asp 


Lye 


Leu 


Arg 


Asp Ser 


Al a Tyr 








85 










90 








95 


Val 


Arg 


Phe Gly 


Ser 


Ala 


Tyr 


Gly 


Asp 


He 


Ala 


Gly 


Asp Tyr 


Leu Pro 






100 










105 








110 




Tyr 


Asn 


Gly Asn Asn 


Tyr 


Ser 


Ser 


Leu 


Ser 


Leu 


Glu 


Ser Gly Gly Arg 






115 








120 










125 




He 


Ser 


Val Arg 


Asn 


Tyr 


Gly 


Thr 


Leu 


Gin 


Gly 


Ser 


Ala Ser 


Tyr Ser 




130 








135 










140 






Arg 


Gly 


Met His 


Lys 


Arg 


He 


Gly 


Trp 


Asn 


Ala 


Leu 


Arg Asn 


Ala Glu 


145 








150 










155 






160 


Ala 


Tyr 


Tyr Pro 


Tyr 


Leu 


Val 


Ser 


Asp 


Sen 


Thr 


Gly 


Gly Asp Tyr His 








165 










170 








175 


Phe 


Glu 


Asp Tyr 


Arg 


Leu 


Ala 


Gly 


Tyr 


Tyr 


Ser 


Phe 


Arg Ala 


Gly Arg 






180 










185 








190 




Leu 


Pro 


Leu Gly 


lie 


Gly 


Phe 


Ser 


Tyr 


Axg 


Gly 


Glu 


Val Ala 


Tyr Arg 






195 








200 










205 




Leu 


Thr 


Asp Pro 


Arg 


Thr 


Thr 


Asn 


Thr 


Thr 


Gly 


Ala 


Leu Glu 


Leu Ser 




210 








215 










220 






Cys 


Ala 


Thr Ser 


Leu 


Thr 


Leu 


Pro 


Arg 


Glu 


A.-*n 


Arg 


Leu Ser 


Leu Ser 


22S 








230 










235 






240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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272/490 

Ala Ala ryr Leu Tyr Hi. Arg Gin Hi, Leu Thr Gin Tyr Asn Trp Ar, 
Pro Gly Gl„ Gin A ., p Ly , phs phe ^ »0 ^ ^ ^ ^ 255 ^ 

5 Asp Val ser Asn Ser Pro Ile T Phe Gly He Ser flrg ™ Asn Tyt 

280 00 1 

Val Asn Gly Trp Lys Leu Ser Ser Arg Leu Asp Thr Ar, Arg Gly Asp 

10 3oS "* ASP l{ r 0 SBC aly Phe L «» Thr Glu Olu Arg 

Ser Ser lie ten Leu Phe M , Leu Leu Tyr ^ ^ ^ ^ 320 

Gly Ser Trp His Leu Scr Asp Phe Asp Phe Ser Phe Ser Ala A^p Tyr 
15 A1 a Leu Arg Gin Gly Ile Glu Arg Ire Tyr Glu Asp Tyr Lys Pro Asp 

Asp Asn Tyr His He Tyr Asp Leu Arg lie Leu Ala Ue Arg Arg Trp 
J / o 380 
20 385 ° 1U T 9 l Ala G1 " G1 " ^a Ser Tyr Arg Ile 

Arg Thr Asp Arg Gly Cys Ala Leu Arg Val Ser Ala Gly Ser Asp Phe 

Tyr Gly Tyr Asp Glu Thr Tyr Arg Lys «ts Gly His hi. Thr Met Ser 
25 Gly Met Leu Arg Pro P he Ala a, rle Ala Tyr Asp His Ma Gly Ser 

Lys Leu Asp Phe Gly Leu Ser Leu Ser Ala Ala Tyr Arg Het Val Lc. 

455 460 
30 «s " i= S " ^ "0 " e G1 " Ly£ Glu Gln L =" Asp Tyr 

01.. Leu A: a Tyr Leu Pro Tyr Ala Tyr Arg Asn Arg Glu Gly Val Gru 

490 . _ — 

val Arg Ser Ser Leu Tyr Val SEr Pro Met Gln As „ Thr «» Arg 

35 Leu Met Thr Glu Leu Arg Leu Tyr Gly Asp Leu Met Lys Irg Lys Asp 

Gly lie Ala Tyr Gly Lys Thr Pro Gly V.1 Ile Ser 2ff He Leu Ser 

535 540 
_ Thr Ser Gly His Thr 
550 555 
Phe 

565 

(2) INFORMATION FOR SEQ ID HO: 34 4 



40 



45 



50 



540 

545 P " ^ G1U Th ' «y Hi. Thr He Gly Ala lie Cys 

Asn Ile s er Tyr Leu Phe 555 5&0 



(1) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 819 amino acids 

(B) TYPE: amino acid 
iD) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 

55 (vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 
tlx) FEATURE: 
Rn tA) NAME/ KEY : misc feature 

m tB) LOCATION 1...8I9 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:344 

65 Arg Thr A,n Val Phe Leu Ser Leu Ser His Lys H e Gly ^ Arg Gly 
Ala Ser Cys Ser Asn Arg Asn Ala Trp Met Ala Glu A-n Lys Pro Ser 
Ser Pro Glu Jro Asp Asn Thr Gly Val Gly Asn Ser Pro S~r Asp Tyr 

70 Leu „j, G1 y fllu AIa Ile T1 p ° o pro ^ u ^-« u ^ 



75 



Asn Asp Lys Arg Phe Met Lys Lys Leu His Me, He Ala Ala Leu Ala 

Val Leu Pro Phe Cys Leu Thr Ala Gln Ala Pro Val Ser Asn Ser Glu 

90 
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He 




Ser 


Leu 


Ser 


Acn 


Val 


i» n 




















Arg 


A3 a 


Thr 


Ala 


Lys 


Thr 








115 










120 


Ala 


Glu 


LeU 


Ser 


Lys 


Scr 


Asn 


Tyr 




130 










135 




MeL 


Leu 


Thr 


Pro 


Ser 


Val 


Val 


Ala 


14 5 










150 






GJ y 


Tyr 


er 


Gly 


Phe 


Arg 


Val 


Arg 










165 








He 


THc 


Thr 


Ann 


Gly Val 


Pro 


Leu 








180 










Phe 


Trp 


Val 


Asn 


Met 


Pro 


Asp 


Phe 






1 95 










200 


Val 


Gin 


Arg 


Gly Val 


Gly 


Thr 


Ser 




21 0 










215 




Ala 


Ser 




Asa 


Mel. 


Arg 


Thr 


Asp 


22 5 










230 






Arg 


Val 


ASp 


Leu 


Ser 


Gly Gly 


Ser 










245 








Lys 


Leu 


Gly 


Ser Gly Arg 


He 


Gly 








260 










Leu 


Ser 




He 


Gly 


ser 


Asp 


Gly 






2 7 j 










280 




Lys 


Ser 


Tyr 


Phe 


Ala 


Gin 


Val 




290 










295 




L-u 


Axy 


Phe 


Tie 


Thr 


Phe 


Gly Gly 


305 










310 






Asn 


Gly 


Leu 


Ser 


Lys 


Glu 


Asp 


GlU 










325 








Ser 


Ala 


ill \# 


Leu 


Met 


Tyr 


Val 


Asp 








340 










His 


Asn 


Thr 


Asp Asn 


Tyr 


Glu 


Gin 
















360 


Hi s 


Scr 


Phe 


Ser 


Pro 


Ser 


Val 


He 




370 










375 




Ala 


Gly 


Tyr 


Gly Tyr Thr Asp Glu 


385 










390 






Glu 


Tyr 


3 


Leu 


Gin 


Pro 


Tyr Val 










405 








Thr 


As 


eu 


lis 


Arg 


Gin 


Lys 


Tyr 








420 










He 


Gly 


Ser 


Leu 


Asn 


Trp 


His 


Thr 






4 35 










440 


Ala 




toll 


Asn 


He 


Tyr 


Lys 


Gly 




4 50 










455 




lie 


Lys 




Tyr 


Asn 


Gin 


Pro 


Leu 


4 65 










470 








Arg 


Ala 


Asp 


Lys 


Arg 


Glu 


Gly 










485 








Gin 


He 


Thr 


Pro 


Glu 


Leu 


Asn 


Met 








500 










He 


Gly 


Tyr 


Thr 


lie 


Asn 


Gly 


He 






515 










520 


Gly 


Ser 


Met 


Gin 


His 


He 


Asp 


Leu 




530 










535 




Pro 


Lys 


Ala 


Gly Leu Thr Tyr 


Ser 


545 










550 






Ala 


Ser 


Val 


Ala 


Val 


Ala 


His Arg 










565 








Glu 


Ala 


Gly 


He 


Gly Gin 


Tyr 


Pro 








580 










Glu 


Leu 


Gly 


Tyx 


Arg 


Tyr 


Ala 


Ser 






595 










600 


Leu 


Tyr 


Tyr 


Met 


Gin 


Tyr 


Lys 


Asp 




610 










615 




Scr 


Asp 


Val 


Gly Gin 


Met 


Leu 


Thr 


625 










630 






Il*t 


Gly 


Leu 


Glu 


Leu 


Thr 


Leu 


Gly 










645 








Arg 


Trp 


Asp 


Ala 


Ser 


Phe 


Thr 


Met 








660 










Val 


Gin 


Tyr 


Thr 


Ser 


Val 


Tyr 


Asp 






67 5 










660 


Lys 


Glu 


Glu 


Thr 


Leu 


Glu 


Ser 


Thr 



273/490 



Leu 


Gin 


Thr 


Val 


Gin 


Val 


Val 


Ala 


105 










110 






Val 


Ala 


T.yr 


Thr 


Asn 


Val 


Arg 


1 \j r- 

L.y^ 










125 








Gly 


Arg 


sp 


He 


Pro 


Tyr 


Leu 


eu 








140 










Thr 


Ser 


Asp 


Ala 


Gly Thr 


Gly 


He 






1 55 










160 


Gly Thr 


Atp 


Ala 


Asn 


Arg 


He 


Asn 




170 










175 




Asn 


Asp 


Ser 


Glu 


Ser 


Gin 


Ser 


Val 


185 










190 






Ala 


Ser 


S*"*r 


He 


Glu 


Asp 


Leu 












205 








Thr 


Asn 


Csi. y 


Ala 


Gly 


Ala 


Phe 


vi a y 








220 










Asn 


Leu 




Leu 


Ala 


Pro 


Tyr 


Gly 






235 












Phe 


Gly 


Thr 




Arg 


Arg 


Ser 


Va 1 




250 










255 




Arg 


His 


Trp 


Ala 


Val 


Asp 


Ala 


Arg 


265 










270 






Tyr 


Val 


Asp 


Arg 


Gly Ser Val 


Asp 










285 








Gly 


Tyr 


Phe 


Gly 


Ser 


Asn 


Thr 


Ala 








300 










Lys 


Glu 


Val 


Thr 


Gly 


He 


Ala 


Trp 






315 










320 


Ala 


Lys 


Tyr 


Gly 


Arg 


Arg 


Tyr 


Asn 




330 










335 




Ala 


Gin 


y 


Val 


Pro 


Hie 


Tyr 


Tyr 


345 










350 






Arg 


His 


Tyr 


His 


Ala 


He 


Met 


Thr 










365 








Leu 


Asn 


Leu 


Thr 


Ala 


His 


Tyr 


Thr 








380 










Tyr 


Arg 




Gly 


Arg 


Lys 


Leu 








395 










don 


Glu 


Asn 


Ser 


Val 


Thr 


Val 


Lys 


ys 




410 










415 




Leu 


Asp 


n 


p 


Phe, Gly Gly 


eu 


425 










430 






Gly Ala 


*■ r P 




Leu 


Gin 


Phe 












445 








Asp 


His 


Phe 


Gly 


Arg 


He 


Thr 


i yr 








4 60 










Ala 


Pro 


A«=-p 


Phe 


Glu 


Tyr 


Tyr 


Arg 






475 










4 80 


Ala 


Ala 


Phe 


/via 


Lys 


Ala 


Asn 


Trn 




4 90 










495 




Tyr 


Ala 


Asp 


Leu 


Gin 


Tyr Arg 


Thr 


505 










510 






Thr 


Asp 


Glu 


Tyr 


Asp Glu 


Val 


Gin 










525 








Asp 


Lys 


Thr 


Phe 


Arg 


Phe 


Leu 


Asn 








540 










Phe 


Asp 


Asp 


Ala 


His 


Thr 


Ala 


Tyr 






555 










560 


Glu 


Pro 


Asn 


Arg 


Thr 


Asn 


Tyr 


Thr 




570 










575 




Thr 


Pro 


Glu 


Arg 


Leu 


He 


Asp 


Tyr 


585 










590 






Pro 


Leu 


Leu 


Ser 


Ala 


Gly 


Val 


Giy 










605 








Gin 


Leu 


Val 


Leu 


Asp Gly Arg 


Leu 








620 










Sec 


Asn 


Val 


Pro 


Asp 


Ser 


Tyr 


Arg 






635 










640 


Trp 


Gin 


He 


Leu 


Pro 


Arg 


Leu 


Leu 




650 










655 




Ser 


Arg Asn 


Lys 


lie 


Asp 


Arg 


Tyr 


665 










67 0 






Ala 


Asp Tyc 


Asn 


Trp Leu Glu 


Leu 










685 








Asp 


He 


Ala 


Tyr 


Ser 


Pro 


Asn 


Vol 
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690 








695 




He 


Ala 


Gly Ser 


Met 


Leu 


Thr 


Leu 


"705 








710 






Trp 


Thr 


Ser Arg 


Phe 


Val 


Ser 


Lys 








725 








Ser 


Asp 


Arg Met 


Leu 


Ser 


Ser 


Tyr 






740 










Tyr 


Val 


Leu Pro 


Val 


Ills 


Phe 


Val 




755 








760 


Leu 


Asn 


Asn Leu 


Phe 


Asn 


Leu 


Met 




770 








775 




Asp 


Ala 


Gly Tyr Val 


Gin 


Ala 


Ser 


785 








7 90 






Leu 


Arg 


Tyr Tyr 


Pro 


Gin 


Ala 


Gly 








005 








He 


Asp 


Phe 

















700 










Ser 


His 


Ala 


Gly 


Phe 


Glu 


Met 


Ala 






715 










720 


Gin 


Tyr 


Leu 


Asp 


Asn 


Thr 


Gin 


Arg 




730 










735 




Trp 


Val 


Asn 


Asp 


Leu 


Arg 


Leu 


Gly 


745 










750 






Lys 


Axg 


Val 


Ala 


Leu 


Gly Val 


Gin 










765 








Tyr 


Ala 


Ser 


Asn 


Ala 


Tyr 


lie 


Tyr 








780 










Gly 


Glu 


Leu 


Ser 


Ala 


Tyr 


Ala 


Asp 






7 95 










BOO 


Phe 


Asn 


Ala 


Leu 


Gly Scr 


Leu 


Thr 




810 










015 





(2) INFORMATION FOR SEQ ID WO: 34 5 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 532 amino acids 
IB) TYPE: amino acid 
(Dt TOPOLOGY: linear 

{ii J MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonas gingivalis 

(lx) FEATURE: 

(A) NAME/KF.Y: misc feature 

(B) LOCATION 1. . .532 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 34 5 



Tyr 


Arg 


Arg 


Ser 


Gly 


Arg 


Val 


Cys 


Pro 


Arg 


He 


Leu 


Arg 


Asn 


Lys 


Arg 


1 








5 










10 










15 




Ser 


Tyr 


Ala 


Asp 


Thr 


Arg 


Leu 


Gin 


Pro 


Arg 


Arg 


Ala 


Val 


Arg 


Leu 


Arg 








20 










25 










30 






Ala 


Gly 


Thr 


Arg 


Thr 


Lys 


Met 


Lys 


Arq 


Arg 


Phe 


Leu 


Ser 


Leu 


Leu 


Leu 






35 










40 










45 








Leu 


Tyr 


lie 


Leu 


Ser 


Ser 


He 


Ser 


Leu 


Ser 


Ala 


Gin 


Arg 


Phe 


Pro 


Met 




SO 










55 










60 










Val 


Gin 


Gly 


lie 


Glu 


Leu 


Asp 


Thr 


Asp 


Ser 


Leu 


Phe 


Ser 


Leu 


Pro 


Lys 


65 










70 










75 










80 


Arg 


Pro 


Trp 


Arg 


Ala 


He 


Gly 


Lys 


Thr 


lie 


Gly 


Val 


Asn 


Leu 


Ala 


Val 










85 










SO 










95 




Trp Gly 


Phe 


Asp 


His 


Phe 


He 


Met 


Asn 


Glu 


Asp 


Phe 


Ala 


Asp 


He 


Ser 








100 










105 










110 






Trp 


Gin 


Thr 


lie 


Lys 


Ser 


Asn 


Phe 


Gin 


Thr 


Gly 


Phe 


Gly Trp Asp Asn 






115 










120 










125 








Asp 


Lys 


Phe 


Val 


Thr 


Asn 


Leu 


Phe 


Ala 


His 


Pro 


Tyr 


His 


Gly Ser 


Leu 




130 










135 










140 










Tyr 


Phe 


Asn 


Ala 


Ala 


Axg 


Ser 


Asn 


Gly 


Leu 


Ser 


Phe 


Arg 


His 


Ser 


Ala 


145 










150 










155 










160 


Pro 


Phe 


Ala 


Phe 


Phe 


Gly 


Ser 


Leu 


Met 


Trp 


Glu 


T.eu 


Leu 


Met 


Glu 


Asn 










165 










170 










175 




Glu 


Pro 


Pro 


Ser 


lie 


Asn 


Asp 


Leu 


Cys 


Ala 


Thr 


Thr 


He 


Gly Gly He 








180 










185 










190 






Ala 


Leu 


Gly 


Glu 


Met 


Gly 


His 


Arg 


Leu 


Ser 


Asp 


Leu 


Leu 


He 


Asp Asn 






195 










200 










205 








Arg 


Thr 


Thr 


Gly 


Trp 


Glu 


Arg 


Met 


Gly 


Arg 


Glu 


Val 


Ala 


lis 


Ala 


Leu 




210 










215 










220 










lie 


Asn 


Pro 


Met 


Arg 


Phe 


Leu 


Asn 


Acg 


Leu 


Thr 


Ala 


Gly Glu 


Val 


Thr 


225 










230 










235 










240 


Ser 


Val 


Gly 


Ser 


Arg 


Ser 


Gly 


Gin 


He 


Phe 


Gin 


Ser. 


.Val 


Pro 


He 


Asn 










245 










250 










255 




lie 


Val 


Val 


Asp 


Ala 


Gly 


Phe 


Arg 


Phe 


Leu 


Ala 


Asp 


Lys 


Arg 


His 


Ala 








260 










265 










27 0 






Arg Thr 


Gly 


Ala 


Thr 


Ala 


Leu 


Thr 


Leu 


Asn 


Leu 


Arg 


Phe 


Asp Tyr 


Gly 






275 










280 










285 






Asp 


Pro 


Phe 


Arg 


Ser 


Glu 


Thr 


Phe 


Ser 


Pro 


Tyr 


Asp 


Phe 


Phe 


Gin 


Phe 
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290 










295 










300 










Lys 


Ala 


Gly 


Leu 


Ser 


Phe 


Sec 


Glu 


Ser 


Gin 


Pro 


Leu 


Leu 


Ser 


Gin 


He 


305 










310 










315 










320 


Asn 


Leu 


He 


Gly He 


Leu 


Ser 


Gly Cys 


Gin 


Leu 


Leu 


Ala 


His 


Glu 


Arg 










325 










330 










335 




Thr 


Val 


Leu 


Val 
340 


Gly 


Gly 


Leu 


Phe 


Gin 
345 


His 


Phe 


Asp 


Tyr 


Tyr 
350 


Asn 


Ser 


Glu 


Lys 


Arg 

355 


He 


Scr 


Lys 


Asn 


Ser 
360 


Glu 


Glu 


Val 


Leu 


Val 
365 


Thr 


Pro 


Tyr 


Arg 


He 


Ser 


Gin 


Val 


Ala 


Ala 


Leu 


Gly Gly Gly 


Leu 


lie 


Phe 


Gin 


His 




37 0 










375 










380 










His 


Gly 


Lys 


Phe 


Arg 


Arg 


Arg 


Pro 


Leu 


Glu 


Leu 


Tyr 


Ala 


Glu 


Thr 


Tyr 


385 










390 










395 










400 


Leu 


Asn 


Val 


Val 


Pro 


Met 


Gly Ala 


Ser 


Leu 


Ser 


Asp 


His 


Tyr Asn Val 










405 










410 










415 




Asp 


Asn 


Arg 


Asp Tyr 


Asn 


Leu 


Gly Ser 


Gly Leu 


Ser 


Gly Lys 


Leu 


Tyr 








420 










425 










430 






Leu 


Gly 


Ala 


Thr 


Tyr 


Asn 


Asp 


Leu Trp 


Ser 


Trp 


Leu 


Leu 


Gly 


Val 


Glu 






435 










440 










445 








Ser 


Tyr 


Arg 


Leu 


Tyr 


Thr 


Trp 


He 


Giy Tyr 


Glu 


Glu 


Pro 


His 


Gin 


Lys 




450 










4 55 










460 










Asn 


Thr 


Asp 


Val 


Ser 


Ser 


Phe 


Met 


Val 


Gin 


Gly 


Asp 


Glu 


Ser 


Lys 


Ala 


465 










470 










475 










4B0 


Arg 


Leu 


Leu 


Val 


Thr 


Ser 


Ser 


Glu 


Phe 


Ala 


Phe 


His 


Pro 


Gly 


Pro Trp 










485 










490 










4 95 




His 


Val 


Ala 


He 
SOO 


Val 


Ala 


Arg 


Arg 


Phe 
505 


He 


Arg 


T.ys 


Thr 


Ala 
510 


Tyr 


Gin 


Ph«? 


Tyr 


Pro 


Asn 


Val 


ser 


Phe 


Asp 


Thr 


Gly Asp 


lie 


Gin 


Leu 


Arg Val 






515 










520 










525 








Gly 


Phe 
530 


His 


Phe 



























(2) INFORMATION FOR SEQ ID NO: 34 6 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH t 300 amino acids 
W TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

till) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION i.,.300 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 



Lys Arg 


Arg 


Lys 


Lys 


Gin 


Met 


Lys 


Arg 


Leu 


He 


Val 


Phe 


Leu 


Ala Met 


1 






5 










10 










15 


Gly Gly Lsu 


Leu 


Phe 


Thr 


Leu 


Ala 


Asn 


Ala 


Gin 


Glu 


Ala 


Asn 


Thr Ala 






20 










25 










30 




Ser A3p Thr 


Pro 


Lys 


Lys 


Asp 


Trp 


Thr 


He 


Lys 


Gly 


Val 


Thr 


Gly Leu 




35 










40 










45 






Asn Ala 


Ser 


Gin 


Thr 


Ser 


Leu 


Thr 


Asn 


Trp 


Ala 


Ala 


Gly Gly Glu Asn 


50 










55 










60 








Thr Val 


Ala 


Gly 


Asn 


Leu 


Tyr 


Leu 


Asn 


He 


Asp 


Ala 


Asn 


Tyr 


Leu Lys 


65 








70 










75 








80 


Asp Lys 


Trp 


Ser 


Trp 


Asp 


Asn 


Gly 


Leu 


Axg 


Thr 


A.sp 


Phe 


Gly Leu Thr 








8b 










90 










95 


Tyr Thr 


Thr 


Ala 


Asn 


Lys 


Trp 


Asn 


Lys 


Ser 


Val 


Asp 


Lys 


He 


Glu Leu 






100 










105 








110 




Phe Thr 


Lys 


Ala 


Gly 


Tyr 


Glu 


He 


Gly 


Lys 


His 


Trp 


Tyr 


Gly 


Sar Ala 




115 










120 










125 






Leu Phe 


Thr 


Phe 


Leu 


Ser 


Gin 


Tyr 


Ala 


Lys 


Gly 


Tyr 


Glu 


Lye 


Pro Ser 


130 










135 










14TJ 








Asp His 


Leu 


Thr 


Gly 


Val 


Ly3 


His 


He 


Ser 


Asn 


Phe 


Phe 


Ala 


Pro Ala 


145 








150 










155 








160 


Tyr Leu 


Thr 


Leu 


Gly 


He 


Gly 


Ala 


Asp 


Tyr 


Lys 


Pro 


Asn 


Glu 


Lyst phe 








155 










170 










175 


Ser Leu 


Tyr 


Leu 


Scr 


Pro 


Thr 


Thr 


Gly 


Lys 


Leu 


Thr 


Val 


Val 


Ala Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



276/ 490 



ISO 185 190 



Asp 


Tyr 


Leu 


Ser 


Ser 


Leu 


Gly 


Ala 


Phe 


Gly 


Val 


Lys 


Val 


Gly Glu 


Lys 






195 










200 










205 








Thr 


Met 


Phe 


Glu 


Leu 


Gly 


Ala 


Leu 


Va] 


Val 


Gly Set 


Ala 


Asn 


He 


Asn 




210 










215 










220 










Leu 


Met 


Glu 


Asn 


Val 


Asn 


Leu 


lie 


Thr 


Lys 


Ala 


Ser 


Phe 


Phe 


Ser 


Ala 


225 










230 










235 










240 


Tyr Thr 


His 


Asp 


Phe 


Gly 


Asn 


He 


Asp 


He 


Asn 


Trp 


Glu 


Ala 


Met 


Leu 










245 










250 










255 




Ala 


Met 


Lys 


lie 
260 


Asn 


Lys 


Phe 


Lea 


Thr 
265 


Ala 


Thr 


lie 


Ala 


Thr 
270 


Asn 


Leu 


lie 


Tyr 


Asp 
275 


Asp 


Asp 


Val 


Lys 


lie 

zao 


Asn 


Asp 


Gly 


Pro 


Lys 
265 


He 


Gin 


Phe 


l.ys 


Glu 


Val 


Val 


Gly 


Val 


Gly 


Val 


Ala 


Tyr 


Thr 


Phe 











290 295 300 



(2) INFORMATION FOR SEQ ID NO: 34 7 

(1) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 
(BJ TYPE: amino acid 
ID] TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE; 

(A) NAME /KEY ; misc_feature 

(B) LOCATION 1 . . .221 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 



Thr 


Arg 


Glu 


Ser 


Val 


Leu 


His 


Cys 


Arg 


Thr Lys 


Leu 


Lys 


Lys 


Glu 


Arg 


1 








5 










10 








15 




Lys 


Met 


Lys 


Lys 


Met 


He 


Leu 


Ala 


Ala 


Thr Met 


Leu 


Leu 


Ala 


Thr 


He 








20 










25 








30 






Gly 


Phe 


Ala 


Asn 


Ala 


Gin 


Ser 


Arg 


Pro 


Ala Leu 


Arg 


Leu 


Asp 


Ala 


Asn 






35 










40 








45 








Phe 


Val 


Gly 


Ser 


Asn 


Leu 


Met 


Gin 


Lys 


Val Ala 


Asn 


Thr 


Ser 


Val 


Asn 




50 










55 






60 










Asn 


Lys 


Met 


lie 


Val 


Gly 


Leu 


Arg 


Val 


Gly Ala 


Ala 


Ala 


Glu 


Phe 


Ala 


65 










70 








75 










80 


Leu 


Ser 


Asn 


Asp 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro Gly 


Leu 


Ala 


Tyr 


Thr 


Met 










95 










90 








95 




Arg 


Gly 


Ala 


Lys 


Met 


Glu 


Ser 


Leu 


Ser 


Glu Thr 


Thr 


Thr Arg 


Leu 


His 








100 










105 








110 






Tyr 


Leu 


Gin 


lie 


Pro 


Val 


Asn 


Ala 


Gly Met Arg 


Phe 


Ser 


Phe 


Ala 


Asp 






115 










120 








125 






Asn 


Met 


Ala 


He 


Ser 


Leu 


Glu 


Ala 


Gly 


Pro Tyr 


Phe 


Ala 


Tyr 


Gly Val 




130 










135 








140 








Ala 


Gly Thr 


He 


Lys 


Thr 


Lys 


Val 


Ala 


Gly Val 


Thr 


Ala 


Ser 


Val 


Asp 


145 










150 








155 










160 


Ala 


Phe 


Gly 


Asp 


Asn 


Gly 


Tyr 


Asn 


Arg 


Phe Asp 


Leu 


Gly Leu Gly Leu 










165 










170 








17 5 




Ser 


Ala 


Ala 


Leu 


Ser 


Tyr 


Asp 


Arg 


Tyr 


Tyr Val 


Gin 


He 


Gly Tyr 


Glu 








180 










185 








190 






His Gly Leu 


Leu 


Asn 


Met 


Leu 


Lys 


Asp 


Ala Pro 


Asp 


Lys 


Thr 


Ser 


Leti 






195 










200 








205 








Arg 


Asn 


His 


Asp 


Phe 


Phe 


Val 


Gly 


Leu 


Gly Val 


Arg 


Phe 









210 215 220 



(2) INFORMATION FOR SEQ ID NO: 348 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

til > MOLECULE TYPE: protein 

Url» HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



WO 99/29870 
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50 



65 



70 



75 



(vi| ORIGTt4AL SOURCE: 

(A) ORGANISM: Porphyroraon^s gingivalis 

(i*;> FEATURE: 

(A) NAME/ KEY: misc_£eature 

(B) LOCATION 1. . .24 0 

(xil SEQUENCE DESCRI PTION : SEQ ID NO: 34 8 



10 



15 



20 



25 



30 



35 



40 



45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 211 amino aclcis 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protean 

(ill) HYPOTHETICAL: YES 

55 Ivi) ORIGINAL SOURCE: 

(A) ORGANTSM: Porphyromonas gingivalis 

iix) FEATURE: 

IA} NAME/KEY; mific feature 
60 (B) LOCATION X...Z11 



lie 


Lys 


Axg 


He 


Glu 




Lys 




He 


Phe 


Thr 


Val 


Ala 


Leu 


Val 


Leu 


1 








5 










10 










15 




Leu 


Ala 


Ser 


Val 


Thr 


Met 


Ala 


He 


Gly Gin 


Ser 


Arg 


Pro 


Ala 


Leu 


Arg 








20 










25 










30 






Val 


Asp 


Ala 


Asn 


Phe 


Val 


Gly 


Ser 


Asn 


Gin 


Ser 


Met 


Lys 


Arg 


Asp 


Gly 






35 










40 










45 








Tyr Val 


Trp 


Asp 


Thr 


Lys 


Met 


Asn 


Val 


Gly 


Leu 


Arg 


Val 


Gly Ala 


Ala 




50 










55 










60 










Ala 


Glu 


Phe 


Met 


He 


Gly 


Ser 


Arg 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly Leu 


65 










70 










75 










80 


As n 


Tyr 


Thr 


Met 


Lys 


Gly 


Ser 


Lys 


Thr 


Glu 


Trp 


Asp 


He 


Pro 


Glu 


Met 










85 










on 










9b 




Val 


Pro 


Gly 


Thr 


Tyr 


He 


Thr 


Met 


Val 


Ser 


Thr 


Arg 


T^eu 


His 


Tyr 


Leu 








100 










105 










lie 






Gin 


Leu 


P-o 


He 


Asn 


Ala 


Gly 


Met 


Arg 


Phe 


Asp 


Leu 


Met 


Asn 


Asp 


Mot 






115 










120 










125 






Ala 


Val 


Ser 


He 


Glu 


Ala 


Gly 


Pro 


Phe 


Leu 


Ala 


Tyr 


Gly 


He 


Tyr 


Gly 




130 










135 










140 










Thr 


Tyr 


Arg 


Gin 


Lys 


Leu 


Glu 


Gly 


Trp 


Lys 


Pro 


A*n 


Asn 


Tyr 


ser 


Thr 


145 










150 










155 










160 


Glu 


Phe 


Phe 


Gly 


Pro 


Thr 


Leu 


Gly 


Gly 


Pro 


Thr 


Asn 


He 


Arg 


Trp 


Asp 










165 










170 










175 




He 


Gly Ala 


Asn 


He 


He 


Ala 


Ala 


Phe 


His 


Tyr 


Lys Arg 


Tyr 


Tyr 


He 








180 










185 










190 






Gin 


He 


Gly 


Tyr 


Glu 


His 


Gly 


Phe 


Val 


Asp 


He 


Val 


Ser 


Gly Gly Gly 






195 










200 










205 








Ser 


Asp 


He 


Pro 


Arg 


Leu 


Aen 


Asp 


Asn 


Arg 


Gin 


Ser 


Ser 


Ser 


Thr 


Thr 




210 










215 










220 










Ala 


Leu 


Arg 


Glu 


Lys 


Gly 


Asn 


Asn 


Glu 


Tyr 


Ala 


Tyr Asn 


Arg 


Asp 


Phe 


22 5 










230 










235 










240 


Phe 


Val 


Gly 


He 


Gly 


Tyr 


Arg 


Phe 


























245 
























(2) 


INFORMATION 


FOR 


SEQ 


ID NO:349 



















(xi) SEQUENCE DESCRIPTION: SEQ ; 


ID NO: 34 9 










Lys 


Arg Lys 


Ser 


Met Lys 


Arg 


Met 


Leu 


Leu Leu 


Leu 


Val 


Val 


Leu 


Leu 


1 














10 








15 




Tyr 


Gly He 


Ala 


Gly Arg 


Lou 


Ala 


Ala 


Gin Asp 


Val 


He 


Arg 


Pro 


Trp 






20 








25 








30 






Ser 


Leu Gin 


Val 


Gly Ala 


Gly 


Tyr 


Ser 


Asp Thr 


Glu 


Asn 


He 


Pro 


Gly 




35 








40 








45 








Gly 


Phe Thr 


Tyr 


Gly Phe 


Tyr 


T.eu 


Gly 


Lys Arg 


Met 


Gly Ser 


Phe 


Leu 




50 






b5 








60 










Glu 


Val Gly 


Leu 


Ser Met 


Tyr 


Asn 


Ser 


Thr Arg 


Gin 


Thr 


Ala 


Asn 


Aan 


65 






70 








75 










ao 


Ala 


Asp Scr 


Phe 


Ala Ser 


Asn 


Glu 


Gly 


Asp Gly 


Ser 


Phe 


Gin 


Val 


Asn 








85 








90 








95 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



278/490 





Met. 


Ser 


Ser 


Pro 
100 


Asn 


Glu 


Lys 


Trp 


Ser 
105 


Phe 


Phe 


A?p 


Ala 


Gly 
110 


Ser 


Ala 




Am 


Cys 


Tyr 
115 


Het 


lie 


Val 


Val 


Cly 
120 


Val 


Asn 


Pro 


Leu 


His 
125 


Leu 


Phe 


Trp 


5 


Gin 


Asn 
130 


Ser 


Arg 


His 


Asn 


Leu 
135 


Phe 


Leu 


Ala 


Val 


Gin 
140 


Ala 


Gly 


Leu 


Ser 




A.'n 


Lys 


His 


Asn 


He 


His 


Phe 


11* 


Tyr 


Gly 


Asp 


Lys 


Gly 


Ala 


Lys 


Val 




145 










150 










155 










160 


10 


Ser 


lie 


Tyr 


Thr 


Asn 


Ser 


Asn 


Thr 


Tyr 


He 


Gly 


Tyr 


Gly 


Ala 


Arg 


Val 










165 










no 










175 






Ala 


Tyr 


Glu 


Tyr 


Gin 


lie 


His 


Lys 


Asn 


Val 


Gly Ala 


Gly 


Ala 


Ala 


Val 










180 








185 










190 








Met 


Tyr 


Asp 


His 


Gly 


Asn 


Lys Met 


Leu 


Thr 


Ala 


Met 


Ala 


Thr 


Leu 


Ser 








195 










200 










205 








15 


Thr 


His 
210 


Phe 





























(2) INFORMATION FOR SEQ ID HO: 350 

20 (11 SEQUE1ICE CHARACTERISTICS: 

(A) LENGTH: 953 amino acids 

(B) TYPE: amino acid 
[ D) TOPOLOGY: linear 

25 (II) MOLECULE TYPE: protein 

(iiil HYPOTHETICAL; YES 

(Vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Porphyr onion as gingival is 

(ix) FEATURE: 

(A) NAME/KEY: ntisc_f eature 

(B) LOCATION 1. . . 953 



35 



40 



45 



50 



55 



60 



65 



70 



75 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 



He 


Arg 


Het 


Arg 


Val 


Ser 


Asp 


Leu 


Cys 


Ser 


Arg 


Leu 


Ser Trp 


Leu 


Leu 


1 








5 










10 








IS 




Pro 


Val 


He 


Leu 


Val 


Gly 


Leu 


Leu 


Cys 


Ala 


Thr 


Leu 


Val Ala 


Ala 


Glu 








20 










25 








30 






Arg 


Pro 


Met 


Ala 


Gly 


Ala 


Val 


Gly 


Leu 


His 


His 


Arg 


Arg His 


Ala 


Ala 






35 










40 










45 






Leu 


Ser 


Asp 


Ser 


Thr 


Ala 


Lys 


Asp 


Thr 


Val 


Pro 


Leu 


Ala Lys 


Pro 


He 




50 










55 










60 






Pro 


Asp 


Ser 


Ala 


Phe 


Arg 


Asp 


Ser 


Leu 


Pro 


Ala 


Asp 


Ser Thr 


Gly Ser 


65 










70 










75 








80 


Met 


Arg 


Gin 


Asp 


Ser 


Val 


Tyr 


Asp 


Asp 


Glu 


Phe 


Glu 


Leu Glu 


Asp 


Tie 










85 










90 








95 




Val 


Glu 


Tyr 


Glu 


Ala 


Ala 


Asp 


Ser 


He 


Val 


Leu 


Lsu 


Gly Gin 


Asn 


Arg 








100 










105 








no 






Ala 


Tyr 


Leu 


Phe 


Gly 


Lys 


Ser 


Tyr 


Val 


Ser 


Tyr 


Gin 


Lys Ssr 


Arg 


Leu 






115 










120 










125 






Glu 


Ala 


Asn 


Phe 


Met 


Tyr 


Leu 


Asn 


Thr 


Asp 


Ser 


Ser 


Thr Val 


Tyr 


Thr 




130 










135 










140 








Arg Tyr 


Val 


Leu 


Asp 


Thr 


Ala 


Gly 


Tyr 


Pro 


I let 


Ala 


Phe Pro 


Val 


Phe 


145 










150 










155 








160 


Lys 


Asp 


Gly 


Glu 


Gin 


Ser 


Phe 


Glu 


Ala 


Lys 


Asn 


Phe 


Thr Tyr 


Asn 


Phe 










165 










170 








17 5 




Arq 


Thr 


Glu 


Lys 


Gly 


He 


He 


Ser 


Gly 


Val 


He 


Thr 


Gin Gin 


Gly Glu 








180 










185 








190 






Gly 


Tyr 


Leu 


Thr 


Ala 


Gly 


Lys 


Thr 


Lys 


Lys 


Met 


Pro 


Asp Asn 


He 


Met 






195 










200 










205 






Phe 


Met 


Gin 


Gly 


Gly 


Arg 


Tyr 


Thr 


Thr 


Cys 


Asp 


Asn 


His A«*p 


His 


Pro 




23 0 










215 










220 








His 


Phe 


Tyr 


He 


Asn 


Leu 


Ser 


Lys 


Ala 


Ly£ 


Val 


His 


Pro Glu 


Lys 


Asp 


225 










230 










235 








240 


Tie 


Val 


Thr 


Gly 


Pro 


Val 


Asn 


Leu 


Val 


He 


Ale 


Asp 


Met Pro 


Leu 


Pro 










24S 










250 








255 




He 


Gly 


Leu 


Pro 


Phe 


Gly 


Tyr 


Phe 


Pro 


Phe 


Ser 


Asn 


Lys Tyr 


Ser 


Ser 








260 










26b 








270 






Gly 


He 


Leu 


Het 


Pro 


Thr 


Tyr 


Gly 


Glu 


Asp 


Asn 


Acg 


Tyr Gly 


Phe 


Tyr 






275 










280 










285 






Leu 


Arg 


Asn 


Gly 


Gly 


Tyr 


Tyr 


Pha 


Ala 


Phe 


Ser 


Asp 


Tyr He Asp Leu 




290 










2 95 










300 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



Ala 


Leu 


Arg 


Gly 


Gl u 


Tie 


Phe 


Ser 


305 










310 






Gin 


Ser 


Ly? 


Tyr 


Lys 


Lys 


A 


Tvr 










325 








A«m 


T 


Leu 


Val 


Ser 




Ser 


Gly 








340 










Ser 


Lys 


Thr 


Thr 


Ser 


Leu 


Asn 


He 
















360 


Lys 


Ala 




Pro 


Leu 


Gin 


Thr 


Leu 


370 










375 




Glv 


Ser 




Phe 


Gin 


Asn 


Ser 


Leu 


385 










390 






Arg 


Thr 


Ala 


Thr 


Thr 


Arg 


Ser 


Ser 










405 








Pro 


y 


Thr 


Pro 


Phe 


Ser 


He 


Thr 








420 










e 


Arg 


Asp 


Thr 


Thr 


Val 


Ser 


Leu 






4 35 










440 






Thr 


Arg 


Tyr 


Pro 


Phe 


Lys 














455 




j ^? 


Trp 


_ 

■* r 


Glu 


Lys 


Leu 


Ser 


Val 


465 










470 






Ssr 


Tie 


Leu 


Thr 


Lys 


Glu 


Lys 


Asp 










405 








Asp 


Trp 


Lys 


As n 


Gly 


Met 


Arg 


His 








500 










Pro 


Leu 




Asp 


Tyr 


He 


Asn 


Leu 






515 










520 


Trp 


Trp 


Tyr 


Thr 


Lys 


Gly 


He 


Arg 




530 










535 




Thr 


Phe 


Leu 


Pro 


Ser Asp Thr Thr 


545 










550 






Tyr 


er 


eu 


Ser 


Ala 


Gly Leu 


Ser 










565 








Pro 


trp 


Lys 


Pro 


Phe 


Ser 


Phe 


Gly 








580 








rg 


Ph 

e 


Thr 


Pro 


Thr 


Val 


Ser 


Phe 






595 










600 


rg 


Arg 


Tyr 


Gly 


Phe 


Trp 


Glu 


Leu 




610 










615 




J s 


Leu 


His 


Thr 


Leu 


Leu 


Tyr 


Ser 


625 










630 






Ala 


Pro 


Ser 


Met 


Gly Asn 


Ala 


Gly 










645 








Asn 


Leu 


Glu 


Ala 


Lys 


He 


Lys 


Ser 








660 










- 


lie 


Ser 


Leu 


He 


Asp 


Gin 


Phe 






67 5 








680 




Ala 


Asp 


Ser 


He 


Arg 


Trp 


Ser 




690 










695 




Arg 


Lgu 


ser 


Lys 


Ser 


Phe 


Thr 


Leu 


705 










710 






Tyr 


Leu 


Thr 


Lys 


Tyr Tyr Glu Gly 










725 








Lys 


Ser 


Asn 


Asp 


Leu 


Arg 


He 


Phe 








740 










He 


Ser 


Thr 


Gly Thr 


Ser 


Phe 


Ser 






75b 










760 


Ser 


Gly 


Leu 


He 


Ala 


Leu 


Phe 


Ser 




770 










775 




Lys 


Lys 


Asn 


Thr 


Gly Ala 


Thr 


Pro 


7 85 










790 






lie 


Leu 


Glu 


Gly Gly 


Arg 


Pro 


Gin 










005 








Glu 


Arg 


Asn 


Arg 


Gin Gly Gly Ala 








820 










Tyr 


Ser 


He 


Pro 


Trp 


Ser 


Leu 


Ser 






835 










84 0 


Thr 


Asp 


Tyr 


A.*?n 


Arg 


Tyr 


Asn 


Val 




850 










055 




Val 


Thr 


Gin 


Asn 


Leu 


Ser 


Phe 


Arg 


865 










870 




Trp 


Ser 


Phe 


Gly 


Phe 


Asn 


Ala 


Asn 










885 








Thr 


Sec 


Leu 


Thr 


Cys 


Asn 


Val 


Thr 
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Lys 


Gly 


Ser 


Trp 


Gly 


He 


Ser 


Ala 






315 










320 


Lys 


Tyr 




Gly 


See 


Phe 


GlU 


Ala 




330 










335 




Asp 


Lys 




Va 1 


Pro 


Gly 


nap 


- 

1 y r 


345 










350 






Arg 


Trp 


Thr 


His 


Ser 


Gin 


Asp 


Pro 










365 








Ser 


Ala 


Asn 


Val 


Asn 


Phe 


Ala 


Thr 








380 










Asn 


Thr 


Thr 


Tyr 


Asp 


Val 


Asn 


Ala 






395 










400 


Ala 


Val 


Ser 


ly r 


Ser 


Arg 


Lys 


Phe 




410 










415 




Gly 


Ser 


Met 


Asp 


He 


Ser 


Gin 


As 


425 










430 






Thr 


Leu 


Pro 


sn 


Leu 


Ser 


He 


Asn 










445 








Arg 


Lys 


Thr 




Val 


Gly 


Pro 


U 


















Gly Tyr 


Ser 


y 


Gin 


Leu 


Arg 


A^n 






475 










480 


Leu 


Leu 


Gin 


Ser 


Asn 


Leu 




Arg 




4 90 










4 95 




Ser 


Val 


Pro 


T 


Ser 


Leu 


Thr 


Va 


505 










510 






Thr 


Met 


Gly 


Val 


Asn 


Tyr 














525 






Lys 


Ser 


Trp 


Asn 


Glu 


Asp 


ys 


Lys 








540 










Tyr 


Lys 


Phe 


Arg 


Arg 


Leu 


Tyr 


Asp 






555 












The 


Thr 


Leu 


ryr 


Gly Met 


Phe 






570 










575 




Gly Asn 


Leu 


He 


Met 


lie 


Arg 


His 


585 










590 






Ser Tyr Met 


Pro 


Asp 


Phe 


Thr 


Lys 










605 








Leu 


Glu 


His 


Thr 


Asp Gin 


Asn 


Gly 








620 










Pro 


Tyr 


Phe 


Glu 


Gin 


He 


Phe 


Glv 






635 










64 0 


ser 


Val 


Asn 


Phe 


Ser 


Phe 


Asp 


Asn 




650 










655 




Lys 


Ser 


Asp 


Ser 


Thr 


Gly 


He 


L"s 


665 










67 0 






Thr 


Trp 


Ser 


Thr 


Ser 


Tyr 


Asn 


Met 










685 








Asn 


He 


Ser 


Ala 


Ser 


Leu 


Ala 


Leu 








700 










Arg 


Leu 


Ser 


Gly 


Leu 


Phe 


Asp 


Pro 






715 










720 


Glu 


Asp Gly 


Lys 


He 


He 


Pro 


Tyr 




730 










735 




Asn 


Gly Lys 


Gly 


Leu 


Ala 


Arg 


Leu 


745 










750 






Tyr 


Thr 


Leu 


Asn 


Lys 


Glu 


Ser 


Leu 










765 








Gly 


Lyo 


Lys 


Glu 


Arg 


Arg 


Asp 


Glu 








780 










His 


Glu 


Gly 


Asp 


Asp Ala 


Ala 


Asp 






7 95 










800 


Asn 


Glu 


Ser 


Gly 


Gly 


Ser 


Leu 


Leu 




810 










815 




Val 


Asp 


Gin 


Asp 


Gly Tyr 


Phe 


Ala 


825 










830 






Phe 


Asp 


Tyr 


Ser 


Trp Asn 


He 


Ala 










845 








Asn 


Lys 


Met 


Glu 


His 


Tyr 


Tyr 


Arg 








860 










Gly 


Asn 


lie 


Gin 


Pro 


Thr 


Pro 


Asn 






87 5 










880 


Tyr. 


Asn 


Phe 


Asp 


Leu 


Lys 


Lys 


He 




890 










895 




Arg 


Asp Met 


His 


Cys 


Trp 


Ala 


He 
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900 905 910 

Ser Ala Ser Phe lie Pro He Gly Ala Tyr Lys Ser Tyr Asn Phe Val 

515 920 925 

He Ser VqI Lys Ser Ser Leu Leu GLn Asp Leu Lys Tyr Gin Gin Ser 

930 935 940 

Asn Arg Pro He Thr Asn Thr Trp Tyr 
945 950 



(2) INFORMATION FOR SEQ 10 NO; 351 

ti) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1251 amino acids 
(Bl TYPE: amino acid 
* D) TOPOLOGY i linear 

ill) MOLECULE TYPE: protein 

I ill) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas glngivalis 

(ixl EtATURE; 

(A) NAME/ KEY : misc_f eature 

(B) I.OCATION 1.,.1251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 



Lys 


Trp 


Lys 


Leu 


Ala 


Cys 


Ala 


Phe 


Asp 


Cys 


Ala 


Cys 


Cys 


Phe 


Asp 


Pro 


1 








5 










10 










15 




Phe 


v^l 


Val 


Thr 


Asn 


GlU 


Val 


He 


He 


Het 


Met 


Lys 


Arg 


Tyr 


Thr 


He 








20 










25 










30 






He 


Leu 


Ala 


Val 


Phe 


Leu 


Leu 


Phe 


Cys 


Thr 


val 


Phe 


Thr 


Phe 


Gin 


He 






35 










40 










45 








Lys 


Ala 


Arg 


Pro 


Tyr 


Glu 


Arg 


Phe 


Ala 


Asp 


Val 


Glu 


Lys 


Pro 


Trp 


He 




50 










55 










60 










Gin 


Lys 


His 


Ser 


Met 


Asp 


Ser 


Lys 


Leu 


Val 


Pro 


Ala 


Asn 


Lys 


Gly Asn 


65 










70 










75 










80 


Leu 


He 


Gin 


Ala 


Glu 


He 


Val 


Tyr 


Gin 


Ser 


Val 


Ser 


Glu 


His 


Ser 


Asp 










65 










90 










95 




Leu 


Val 


He 


Ser 


Pro 


Val 


Asn 


Glu 


He 


Arg 


Pro 


Ala 


Asn 


Ac<g 


Phe 


Pro 








100 










105 










110 






Ser 


His 


Arg 


Lys 


Ser 


Phe 


Phe 


Ala 


Glu 


Asn 


Leu 


Arg 


Ala 


Ser 


Pro 


Pro 






115 










120 










125 








Val 


Val 


Pro 


Val 


Ala 


Val 


Asp 


Lys 


Tyr 


Ala 


Val 


Pro 


Val 


Ala 


Asn 


Pro 




130 










135 








140 










Met 


Asp 


Pro 


Glu 


Asn 


Pro 


Asn 


Ala 


Trp 


Asp 


Val 


Thr 


I.eu 


Lys 


He 


Thr 


145 










150 










155 










160 


Thr 


Lys 


Ala 


Val 


Thr 


Val 


Pro 


Val 


Asp 


Val 


Val 


Met 


Val 


He 


Asp 


Gin 










165 










no 










175 




Ser 


Ser 


Ser 


Het 


Gly 


Gly 


Gin 


Asn 


He 


Ala 


Arg 


I.eu 


Lys 


Ser 


Ala 


He 








180 










185 










190 






Ala 


Ser 


Gly 


Gin 


Arg 


Phe 


Val 


Lys 


Lys 


Het 


Leu 


Pro 


Lys Gly Thr 


Ala 






195 










200 










205 








Thr 


Glu 


Gly 


Val 


Arg 


He 


Ala 


Leu 


Val 


Ser 


Tyr 


Asp 


His 


Glu 


Pro 


His 




2J0 










215 








220 










Arg 


Leu 


Ser 


Asp 


Phe 


Thr 


Lys 


Asp 


Thr 


Ala 


Phe 


Leu 


Cys 


Gin 


Lys 


He 


225 










230 










235 








240 


Arg 


Ala 


Leu 


Thr 


Pro 


He 


Trp 


Gly 


Thr 


His 


Thr 


Gin 


Gly 


Gly 


Leu 


Lys 










245 










250 










255 


Met 


Ala 


Arg 


Asn 


He 


Met 


Ala 


Thr 


Ser 


Thr 


Ala 


Val 


Asp 


Lys 


His 


Tie 








260 










265 








27 0 






He 


Leu 


Met 


Ser 


Asp 


Gly 


Leu 


Ala 


Thr 


Glu 


Gin 


Tyr 


Pro 


Val 


Lys 


Asn 






275 










280 










285 






Val 


Thr 


Thr 


Ala 


Asp 


Phe 


He 


Gly 


Lys 


Thr 


Gly 


Asn 


Ala 


Asn 


Asp 


Pro 




290 










295 










30O 










He 


Asp 


Leu 


Val 


He 


Gin 


Gly 


Ala 


He 


Asn 


Phe 


Pro 


Thr 


Asn 


Tyr 


Val 


305 










310 










315 








320 


Ser 


Asn 


Asn 


Pro 


Ser 


Thr 


Pro 


Leu 


Thr 


Pro 


Asn 


Tyr 


Pro 


Thr 


His 


Ser 










325 










330 










335 




Ser 


Lys 


Val 


Gly 


Arg 


Arg 


Asn 


Leu 


Pro 


Glu 


S*r 


Lys 


Phe 


Asp 


Tyr 


Ser 








340 










345 










35G 






Asn 


Leu 


Ser 


Ala 


Arg 


He 


Thr 


Phe 


Asp 


Gly 


Val 


Ala 


Gly Ala 


Leu 


Val 






355 










360 








365 








Tyr 


Glu 


Pro 


Arg 


Phe 


Pro 


His 


Pro 


Tyr 


Tyr 


Tyr 


Tyr 


Phe 


Pro 


Cys 


Asn 
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3"/0 








37 5 










380 










Al a 




lie 


Asn 


Glu 


Ala Gin 


Phe 


Ala 


Lys 


Asn 


Ser 


Gly 


Tyr 


Thr 


lie 


385 










390 








395 








400 


His 


Thr 




Gly Tyr Asp Leu 


Gly Asp 


Phe 


Ala 


Leu 


Ala 


Asn 


Asn 


Ser 










4 05 








410 










415 




Leu 


Lys 


Leu 


Thr 
420 


Ala 


Thr Asp 


Glu 


Asn 
425 


His 


Phe 


Phe 


Thr 


Ala 
430 


Thr 


Pro 


Ala 


Asn 


Leu 
435 


Ala 


Ala 


Ala Phe 


Asp 
440 


Asn 


He 


Ala 


Gin 


Thr 
445 


He 


Asn 


Tie 


Gly 


lie 


Gin 


Arg 


Gly Glu Val 


Thr 


Asp 


Phe 


Val 


Ala 


Pro 


Gly 


Phe 


He 




4 50 








455 










460 










Val 


Lys 


Asn 


Leu 


Thr 


Gin Ser 


Gly 


Asp 


Val 


Thr 


His 


Leu 


Leu 


Asn 


Val 


465 










470 








475 










480 


Ser 


Asn 


Gly 


Thr 


Val 


Ills Tyr 


Asp 


Val 


Ser 


Thr 


Lys 


Lys 


Leu 


Thr 


Trp 










4B5 








4 90 










4 95 


The 


Thr 


Gly 


Thr 


lie 


Leu Ser 


Ser 


Ser 


Glu 


Ala 


?nr 


He 


Thr Tyr Arg 








500 








505 










510 






lie 


Tyr 


Ala 


Asp 


Leu 


Asp Tyr 


He 


Gin 


Asn 


Asn Acp 


He 


Pro 


Val 


Asn 






515 








520 










525 








Thr 


Thr 


Ser 


Ala 


lie 


Gly Pro 


Asp 


Leu 


Gly 


Gly 


Phe 


Asp 


Thr 


Asn 


Thr 




530 








535 










540 








Glu 


Ala 


Lys 


Leu 


Thr 


Tyr Thr 


Asn 


Ser 


Ann 


Gly Glu 


Pro 


Asn 


Gin 


Gin 


545 










550 








555 










560 


Leu 


lie 


Phe 


Pro Arg 


Pro Thr 


Val 


Lys 


Leu 


Gly 


Tyr 


Gly 


Val 


He 


Lys 










565 








570 










575 


Arg 


His 


Tyr 


Val 
580 


Leu 


Val Asn 


Lys 


Asp 
585 


Gly 


Gln 


Pro 


He 


Gin 
590 


Ala 


Asn 


GJy 


Thr 


Val 

5 95 


Val 


Ser 


Sex Leu 


Ser 
600 


Glu 


Ala 


His 


Vai 


Leu 

605 


Gin 


Ser 


Gin 


Asp 


Phe 


Phe 


Leu 


Pro 


Ser Gly 


Gly Gly 


His 


He 


Val 


Pro 


Lys 


Trp 


He 




610 


















620 








Lye 


Leu 


Asp 


Lys 


Thr 


Thr Glu 


Ala 


Leu 


Gin 


Tyr 


T V r 


Ser 


Val 


Pro 


Pro 


625 










630 








635 








€4 0 


Thr 


Asn 


Thr 


Val 


He 
645 


Thr Thr 


Ala 


Asp 


Gly 
650 


Lys 


Arg 


Tyr 


Arg 


Phe 
655 


Val 


Glu 


Val 


Pro 


Gly 


Ser 


Thr Pro 


Asn 


Pro 


Gly 


Gin 


He 


Gly 


He 


Ser 


Trp 








660 








665 








670 






Lys 


Lys 


Pro 


Ala 


Gly Asn Ala 


Tyr 


Phe 


Ala 


Tyr 


Lye 


Leu 


Leu 


Asn 


Tyr 






675 








680 










685 






Trp 


Hfit 


Gly 


Gly Thr 


Thr Asp 


Gin 


Gin 


S-r 


Glu 


Trp Asp Val 


Thr 


Ser 




690 








69b 










700 










Asn 


Trp 


Thr 


Gly 


Ala 


Gin Val 


Pro 


Leu 


Thr 


Gly Clu 


Asp 


Val 


Glu 


Phe 


705 










710 








715 










720 


Ala 


Thr 


Thr 


Glu 


Asn 


Phe Gly 


Ser 


Pro 


Ala 


Val 


Ala 


Asp 


Leu 


His 


Val 










725 








730 








735 




Pro 


Thr 


Thr 


Asn 


Pro 


Lys He 


He Gly 


Asn 


Leu 


He 


Asn 


Asn 


Ser 


Asp 








7-10 








745 










750 




Lys 


Asp 


Leu 


Val 


Val 


Thr Thr 


Ser 


Ser 


Gin 


Leu 


Thr 


He 


Asn 


Gly Val 






755 








760 










7 65 








Val 


Glu 
770 


Asp 


Asn 


Asn 


Pro Asn 
775 


Val 


Gly 


Thr 


He 


Val 
7 80 


Val 


Lys 


Ser 


Ser 


Lys 


Asp Asn 


Pro 


Thr 


Gly Thr 


Leu 


Leu 


Phe 


Ala 


Asn 


Pro 


Gly Tyr Asn 


785 










790 








795 










800 


Gin 


Asn 


Val 


Gly Gly Thr Val 


Glu 


Phe 


Tyr 


Asn 


Gin 


Gly Tyr Asp Cys 










80S 








B10 










815 




Ala 


Asp Cys 


Gly 


Met 


Tyr Arg 


Arg 


Ser 


Trp 


Gin 


Tyr 


Phe 


Gly 


He 


Pro 








B20 








025 










830 






Val 


Asn 


Glu 
835 


Ser 


Gly 


Phe Pro 


He 
840 


Asn 


Asp 


Val 


Gly 


Gly 
845 


Asn 


Glu 


Thr 


Val 


Asn 


Gin 


Trp 


Val 


Glu Pro 


Phe 


Asn 


Gly 


Asp 


Lys 


Trp 


Arg 


Pro 


Ala 




850 








855 








86C 








Pro 


Tyr Ala 


Pro 


Asp 


Thr Glu 


Leu 


Gin 


Lys 


Phe 


Lys 


Gly 


Tyr 


Gin 


lie 


865 










870 






875 






880 


Thr 


Asn 


Asp 


Val 


Gin 


Ala Gin 


ero 


Thr 


Gly 


Val 


Tyr 


Ser 


Phe 


Lys 


Gly 










865 








090 










8 95 


Met 


lie eye 


Val 


Cys 


Asp Ala 


Phe 


Leu 


Asn 


Leu 


Thr 


Arg 


Thr 


Ser 


Gly 








900 








905 








910 




Val 


Asn Tyr 


Ser Gly Ala Asn 


Leu 


He 


Gly 


Asn 


Ser 


Tyr 


Thr 


Gly Ala 






915 








920 










925 








lie 


Asp lie 


Lys 


Gin 


Gly He 


Val 


Phe 


Pro 


Pro 


Glu 


Val 


Glu 


Gin 


Thr 




930 








935 










940 










Val 


Tyr 


Leu 


Phe 


Asn 


Thr Gly 


Thr 


Arg 


Asp 


Gin Trp Arg 


Lys 


Leu 


Asn 


94 5 










950 








955 










960 


Gly 


Ser 


Thr 


Val 


Ser 
965 


Gly Tyr 


Arg 


Ala 


Gly 
97 0 


Gin 


Tyr 


Leu 


Ser 


Val 
9/5 


Pro 
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l.yc Asn Thr Ala Gly Gin Asp Asn Leu Pro Asp Arg He Pro Ser Met 

980 965 990 

His Ser Phs Leu Val Lys Met Gin Asn Gly Ala Ser Cys Thr Leu Unk 
995 1000 1005 

5 He Leu Tyr Asp Lye Leu Leu Lys Asn Thr Thr Val Asn Asn Gly Asn 
1010 1015 L020 

Gly Thr Gin He Thr Trp Arg Ser Gly Asn Ser Gly Ser Ala Asn Met 
1025 1030 1035 1040 

Pro Ser Leu Val Met Asp Val Leu Gly Asn Glu Ser Ala Asp Arg Leu 
10 1045 1050 1055 

Trp He Phe Thr Asp Gly Gly Leu Ssr Phe Gly Phe Asp Asn Gly Trp 

1060 1065 1070 

Asp Gly Arg Lys Leu Thr Glu Lys Gly Leu Ser Gin Leu Tyr, Ala Met. 
1075 1080 1085 

15 Ser Aep He Gly Asn Asp Lys Ehe Gin Val Ala Gly Val Pro Glu Leu 
1090 1095 1100 

Asn Asn Leu Leu He Gly Phe Asp Ala Asp Lys Asp Gly Gin Tyr Thr 
1105 1110 1115 1120 

Leu Glu Phe Ala Leu Ser Asp His Ph= Ala Lys Gly Ala Val Tyr Leu 
20 1125 1130 1135 

Hi« Asp Leu Gin Ser Gly Ala Lys His Arg He Thr Asn Ser Thr Ser 

1140 H45 1150 

Tyr Ser Phe Asp Ala Lys Arg Gly Asp Ssr Gly Ala Arg Phe Arg Leu 
1155 H60 1165 

25 Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp Ser His Val Val Ser Thr 
1170 1375 1180 

Asn Gly Arg Glu He He He Leu Asn Gin Asp Ala i,eu Asp Cys Thr 
1185 1190 1195 1200 

Val Thr Leu Phe Thr lie Glu Gly Lys Leu Leu Arg Arg Leu Lys Val 
30 1205 1210 1215 

Leu Aid Gly His Arg Glu Val Met Lys Val Gin Thr Gly Gly Ala Tyr 

1220 1225 1230 

He Val His Leu Gin Asn Ala Phe Thr Asn Asp VaL His Lys Val Leu 
1235 1240 1245 

35 Val Glu Tyr 
1250 

(2) IN FORMAT ION FOR SEQ ID MO; 352 

40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) HOLECULE TYPE: protein 

<ili> HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE : 
50 (AJ ORGANISM: Porphyromonas gingival is 

liz) FEATURE: 

(A) NAME/KEY: misc_feature 
JB) LOCATION 1. . .426 

55 



60 



65 



70 



75 





Ui) 


SEQUENCE PESCRiPTION: SEQ ID MO: 352 






Thr 


Ilet 


Glu 


Val 


Lys 


Lys 


Asn 


Thr 


Val 


Val 


Leu 


Arg 


Leu 


Leu He Trp 


1 








5 










10 








15 


Phe 


Val 


Ala 


Tie 


Leu 


Leu 


Phe 


His 


Ser 


Ser 


Arg 


Leu 


Trp 


Gly Gin Glu 








20 










25 










30 


Gly Clu Gly 


Ser 


Ala 


Axg 


Tyr 


Arg 


Phe 


Lys 


Gly 


Pne 


Val 


Asp Thr Tyr 






35 










40 










45 




His 


Ala 


Val 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 


Asp 


Phe 


Met 


Ser 


Ser Arg Thr 




50 










55 










60 




Arg 


Val 


Arg 


Gly 


Glu 


Leu 


Glu 


Arg 


Ser 


Phe 


Gly 


Asn 


Ser 


Lys Val Ala 


65 










70 










75 






90 


Val 


Ser 


Val 


Asn 


Ala 


Thr 


Tyr 


Asn 


Ala 


Leu 


Leu 


Lys 


Asp 


Glu Thr Gly 










85 










90 








95 


Leu 


Arg 


Leu 


Arg 


Glu 


Ala 


Phe 


Phe 


Glu 


His 


Gin 


Glu 


Glu 


1U3 Trp Gly 








100 










105 










110 


Leu 


Arg 


Leu 


Gly 


Arg 


Gin 


He 


Val 


He 


Trp 


Gly 


Ala 


Ala 


Asp Gly Val 






115 










120 










125 




Arg 


He 


Thr 


Asp 


Leu 


He 


Ser 


Pro 


Met 


Asp 


Met 


Thr 


Glu 


Phe Leu Ala 




130 










135 








140 
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55 



60 



65 



70 



75 



Gin Asp 


Tyr 


Asp 


Asp 


He 


Arg 


Met 


Pro 


Val 


Asn 


Ala 


Leu Arg 


Phe 


Ser 


145 








150 










155 








160 


Val Phe 


Asn 


Glu 


Ser 


Met 


Lys 


Val 


Glu 


Val 


V«l 


Val 


Leu Pro 


Val 


Phe 








165 










170 








175 




Glu Gly 


Tyr 


Arg 


Leu 


Pro 


Val 


Asp 


Pro 


Arg 


Asn 


Pro 


Trp Asn 


He 


Phe 






180 










185 








190 






Ser Leu 


Ser 


Pro 


He 


Ala 


Gin 


Gly Met 


Asn 


He 


Val 


Trp Lys 


Glu 


Glu 




195 










200 










205 






Ala Gly 


Lys 


Pro 


Ala 


Phe 


Lys 


Val 


Ala 


Asn 


He 


Glu 


Tyr Gly Ala 


Arg 


210 










215 










220 








Trp Ser 


Thr 


Thr 


Leu 


Ser 


Gly 


He 


Asp 


Phe 


Ala 


Leu 


Ala Ala 


Leu 


Hi^ 


225 








230 










235 








240 


Thr Trp 


Asn 


Lys 


Met 


Pro 


Val 


He 


Glu 


Val 


Gin 


Gly 


He Val 


Pro 


Thr 








245 










250 








255 




Glu He 


He 




Ser 


Pro 


Arg 


Tyr 


Tyr 


Arg 


Met 


Gly 


Phe Val 


Gly 


Gly 






260 










265 








270 






Asp Leu 


Ser 


Val 


Pro 


Val 


Gly Gin 


Phe 


Val 


Phe 


Arg 


Gly Glu Ala 


Ala 




27 5 










280 










285 






Phe Asn 


He 


Asp 


Lys 


His 


Phe 


Thr 


Tyr 


Lys 


Ser 


His 


Ala Glu 


Gin 


Glu 


290 










295 










300 








Gly Fhe 


Gin 


Thr 


He 


Asn 


Trp 


Leu 


Ala 


Gly Ala 


Asp Trp Tyr 


Ala 


Pro 


305 








310 










31S 








320 


Gly Clu 


Trp 


Met 


He 


Ser 


Gly Gin 


Phe 


Ser 


Met 


Glu 


Ser He 


Phe 


Arg 








325 










330 








335 




Tyr Arg 


Asp 


Phe 


He 


Ser 


Gin 


Arq 


Gin 


His 


Ser 


Thr 


Leu lie 


Thr 


Leu 






340 










345 








350 






Apn Val 


Ser 


Lys 


Lys 


Phe 


Phe 


Gly 


Ser 


Thr 


Leu 


Gin 


Leu Ser 


Asp 


Phe 




355 










360 










365 






Thr Tyr 


Tyr 


Asp 


Leu 


Thr 


Gly Lys 


Gly Trp 


Phe 


Ser 


Arg Phe 


Ala 


A3 * 


37 0 










375 










380 








Arp Tyr 


Ala 


Leu 


Asn 


Asp 


Gin 


He 


His 


Leu 


Met 


Ala 


Gly Tyr 


Asp 


Trp 


385 








390 










395 








4 00 


Phe Ser 


Ser 


Lye 


Gly Ser Gly 


He 


Phe 


Asp 


Arg 


Tyr 


Lys Asp 


Asn 


Ser 








405 










410 








415 




Glu Leu 


Trp 


Phe 


Lys 


Ala 


Arg 


Tyr 


Ser 


Phe 
















420 










425 














(2) INFORMATION 


FOR 


SEQ 


ID NO: 353 















10 



15 



20 



25 



30 



35 



40 <i> SEQUENCE CHARACTERISTICS: 

I A) LENGTH : 4 64 amino acids 
(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

45 (11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

ivi) ORIGINAL SOURCE: 
50 (A J ORGANISM: Porphyroroonas gingival! s 



(ix) FEATURE: 

{A) NAME/ KEY: mis cofeature 
(B) LOCATION 1, . . 464 





<XH 


SEQUENCE DESCRIPTION: SEQ ID NO: 353 










Tyr 


Gly 


Lys 


Arg 


Arg 


Lys 


Leu 


Gly 


Thr 


Ser 


Val 


Arg 


Pro 


Ser 


Val 


Leu 


1 








5 










10 








15 




Thr 


Gin 


He 


Arg 


Phe 


He 


Leu 


Asp 


Leu 


His 


Leu 


He 


Thr Asp 


Phe 


Phe 








20 










25 










30 






Glu 


Gli- 


Leu 


Arg 


Val 


Asn 


Pro 


He 


Gly 


Ala 


Ala 


Ala 


He 


Val 


Ala 


Phe 






35 










40 










45 








He 


lle 


Asp 


Leu 


Leu 


Leu 


Leu 


Cys 


Cys 


Ser 


Ala 


Phe 


Met 


Ser 


Ser 


Cys 




50 










55 










60 








Glu 


Val 


Ala 


Tyr 


Phe 


Ser 


Leu 


Lys 


Pro 


He 


Asp 


Leu 


Gin 


Asn 


He 


Arg 


65 










70 










75 










80 


Glu 


Arg 


Asn 


His 


Ser 


Ser 


Asp 


He 


Ala 


Leu 


Ser 


Asn 


Leu 


Leu 


Asp 


Asn 










85 










90 










95 




Ser 


Asn 


Gin 


Leu 


Leu 


Ala 


Thr 


He 


Leu 


He 


Gly 


A.-?n 


Asn 


Val 


He 


Asn 








100 










105 








110 






Val 


Ala 


He 


Val 


He 


Leu 


Ser 


Asn 


Tyr 


Ala 


He 


Glu 


Gin 


Thr 


Phe 


Val 






US 










120 










125 








Phe 


Ser 


Ser 


Pro 


He 


He 


Gly 


Phe 


Leu 


He 


Gin 


Thr 


He 


Leu 


Leu 


Thr 




130 










13S 










140 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



284/ 490 



Thr 


Val 


Leu 


Leu 


Leu 


Phe 


Gly Glu 


He 


Leu 


Pro 


Lys 


Val 


Tyr 


Ala 


Arg 


145 










150 










155 










160 


Lys 


Asn 


Pro 


Leu 


Gin 
165 


Tyr 


Ser 


Arg 


Phe 


Sec 
170 


Ala 


Ala 


Ala 


Met 


Ser 
175 


Val 


lie 


Tyr 


Lys 


He 
160 


Leu 


Ser 


Pro 


Phe 


Ser 
165 


Lys 






Val 


l.ys 

190 


Ser 


Thr 


Gly He 


Val 


Thr 


Arg 


Gly 


He 


Ser 


Lys 


Lys 


Lys 


Tyr 


A-sp 


Met 


Ser 


Val 






195 










200 










205 








Asp 


Glu 


Leu 


St»r 


Lys 


Ala 


Val 


Ala 


Leu 


Thr 


Thr 


Thr 


Glu 


Gly Glu 


Pro 




210 










215 










220 










Glu 


Glu 


L 


Glu 


Met 


He 


Asn 


Glu 


He 


11 e 


Lys 


Phe 


Tyr 


Asn 


Lys 


Thr 


225 










230 










235 










240 


Ala 


Cys 


Glu 


lis 


Met 
245 


Val 


Pro 


Arg 


He 


Asp 
2 50 


He 


Val 


Asp 


Val 


Asp 

255 


Leu 


Scr 


Trp 


Pro 


Phe 


Arg 


Lys 


Met 


Leu 


Asp 


Phe 


Val 


Val 


Ser 


Ser 


Gly Tyr 








260 










265 










2">0 






Ssr 


Arg 


Leu 
275 


Pro 


Val 


Ser 


Glu 


Gly 
280 


Ser 


Glu 


Asp 


Asn 


He 
285 


l.ys 


Gly 


Val 


lie 


Tyr 


He 


Lys 


Asp 


Leu 


He 


Pro 


His 


Met 


Asp 


Lys 


Gly Asp 


Glu 


Phe 




290 










225 










300 










Asp 


Trp 


His 


Pro 


Leu 


He 


Arg 


Lys 


Ala 


Tyr 


Phe 


Val 


Pro 


GlU 


Asn 


Lys 


305 










310 










315 










320 


Arg 


lie 


Asp 


Asp 


Leu 

325 


Leu 


Glu 


Glu 


Phe 


Arg 
330 


Ala 


Asn 


Lys 


Val 


His 
335 


Val 


Ser 


He 


Val 


Val 


Asp 


Glu 


Phe 


Gly Gly 


Thr 


Cys 


Gly 


Leu 


He 


Thr 


Het 








340 










345 










350 






Glu 


ACp 


Tie 
355 


Leu 


Glu 


Glu 


He 


Val 
360 


Gly 


Glu 


He 


Thr 


Asp 

365 


Glu 


Tyr 


Asp 


Glu 


Glu 
37 0 


Glu 


Leu 


Pro 


Phe 


Lys 
375 


Val 


Leu 


Gly 


Asp 


Gly 
380 


Ser 


Tyr 


Leu 


Phe 


Glu 


Gly 


Lys 


Thr 


Ser 


Leu 


Ser 


Asp Val 


Arg 


(lis 


Tyr 


Leu 


Asp 


Leu 


Pro 


385 










390 










3 95 










400 


Glu 


Asn 


Ala 


Phe 


Gly Glu 


Leu 


Gly Asp 


Glu 


Val 


Asp 


Thr 


Leu 


Ser 


Gly 










4 05 










410 










415 




Leu 


Phe 


Leu 


Glu 


He 


Lys 


Gin 


Glu 


Leu 


Pro 


His 


Val 


Gly Asp 


Thr 


Ala 








420 










425 










430 






Val 


Tyr 


Glu 
435 


Pro 


Phe 


Arg 


Phe 


Gin 
440 


Val 


Thr 


Gin 


Met 


Asp 
445 


Lys 


Arg 


Arg 


lie 


He 
450 


Glu 


He 


Lys 


He 


Phe 
455 


Pro 


Phe 


Glu 


Arg 


Thr 
460 


Trp 


Glu 


Val 


Glu 



\2) INFORMATION FOR SEQ ID NO: 354 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 
|B) TYPE: amino acid 
ID) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qingivalis 

Ux) FEATURE: 

IA) NAME/ KEY: misc_feature 

(B) LOCATION 1. - .266 



UiJ SEQUENCE DESCRIPTION: SEQ ID NO: 354 



He 


He 


Tyr 


Arg 


Ser 


Thr 


Met 


Lys 


Leu 


Leu 


Leu 


Tyr 


Leu Leu 


Leu 


Val 


3 








5 










10 






15 




Leu 


Ser 


Thr 


Leu 


Ser 


Pro 


Met 


Tyr 


Ser 


Gin 


list 


Leu 


Phe Ser 


Glu 


Asn 








20 










25 








30 






Leu 


Thr 


Heu 


Acn 


Tie 


Asp 


Ser 


Thr 


Lys 


Thr 


He 


Gin 


Gly Thr 


He 


Leu 






35 










40 










45 






Pro 


Val 


Leu 


Asp 


Phe 


Lys 


Thr 


Glu 


Lys 


Glu 


Asn 


Val 


Phe Thr 


Phe 


Lys 




50 










55 
















Asn 


Thr 


Ala 


Asn 


Leu 


Asn 


Leu 


Leu 


He 


Lys 


His 


Gly Gin Val 


He 


Am 


65 










70 








75 








80 


Leu 


He 


Asn 


Lys 


Leu 


Glu 


Phe 


Ser 


Thr 


Tyr 


Gly Asn 


Lys Val 


Thr 


Val 










85 










90 








95 




Ser Gly Gly 


Tyr 


Val 


His 


Thr 


Glu 


Tyr 


Arg 


Tyr 


Leu 


Leu His 


His 


Val 








100 










105 






110 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



285/490 



30 



35 



50 



55 



60 



65 



70 



75 



Phe 


Glu 


Val 


Tyr 


Pro 


Tyr 


Val 


Glu 


Ser 


Gin 


Trp 


Ala 


Glu 


Ser 


Arg 


Cli- 






115 










120 










125 








lleL 


Lye 


Tyr 


Lys 


V;\l 


Ser 


Thr 


Gly Leu 


Gin 


Ser 


Arg 


Tyr 


Arg 


Leu 


val 




130 










135 










140 










Asn 


Ser 


Asp 


Asn 


Cys 


Leu 


Met 


Phe 


Ala 


Thr 


Leu 


Gly 


Val 


Phe 


Phe 


Glu 


145 










150 










1SS 










160 


Phe 


Glu 


Lys 


Trp 


Glu 


Gin 


Pro 


Ala 


Thr 


Ser 


Leu 


Phe 


Ala 


Gly Thr 


Tyr 










165 










170 










175 




Ala 


Tyr 


Ser 


Acq 


Ser 


He 


Lys 


Ser 


His 


Leu 


Ser 


He 


Ser 


Phe 


Arg 


Hi;? 








180 










185 










190 






Arg 


Leu 


Gly 


Glu 


His 


Trp 


Glu 


Phe 


Thr 


Thr 


Thr 


Ala 


He 


His 


Gin 


Gly 






195 










200 










205 








Lys 


Pro 


Asp 


Ser 


Tyr 


Phe 


Lys 


Lys 


Ala 


Arg 


Phe 


Gly 


Gly 


Ala 


He 


Asp 




210 










215 










220 










Leu 


Lys 


Tyr 


Ht3 


He 


Thr 


Pro 


Thr 


He 


Gly 


He 


Arg 


Gly 


Ala 


Tyr 


Arg 


225 










230 










235 










240 


He 


He 


Tyr 


Asp 


Thr 


Ala 


Pro 


He 


Val 


Pro 


Val 


Arg 


Lys 


Asp 


Tyr 


Asn 










245 










250 










255 




Thr 


Val 


Asp 


Val 


Gly He Asp 


He 


Ser 


Phe 




















260 










265 
















(2) 


INFORMATION 


FOR 


SEQ 


ID MO: 355 

















10 



15 



20 



U) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 907 amino acids 

IB) TYPE: amino acid 
<D) TOPOLOGY: linear 



<ii) MOLECULE TYPE; protein 

(Hi) HYPOTHETICAL; YES 

(Vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



Ux) FEATURE: 

(A) NAME/ KEY: misc_feature 
(Bl LOCATION 1...907 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 



45 



Thr 


Tie 


Cys 


Val 


Ala 


Phe 


Leu 


Ser 


Ala 


Pro 


Val 


Ala 


Ala 


Leu 


Axg 


GlU 


1 








5 










10 










15 




Ser 


Pro 


Pro 


Met 


Gly 


Ala 


Glu 


Arg 


Lys 


Thr 


Pro 


Ser 


Leu 


Leu 


Pro 


Leu 








20 










25 










30 






Leu 


Phe 


Glu 


His 


Ser 


Asp 


Lys 


Gly 


Lys 


Gly 


Phe 


Asp 


Tyr 


Arg 


Leu 


Phe 






35 










40 










45 








Thr 


Ser 


Asn 


Lys 


Leu 


Lys 


Val 


Phe 


Ala 


Thr 


Gly 


Asn 


Ser 


Arg 


Tyr 


He 




SO 










55 










60 










His 


Asn 


Lys 


Pro 


Thr 


He 


He 


Gin 


Ala 


Met 


Lys 


Arg 


Xle 


Val 


Leu 


Ser 


6b 










70 










75 










80 


Ser 


Phe 


Leu 


Phe 


Val 


Leu 


Ser 


He 


Leu 


Ser 


Leu 


Met 


Ala 


Gin 


Asn 


Asn 










85 










90 










95 




Thr 


Leu 


Asp 


Val 


His 


He 


Ser 


Gly 


Thr 


Xle 


Lys 


Asp 


Ala 


Ser 


Ser 


Gly 








1O0 










105 










110 






Glu 


Pro 


Val 


Pro 


Tyr 


Ala 


Thr 


Val 


Ser 


He 


Arg 


Leu 


Thr 


Gly 


Ala 


Asp 






115 










120 










125 








Thr 


Thr 


Gin 


Val 


Phe 


Arg 


Gin 


Val 


Thr 


Asp 


Gly 


Asn 


Gly 


Tyr 


Phe 


Val 




130 










135 










140 










He 


Gly 


Leu 


Pro 


Ala 


Ala 


Pro 


Ser 


Tyr 


His 


Leu 


Thr 


Ala 


Ser 


Phe 


Val 


145 










150 










155 










ISO 


Gly Met 


Lys 


Thr 


His 


Thr 


Met 


Gin 


He 


Ser 


Arg 


Gly 


Asn 


Gly Gin 


His 










165 










170 










175 




Asp 


He 


Ly3 


Ser 


He 


Asp 


He 


Ser 


Leu 


Glu 


Ser 


Glu 


Asp 


Lys 


Gin 


Leu 








180 










185 










ISO 






Ser 


Thr 


Val 


Thr 


Val 


Ser 


Ala 


Ala 


Arg 


Pro 


Leu 


Val 


Lys 


Met 


Glu 


lie 






195 










200 










205 








Asp Arg 


Leu 


Ser 


Tyr 


Asn 


Met 


Lys 


Asp 


Asp 


Pro 


Ala 


Ala 


Lys Thr 


Asn 




210 










215 










220 










Asn 


Leu 


Leu 


Glu 


Met 


Leu 


Arg 


Asn 


Val 


Pro 


Leu 


Val 


Thr 


Val 


Asp 


Gly 


225 










230 










235 








240 


Gin Gly 


Asn 


He 


Gin 


Val 


Lys 


Gly 


Ser 


Ser 


Asn 


Phe 


Lys 


He 


His 


Leu 










245 










250 










255 




Asn 


Gly 


Arg 


Pro 


Ser 


Thr 


Met 


Val 


Ser 


Ser 


Asn 


Pro 


Lys 


Glu 


Val 


Phe 








260 










265 










270 







SUBSTITUTE SHEET (Rule 26) (RCVAU) 



WO 99/29870 



PCT/AU98/01W3 



286/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Arg 


Ser 


lie 


Pro 


Ala 


His 


Thr 


lie 


Lys 


Arg 


Val 


Glu 


Val 


He 


Thr 


Asp 






275 










280 










285 








Pro 


Gly 


Val 


Lys 


Tyr 


Asp 


Ala 


Glu 


Gly Thr 


Ser 


Ala 


Tie 


Leu 


Asp 


He 




290 










295 










300 










Val 


Thr 


Glu 


Glu 


Gly 


Lys 


Lys 


Leu 


Glu 


Gly Tyr 


Ser 


Gly 


Ser 


He 


Thr 


305 










310 










315 










320 


Ala 


Ser 


Val 


Ser 


Asn 


Asn 


Pro 


Thr 


Ala 


Asn 


Gly 


Ser 


Tie 


Phe 


Leu 


Thr 










325 










330 










335 




Ala 


Lys 


Ser 


Gly 


Lys 


Val 


Gly 


Leu 


Thr 


Thr 


Asn 


Tyr 


Asn 


Tyr Tyr Gly 








340 










345 










350 






Gly 


Lys 


Asn 


Lys 


Gly Ser Arg 


Tyr 


Phe 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


Met 






35b 










360 










365 








Leu 


Gin 


Thr 


lie 


Glu 


Glu 


Gly 


Lys 


Gly Gin 


Glu 


Thr 


Phe 


Gly Cly 


His 




37 0 










375 










380 










Phe 


Gly 


Asn 


Ala 


Leu 


L-u 


Ser 


Phe 


Glu 


lie 


Asp 


Ser 


Leu 


i\sn 


Leu 


Phe 


385 










390 










395 










400 


Thr 


Val 


Gly 


Gly 


Asn 


Val 


Arg 


Leu 


Trp 


Glu 


Het 


Thr 


Thr 


Asp 


Arg 


Asn 










4 05 










410 










415 




Ser 


Val 


Glu 


Lye 


Scr 


Phe 


Ala 


Gly 


Ser 


Asn 


Lou 


Het 


Ser 


Tyr 


He 


Asp 








420 










425 










430 






Arg 


Lys 


L-vt 


Lys 


Thr 


Gin 


Met 


Asp 


Ala 


Gly 


Ser 


Tyr 


Glu 


L*u 


Asn 


Al a 






435 










440 










445 








Asp 


Tyr 


Gin 


His 


Ser 


Thr 


Arg 


Leu 


Pro 


Gly 


Glu 


Leu 


Leu 


Thr 


Val 


Ser 




450 










455 










460 










Tyr 


Arg 


Phe 


Thr 


His 


Asn 


Pro 


Asn 


Asn 


Scr 


Glu 


Thr 


Phe 


He 


Asp 


Gin 


466 










470 










475 










480 


Trp 


Lys 


Arg 


Asp 


Pro 


Leu 


Asn 


Thr 


Ala 


Asn 


Thr 


He 


Gin 


Tyr 


Ala 


Gly 










485 










490 










4 95 




Gin 


His 


Ser 


Lys 


Ser 


Asp 


Ala 


Gly Met 


Asp 


Glu 


His 


Thr 


Ala 


Gin 


VaL 








500 










505 










510 






Asp 


Tyr 


Thr 


Arg 


Pro 


Leu 


Gly 


Gin 


Ala 


His 


Ser 


Leu 


Glu 


Ala 


Gly 


Leu 






515 










520 










52 5 






Lys 


Tyr 


lie 


Tyr 


Arg 


His 


Ala 


Thr 


Ser 


Asp 


Pro 


Leu 


Tyr 


GlU 


He 


Arg 




530 










535 










540 






Pro 


Ser 


Gl u 


Asp 


Ala 


Pro 


Trp 


Gin 


Pro 


Gly 


Ser 


Leu 


Tyr Ala 


Gin 


Asn 


545 










550 










555 










560 


Pro 


Ser 


Asn 


Gly 


Lys 


Phe 


Arg 


His 


Asp 


Gln 


Tyr 


He 


Gly Ala 


Ala 


Tyr 










565 










57 O 










575 




Ala 


Gly 


Tyr 


Asn 


Tyr 


Arg 


Lys 


Asp 


Gin 


Tyr 


Ser 


Leu 


GXn 


Thr 


Gly Leu 








580 










585 










590 






Arg 


Val 


Glu 


Ser 


Ser Arg 


Leu 


Lys 


Ala 


Leu 


Phe 


Pro 


Glu 


Asn 


Ala 


Ala 






595 










600 










605 








Ala 


Asp 


Phe 


Ser 


His 


Asn 


Ser 


Phe 


Asp 


Trp 


Val 


Pro 


Gin 


Leu 


Thr 


Leu 




610 










615 






620 










Gly 


Tyr 


Thr 


Pro 


Ser 


Pro 


Met 


Lys 


Gin 


Leu 


Lys 


Leu 


Ala 


Tyr 


Asn 


Phs 


€25 










630 










635 










640 


Arg 


lis 


Gin 


Arg 


Pro 


Ala 


He 


Gly 


Gin 


Leu 


Aan 


Pro 


Tyr 


Arg 


Leu 


Gin 










645 










650 










555 




Thr 


As n 


Asp 


Tyr 


Gin 


Vdl 


Gin 


Tyr 


Gly 


Asn 


Pro 


Asp 


Leu 


Lys 


Ser 


Glu 








660 










665 










67 0 






Lis 


Arg 


His 


His 


Val 


Gly Leu 


Ser Tyr 


Asn 


Gin 


Tyr 


Gly Ala 


Lys 


Val 






675 










680 










685 








Het 


Leu 


Thr 


Ala 


Ser 


Leu 


Asp Tyr Asp 


Phe 


Cys 


Asn 


Asn 


Ala 


He 


Gin 




690 










695 










7 00 










Asn 


Tyr 


Thr 


Phe 


Ser 


Asp 


Pro 


Ala 


Asn 


Pro 


Asn 


Leu 


Phe 


His 


Gin 


Thr 


705 










710 










715 










720 


Tyr 


Gly 


Asn 


lie 


Gly Arg 


Glu 


His 


Ser 


Phe 


Ser 


Leu 


Asn 


Thr 


Tyr 


Ala 










725 










730 










735 




HeL 


Tyr 


Thr 


Pro 


Ala 


Val 


Trp Val 


Arg 


lie 


Met 


Leu 


Asn 


Gly Asn 


He 








740 










745 










750 






Asp 


Arg 


Thr 


Phe 


Gin 


Lys 


Ser 


Glu 


Ala 


Leu 


Gly 


lie 


Asp 


Val 


Asn 


Ser 






755 










760 










765 








Trp 


S«r 


Gly 


Het 


Val 


Tyr 


Ser 


Gly 


Leu 


Met 


Phe 


Thr 


Leu 


Pro 


Lys 


Asp 




770 










775 










780 








Trp 


Thr 


Val 


Asn 


Lou 


Phe 


Gly Gly Tyr 


Tyr 


His 


Gly 


Gly 


Arg 


Ser 


Tyr 


765 










790 










795 










800 


Gin 


Thr 


Lys 


Tyr Asp Gly Asn 


Val 


Phe 


Asn 


Asn 


He 


Gly 


He 


Ala 


Lys 










805 










810 










815 




Gin 


Leu 


Phe 


Asp 


Lys 


Lys 


Leu 


Arg 


Val 


Ser 


Leu 


Ser 


Ala 


Asn 


Asn 


lie 








820 










825 










830 






His 


Ala 


Lys 


Tyr 


Ser 


Thr 


Trp 


Lys 


Ser 


Arg 


Thr 


He 


Gly 


Asn Gly 


Phe 






635 










840 










845 








Thr 


lie 


Tyr 


Ser 


Glu 


Asn 


Ala 


Gly 


lie 


Gin 


Arg 


Ser 


Val 


Ser 


Leu 


Ser 




850 










055 








060 










Leu 


Thr 


Tyr 


Ser 


Phe 


Gly 


Lys 


Het 


Asn 


Thr 


Gin 


Val 


Arg 


Lys 


Val 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



287/490 



365 870 875 880 

Arg Thr lie Val Asn Asp Asp Leu Lys Gin The S*r Scr Gin Gly Gin 

885 990 395 

Gin Gly Gly Gly Gin Gly Asn Pro Thr Gly Asn 
900 905 



(2) INFOR11ATION FOR SEQ ID HO: 356 

(1) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(iil HOLECULE TYPE: protein 

till) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY : miscjeature 

(B) LOCATION l. . . 4 50 



(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:356 



Tr P 


Cys 


Gin 


Ser 


Asp 

c 


Pro 


He 


Pro 


His 


Cys 


His 


Ala 


Val 


Tyr 


Arq 


Asn 


















10 










15 




Ala 


Gin 


Cly 


His 

20 


Gin 


Gin 


Gly 


Arg 


Ser 
25 


Asp 


Gin 


Asp 


Arg 


Leu 
30 


Pro 


Pro 


Val 


Arg 


Gly 
35 


Tyr 


Arg 


Tyr 


Ala 


Arg 
40 


Cys 


Leu 


Gly 


Arg 


Tyr 
45 


Glu 


Gly 


Phe 


Gly 


Glu 
50 


Arg 


net 


Arg 


Leu 


Ser 
55 


Ala 


He 


Leu 


Ile 


Ala 
60 


Leu 


He 


Val 


Met 


Leu 


Pro 


Ala 


Val 


Leu 


Ser 


Gly 


Gin 


His 


Tyr 


Tyr 


Ser 


Met 


Ala 


Gly Glu 


65 










70 










75 










00 


Arg 


Leu 


Glu 


Thr 


Asp 
OS 


Ser 


He 


Arg 


Pro 


Asn 
90 


Glu 


Leu 


Ser 


Ala 


Scr 
95 


He 


Arg 


Ser 


Ala 


Leu 
100 


<Phe 


Phe 


Arg 


Asn 


Asn. 
105 


Glu 


Tyr 


Asn 


Ala 


Arg 
110 


Ser 


Val 


Lys 


Gly 


Tyr 
13 5 


Thr 


Leu 


Pro 


Gly 


Ala 
120 


Arg 


Val 


Ser 


Ala 


Phe 
125 


Ala 


Ser 


Tyr 


Ser 


Leu 


Pro 


Ala 


Ala 


His 


Gly 


Val 


Lys 


Leu 


Ser 


Leu 


Gly Val 


Ser 


Thr 




130 










135 










140 










Leu 


Asn 


Tyr 


Trp 


Gly 


Ala 


Ser 


Arg 


Tyr 


Fro 


Ala 


Gly 


He 


Ala 


Tyr 


Ser 


115 










150 










155 










160 


Asp 


Leu 


Pro 


Tyr 


Trp 
165 


Thr 


Asp 


Tyr 


Asn 


Asp 

170 


Tyr 


Val 


Arg 


Leu 


Arg 
175 


lis 


Leu 


Pro 


Tyr 


Val 
180 


Gin 


Ala 


Met 


Leu 


Lys 
185 


Pro 


Thr 


Ala 


Thr 


Thr 
190 


Ala 


Leu 


Met 


Leu 


Gly 
195 


Asn 


He 


Ala 


Gly 


Gly 
200 


Thr 


Ala 


Mis 


Gil- 


Leu 
205 


He 


Glu 


Pro 


He Tyc 


Asn 


Pro 


Glu 


Leu 


Asp 


I^eu 


Thr 


Ala 


Asp 


Pro 


Glu 


Ala 


Gly 


Vai 




210 










215 










220 










Gin 


Phe 


Arg 


Gly 


Asp 


Trp 


Thr 


Arg 


Phe 


Axg 


Met 


Arp 


Val 


Trp 


Val 


Asn 


225 










230 










235 










240 


Trp Met 


Ser 


Met 


lie 


Phe 


Lys 


Asn 


Asp 


Asn 


His 


Gin 


Glu 


Ser 


Phe 


Val 










245 










250 










255 




Phe 


Gly 


Leu 


Ser 


Thr 


Thr 


Ser 


Lys 


Leti 


Leu 


Ser 


Gly 


Glu Gly 


Lys 


Trp 








260 










265 










270 






Arq 


Leu 


Glu 


Leu 


Pro 


Leu 


Gin 


Ala 


He 


Ala 


Thr 


His 


Arg 


Gly 


Gly 


Glu 






275 










260 










285 






Tyt 


Am 


Trp 


Ala 


Gin 


Gin 


Asp 


Thr 


Val 


His 


Thr 


Trp 


Val 


Asn 


Gly Ala 




290 










295 










300 










Val 


Gly 


Leu 


Lys 


Leu 


Ser 


Tyr 


Arg 


Pro 


Arg 


Thr 


Asp 


Lys 


Pro 


Met 


Gin 


305 










310 










315 










320 


He 


Trp 


Gly 


Ser 


Ala 


Tyr 


Gly 


Val 


Ala 


Ala 


Leu 


Ser 


Ser Gly Gly Tyr 










325 










330 










335 




Phe 


Pro 


Tyr 


Glu 


Arg 


Gly 


Trp 


Gly 


Gly 


Tyr 


Leu 


Ser 


Leu 


Gly Met 


Asp 








340 










345 










350 






Leu 


Glu 


His 


Phe 


Ala 


Phe 


Arg 


Thr 


Asp 


Tyr 


Trp 


Tyr 


Gly Arg 


His 


Tyr 






355 










360 










365 








Val 


Ser 


Pro 


Phe 


Ala 


Ala 


Pro 


Phe 


Ala 


Asn 


Set" 


Leu 


Thr Tyr Asp Lys 




37 0 










375 










380 










Gin 


Pro 


Leu 


,Thr 


Asn 


Gly 


Trp 


Gly 


Asp 


Tyr 


He 


Arg 


Leu 


Tyr 


Ala 


Asp 



SUBST ITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



288/ 490 



385 390 395 400 

Tyr Ser Trp Arg Met Ma Arg Ssr Val Ser Leu Ala Ala Val Ala Arg 

405 410 415 

Val Trp Phe Gin Pro Ser Asp Arg Phe Ala list Ser His Ala Leu Glu 

420 425 430 

Leu Thr Met Arg lie Asp Pro Lys Phe Pro lie Ala Phe Leu Lys Gl/ 
435 440 445 

Asn His 
450 



(?) INFORMATION FOR SEQ ID NO: 357 

U) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 447 amino acids 
(DJ TYPE: amino acid 
( D J TOPOLOGY; linear 

111) MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A> ORGANISM: Porphyromonas qinqivalis 

lix) FEATURE: 

(A) NAME/KEY: miacjsature 

(B) LOCATION 1. . .447 



Ui) SEQUENCE DESCRIPTION; SEQ ID NO: 357 



Pro 


Phe 


Ser 


Pro 


Pro 


Trp 


Val 


Cys 


Ala 


Gly 


Ala 


Ala 


Ser 


Ser 


Val 


Arg 


1 








5 










10 










15 




Thr 


Arg 


Pro 


Cys 


Ser 


Thr 


Val 


Ser 


Ser 


Thr 


Ala 


Ala 


Ser 


Ser 


His 


Arg 








20 










25 










30 






Arg 


Ser 


Glu 


His 


Met 


Arg 


Ser 


Ala 


Asp 


Ser 


Ser 


Pro 


Ala 


Tyr Ser 


Pro 






3S 










40 










45 








lie 


Gly 


Ser 


Lou 


Ser 


ser 


Ser 


His 


Ser 


Phe 


Asp 


Ser 


He 


Pro 


Asp 


Gin 




SO 










55 










60 










Arg 


Met 


Asn 


Lys 


Ser 


Leu 


Leu 


Ser 


Leu 


Ala 


Cys 


Leu 


He 


Leu 


Cys 


Gly 


65 










70 










75 










80 


J let 


Pro 


Ala 


He 


Ala 


Gin 


Gin 


Thr 


Gly 


Pro 


Ala 


Glu 


Arg 


Ser 


Gly 


GlU 










85 










90 










95 




Pro 


Ser 


Leu 


Ala 


Glu 


Arg 


Val 


Phe 


Gly 


Leu 


Glu 


Gin 


Lys 


Gin 


Lys 


Lys 








lOO 










105 










110 






Leu 


Lys 


Val 


Tyr 


Leu 


Gly 


He 


Gin 


Ser 


Phe 


Tyr 


Asp 


Gin 


Pro 


Leu 


Val 






115 










120 










125 








Asp Asp 


Glu 


Ser 


His 


He 


Gly 


His 


Phe 


Lys 


Val 


Gin 


Glu 


Leu 


Arg 


Het 




130 










135 










140 










Ser 


Ala 


Hie 


Gly 


Glu 


Leu 


Asn 


Arg 


His 


Leu 


Ser 


Phe 


Asp 


Trp 


Arg 


Gin 


145 










L50 










155 










160 


Arg 


Leu 


Asn 


Arg 


Ala 


Ala 


Asp 


Gly 


Thr 


Ser 


Phe 


Ala 


Asp 


Asn 


Leu 


Ser 










165 










170 










175 




Asn 


Ala 


lis 


Asp 


He 


Ala 


Gly 


Val 


Asp 


Trp 


His 


Pro 


Asn 


Asp 


Lys 


Val 








180 










185 










190 






Ser 


Phe 


Phe 


Phe 


Gly 


Arg 


Gin 


Tyr 


Ala 


Arg 


Phe 


Gly 


Gly 


lie 


Glu 


Tyr 






195 










200 










205 








Asp 


Het 


Asn 


Pro 


Val 


Glu 


He 


Tyr 


Gin 


Tyr 


Ser 


Asp 


Leu 


Val 


Asp 


Tyr 




210 










215 










220 










Met 


Thr 


Cys 


Tyr 


Thr 


Scr 


Gly 


Val 


Asn 


Phe 


Ala 


Trp 


Asn 


Phe 


His 


Pro 


225 










230 










235 










240 


Glu 


Gin 


Gin 


Leu 


Gin 


Leu 


Gin 


Val 


Leu 


Asn 


Ala 


Tyr 


Asn 


Asn 


Arg 


Phe 










245 










250 










2S5 




Ala 


Asp 


Arg 


Tyr 


His 


Val 


Thr 


Pro 


Asp 


Val 


Ala 


Thr 


Ala 


Thr 


Ser 


Tyr 








260 










265 










270 






Pro 


Leu 


Leu 


Tyr 


Ser 


Ala 


Gin 


Trp 


Asn 


Gly 


Thr 


Leu 


Leu 


Gly 


Gly 


Ala 






275 










280 










285 








Leu 


Hi* 


Met 


Arg 


Tyr 


Ala 


Val 


Ser 


Met 


Ala 


His 


Gin 


Ala 


Gin 


Glu 


Arg 




290 










295 










300 










Asn 


Met 


Trp 


Tyr 


Phe 


Thr 


Ala 


Gly 


Asn 


Leu- 


Phe 


Asn 


Pro 


Gly Lys 


Arg 


305 










310 










315 










320 


He 


Asn 


Gly 


Tyr 


Leu 


Asp 


Leu 


Thr 


Tyr 


Ser 


He 


Glu 


Gly 


Leu Asp 


Asp 










325 








330 










335 




Lys 


Gly 


He 


Met 


Thr 


Ala 


Arg 


Tyr 


Gly 


Lys 


Gly 


Lys 


Thr 


Leu 


Thr 


Asp 








340 










345 










350 






Val 


Lys 


Tyr 


Tyr 


Ala 


Leu 


Val 


Sec 




Trp 


Asn 


Phe 


Arg 


He 


Pho 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



289/490 



355 360 36b 

Gin Val Asn Leu Phe Leu Lys Gly Met Tyr Glu Am Gly Tyr Ala Pro 

370 375 380 

Ala Gin Tyr Gly Glu Ser Ser His The Ar<g His Ser Tyr Gly Tyr Met 
385 390 395 400 

Gly Gly Val Glu Tyr Tyr Pro Thr Glu Thr Acn Phe Acq Leu Phe Val 

405 410 415 

Thr Tyr lie Gly Arg His Tyr Arg Tyr ser Ala Thr Glu Thr Glu Ser 

420 425 430 

Thr Asn Al« Leu Arg A] a Gly Leu lie Tyr Glt\ lie Pro Phe Leu 
435 440 445 

(21 INFORtlATION FOR SEQ ID WO:358 

(j} SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 227 amino acids 
fB> TYPE: amino acid 
(U) TOPOLOGY; linear 

Ui> J 10LECULE TYPE: protein 

(iii> HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonac gingivalis 

fix) FEATURE: 

(AV HAUE/KEY: mlccjeature 
(S) LOCATION 1...227 

(xl) SEQUENCE DESCRIPTION: SEQ ID MO: 358 



Tyr 


Lys 


Gin 


Ala 


He 


Met 


Lys 


His 


Leu 


Phe 


Lys 


Ser 


Thr 


Leu 


Val 


Leu 


1 








5 










10 










15 




Lsu 


Cys 


Ala 


Leu 


Ser 


Phe 


Ser 


Gly 


Thr Tyr 


Thr 


Phe 


Ma 


Gin 


Glu 


Asn 








20 










25 










30 






Asn 


Thr 


Glu 


Lys 


Ser 


Arg 


Phe 


Asp 


Phe 


Ser 


Val 


Arg 


Leu 


Gly 


Gin 


Gly 






35 










40 










45 








Tyr 


lie 


Ala 


Gly 


Ser 


Thr 


Thr 


Asn 


Leu 


Met 


Tyr Gly 


Tyr 


Thr 


Ser 


Ala 




So 










55 










60 








Asn 


Asp Arg 


Leu 


Leu 


Ser 


Gly 


Ala 


He 


Tyr 


Leu 


Gly 


Leu 


Thr 


Pro 


Ser 


65 










70 










75 










80 


Lys 


Lys 


Glu 


Asn 


Ala 


Thr 


Gly 


Val 


Ala 


Phe 


Arg 


Phe 


Leu 


Ser 


Pro 


Ser 










85 










90 










95 




Pro 


Gly 


Tyr 


Tyr 


Val 


Asp 


He 


Ser 


Gly 


Lys 


Glu 


Asn 


Thr 


Leu 


Asn 


Tyr 








100 










105 










HO 






Ala 


Phe 


Tyr 


Val 


Val 


Gly 


Ala 


Tyr 


Asn 


Arg 


He 


Ala 


He 


Pro 


lie 


Arg 






115 










120 










125 








Pro 


lie 


Lys 


Asn 


Phe 


Asn 


Phe 


He 


Phe 


Ser 


Thr 


Glu 


Val 


Gly Met 


Ala 




130 










135 










140 










Trp 


net 


Ser 


Arg 


His 


Glu 


Gin 


He 


Tyr 


Asn 


S-r 


Thr 


Ser 


Gin 


Thr 


Trp 


145 










150 








155 










160 


Asp 


Lys 


Gin 


Arg 


Lys 


Ser 


Arg 


Ser 


Gly 


Leu 


Asp 


Phe 


Gly Leu 


Gly 


Met 










165 










170 










175 




His 


Leu 


Gin 


Unk 


His 


He 


Asn 


Lys 


Thr 


Val 


Tyr 


Phe 


Met 


Ala 


Gly 


Thr 








180 










185 










190 






Asp 


Leu 


Thr 


Ser 


Cys 


Met 


Phe 


Gly 


Lys 


Arg 


He 


Asn 


Asp Tyr 


Gin 


Gin 






195 










200 










205 








Lys 


Asp Arg 


Thr 


Phe 


He 


Ala 


Leu 


He Asp 


Asn 


Ser 


He 


Gly 


He 


Gly 




210 










215 










220 










Leu 


Asn 


Leu 




























225 
































(2) 


INFORMATION 


FOR 


SEQ 


ID HO: 353 

















(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 06 amino acid? 
IB) TYFEi amino acid 
(D> TOPOLOGY: linear 

(it) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



290/ 490 



(AJ ORGANISM: Po rphyromonas gingivalis 

Ux) FEATURE: 

(A) NAME/ KEY : miscjeature 
13) LOCATION 1 . . - 406 



|xi> SEQUENCE DESCRIPTION: SEQ ID NO: 35 9 



He 


Phe 


He 


Asp 




Asp 


Lys 


Asn 


Thr 


Lys 




Asn 


Glu 


Aig 


Asn 


Met 


1 


















10 










15 




He 


He 


Lys 


T 


Met 


Leu 


Lys 


Asn 


Lys 


Leu 


Ala 


Pro 


Leu 


Ala 


He 


Leu 








20 










25 










30 






Phe 


Leu 




Ala 


Pro 


Lys 


Ala 


Met 


Lys 


Ala 


Gin 


Glu 


Gin 


Leu 


Asm 


Val 
















40 








45 








Val 


Has 


Thr 




Val 


Pro 


Ser 


Leu 


Asn 


He 


Scr 


Pro 


Asp Ala 


Arg 


Al a 




SO 










55 










60 










Ala 


Gly 




Gly 


Asp 


He 


G3 y 


Val 


Ala 


Thr 


Thr 


Pro 


Asp 


Ala 


Tyr 


Ser 


65 










70 








75 










80 


Gin 


Tyr 


Trp 


Asn 




Ser 


Lys 


Tyr 


Ala 


Phe 


Met 


Asp 


The 


Lys 


Ala 


Gly 




















90 










95 




He 


Ser 






Tyr 


Thr 


Pro 


Trp 


Leu 


Ssr 


Lys 


Leu 


Val 


Am 


Asp 


He 








100 










105 










110 






Ala 


Leu 


Met 


Gin 


et 


Thr 


Gly 


Phe 


Tyr 


Lys 


Leu 


Gl^' 


Thr 


Asp 


Glu 


Asn 






115 










120 








125 








Gin 


Ala 


He 


Ser 


Ala 


er 




Arg 


Tyr 


Phe 


Thr 


Leu 


Gly 


Lys 


Leu 






130 










135 










14 0 










Thr 


Pha 


A?p 


Glu 


Leu 


UJ -I 


Glu 


Scr 


Met 


Gly 


Glu 


Al a 


His 


Pro 


Aim 


Glu 


145 










150 










155 










160 


Phe 


Ala 


Val 


Asp 


Leu 


Gly 


Tyr 


Ser 


Arg 


Gln 


Leu 


Ser 


Glu 


Asn 


Phe 


Ser 










165 










no 










175 




llet 


Ala 


Val 


Ala 


Leu 


Arg 


Tyr 


He 


Arg 


Ser 


Asp 


Gin 


Ser 


Thr 


His 


Asn 








180 










185 










190 






Thr 


Gly 


Glu 


Asn 


Gin 


Ala 


Gly 


Asn 


Ala 


Phe 


Ala 


Ala 


Asp 


He 


Ala 


Gly 






195 










200 










205 








Tyr 


Leu 


Gin 


Lys 


Tyr 


Val 


Leu 


Leu 


Gly 


Asn 


Ala 


Glu 


Ser 


Leu 


Trp 


Ser 




210 










215 










220 










Leu 


Gly 


Phe 


Asn 


Val 


Lys 


Asn 


He 


Gly Thr 


Lys 


Ha 


Ser 


Tyr 


Asp 


Gly 


225 










230 










235 










240 


Gly Val 


Thr 


Ser 


Fhe 


Phe 


He 


Pro 


Thr 


Ser 


Leu 


Asn 


Leu 


Gly Thr 


Gly 










245 










250 










255 




Leu 


Leu 


Tyr 


Pro 


He 


Asp 


Asp 


Tyr 


Asn 


Ser 


He 


Asn 


Phe 


Asn 


Leu 


GlU 








260 










265 










270 






Leu 


Ser 


Lys 


Leu 


Leu 


Val 


Pro 


Thr 


Pro 


Pro 


He 


Met 


Asp 


Gin 


Asn 


Asp 
















280 










285 








Gin 


Ala 


Gly 


Tyr 


Glu 


Ala 


Ala 


Leu 


Lys 


Lys 


Tyr 


Gin 


Glu 


Thr 


Ser 


Ser 




290 










295 










300 










He 


Ser 


Gl y 


He 


Phe 


Ser 


Ser 


Phe 


Gly Asp Ala 


Pro 


Gly Gly 


Leu 


Lys 


305 










310 










315 










320 


Glu 


Glu 


Phe 


Arg 


Glu 


He 


Thr 


Trp 


Gly 


Leu 


Gly 


Ala 


Glu 


Tyr 


Ser 


Tyr 










325 










330 










335 




Asp Acp 


Lys 


Phe 


Phe 


Val 


Arg 


Ala 


Gly Tyr 


Ser 


Tyr 


Leu 


His 


Pro 


Thr 








340 










345 










350 






Lys 


Gly 


Asn 


Leu 


Gin 


Tyr 


Phe 


The 


Ala 


Gly Ala 


Gly 


Phe 


Lys 


Met 


Asn 






35b 










360 










365 








He 


Phe 


Arg 


He 


Asp 


Ala 


Ser 


Tyr 


Leu 


Leu 


Ser 


Thr 


He 


Gin 


Ser 


Asn 




370 










375 










380 










Pro 


Leu 


Asp 


Gin 


Thr 


Leu 


Arg 


Phe 


Thr 


Leu 


Ala 


Phe 


Asp 


Met 


Asp 


Gly 


385 










390 










395 










4 00 


Leu 


Arg 


Asn 


Leu 


Phe 


Hie 























405 



(2) INFORMATION FOR SEQ ID HO: 360 

tl) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 452 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

<%i) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyrons* nas gingivalis 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



291/ 490 



(ix) FEATURE: 

(A) IIAME/KEY: misc_featurs 
IB) LOCATION 1 . . . 452 

5 txi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 

Leu Tie lie Glu Lys Glu Met Lys Thr Thr Val Gin Gin lie lie Leu 
IS 10 15 

Cys Leu Ala Leu Met Met Ser Gly Val Leu Gly Gly Aan Ala Gin Ser 
10 20 2b 30 

Phe Trp Glu Glu lie Ala Pro Pro Phe lie Ser Asn Glu Pro Asn Val 

35 40 45 

Lys Tyr lie He Pro Asn Met Gly lie Asp Ser Lys Gly Thr lie Tyr 
50 55 60 

15 Val Thr Val Thr Lys Arg lie Gin Gin Gly Ala Aen Tyr Thr Ser Glu 
65 70 75 80 

Gin Leu Gly Met Tyr Tyr Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp 

85 90 95 

Lys His Asp Pro Tyr Phe Asp Asp Lys lie Val Ala Asp He Gin Thr 
20 100 105 HO 

Asp Ala Tyr Gly Arg Val Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin 

115 120 125 

Glu Tyr Gin Leu Tyr lie Aan Glu Gin Asn Glu Trp Arg Cys He Phe 
130 135 140 

25 Lys Thr Ser Val Ser Thr Tyr Glu His Gly Met Ala Val Phe Arg Ser 
145 150 155 160 

Ser Thr Gly Val Thr Tyr lie Gly Thr Arg His His Tie Phe Ala Ser 

16b 170 175 

Gly Val Asn Asp Phe Glu Phe Asn Thr Tl« Tyr Glu Asp Ser Thr Pro 
30 180 185 190 

Met Ser cys Arg Phe Ala Glu Ala Thr Asn Ser Gly Thr He Tyr Leu 

195 200 205 

Ala Leu Met His Clu Thr Thr Met Ser Thr Thr He Leu Thr Tyr Gin 
Z10 215 220 

35 Asn Gly Glu Phe Val Asp Tie Ser Glu Ser Glu Leu Ser hsn Ser He 
225 230 235 240 

He Ala Ser Met Cys Ser Asn Lys Glu Gly Asp He lie Ala Leu Val 

245 250 255 

Thr Ser Tyr Thr Gly Phe Met Ser Gly Thr Leu Ala He Arg Lys Ala 
40 260 265 270 

Asp Glu Gly Lys Trp Gin Leu Val Gly Gly Asp He Gin Asn Ala He 

275 280 205 

Val Gin Asn He Cys Met Met Asp Asp Asn Lys lie Ala Cys Glu Val 
290 295 300 

45 Phe Gly Thr Pro Asn Gly Val Asp Gly Arg Thr Arg Val Cys Val Ser 
305 310 315 " 320 

Asp Ala Ser Val Phe Asp Phe Glu Trp Tyr Glu Asp Glu He Tyr Gly 

325 330 335 

Gly Leu He The Asp Thr Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu 
50 340 345 350 

Tyr Ala Lys Phe Gly Gly He Met Leu Arg Ser Lys Clu Ser Phe He 

3SS 360 365 

Thr Ser Phe He Ser Pro Thr Val Val Gin Gly Val Asp Val Tyr Thr 
370 375 380 

55 Leu Ala Gly Lys He Arg He Glu Ser Glu Thr Pro Val Ser Glu Val 
385 390 395 400 

Leu Leu Phe Asp Leu Ala Gly Arg Met Val Leu Arg Gin Thr He Asp 

405 410 415 

Asn Lys He Tyr Ser A.*p He Asp Thr Asn Gly Leu Lys Arg Ser Gly 
60 420 425 430 

He Tyr Val Val Ser Val Arg Leu Ser Ser Gly Gin Val Phe Ser His 

435 440 445 

Lys Val Gin Val 
450 



65 



(2) I H FORMAT I OM FOR SEQ ID NO: 361 



<ij SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 331 amino acids 
70 (R) TYPE: amino acid 

(D) TOPOLOGY: linear 

(11 ) MOLECULE TYPE: protein 

75 (Hi) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



292/490 



PCT/AU98/01023 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyrons n =is ginqivaiis 

(ix) FEATURE; 

\JK) NAME/ KEY ; misc feature 
<B) LOCATION 1...331 



(Xil SEQUENCE DESCRIPTION; SEQ ID NO: 361 



Gl y Leu 


Tyr 


Gly 


Met 


Ser 


Val 


Val 


Pro 


He 


He 


lie 


Tyr 


Leu 


Cys 


Gly 


I 






5 










10 










15 




He Ser 


Asn 


Tyr 


Ala 


Arg 


Leu 


Met 


lie 


He 


A 


Cys 


Leu 


He 


Arg 


Arg 






20 










25 










30 






Pro Arg 


Thr 


Val 


Leu 


Phe 


Gl v 


Leu 


Tie 


Phe 


Val 


Val 


Gly 


Leu 


Phe 


Scr 




35 








40 










45 








Ala Met 


Ala 


Gin 


Glu 




Lys 


Asp 


Ser 


eu 


Se r 


Thr 


Val 


Gin 


Pro 


Val 


50 










55 










60 










Pro Asn 


Ser 


Ser 


Met 


Val 


Glu 


Gin 


Thr 


Pro 


Leu 


Leu 


Ser 


He 


Asp 


His 


65 








70 










75 










80 


Pro Val 


Leu 


Pro 


Ala 


Ser 


Phe 


Gin 


Asn 


Thr 


Arg 


Thr 


Leu 


Lys 


Arg 


Phe 








85 










90 










95 




Arg Asp 


Ly- 


His 


Leu 


Ser 


Asp 


Ala 


Leu 


Leu 


Asn 


Gly 


Leu 


Lys 


Pro 


His 






10O 




















110 






Arg St=r 


Ser 


Leu 


Gin 


Leu 


Asn 


Glu 


Glu 


Leu 


Asn 


Phe 


Ala 


Ala 


Glu 


Arg 




115 










120 










125 








Arg Arp 


Phe 


Val 


Ser 


Pro 


Leu 


Li?u 


Gin 


Thr 


Arg 


His 


Ala 


Ala 


Gly Val 


130 










135 










140 










Leu Ser 


Trp 


Arg 


Pro 


Thr 


Asp 


Arg 


Met 


His 


Phe 


Tyr 


Thr 


Ser 


Gly Asn 


145 








150 










155 










160 


lie Gly 


Leu 


Gly 


His 


Asp 


Leu 


Leu 


Thr 


Gly 


Val 


Arg 


Lys 


Asp 


Phe 


Gly 








1S5 










170 










175 




Trp Asn 


Ala 


Gly 


Ala 


Asp 


Phe 


Leu 


Leu 


Ser 


Gin 


Asn 


Leu 


Thr 


Ala 


His 






1B0 










185 










190 






Val Gin 


Gly 


Gly 


Trp 


Gin 


Gin 


Asn 


Phe 


Gly 


Phe 


He 


Pro 


Met 


Thr 


Ala 




195 










200 










205 








Val Asn 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Ala 


Thr 


Glu 


Arg 


Leu 


Ser 


Phe 


Thr 


210 










215 










220 










Thr Gly 


He 


Asp 


Tyr 


Arg 


Gin 


Val 


Gin 


Trp 


Asn 


Ala 


Phe 


Asp Asn 


Arg 


225 








230 










235 










240 


Thr Phe 


Ser 


Leu 


Lys 


Gly 


Ser 


Ala 


Arg 


Tyr 


Glu 


Val 


Met 


Asp 


Asn 


Val 








245 










250 










255 




Phr; Val 


Asn 


Gly 


Phe 


Gly 


Ser 


Tyr 


Pro 


Leu 


Tyr 


Ser 


Ser 


Thr 


Arg 


Ser 






260 










255 










270 






Gly Leu 


Asn 


Met 


Ala 


Val 


Pro 


Met 


Hie 


Gly 


Phe 


Gly 


Pro 


Gin Tyr 


Gly 




27 5 










2B0 










285 








Gly Ser 


Leu. 


Glu 


Leu 


Lys 


Val 


Ser 


Glu 


Arg 


Phe 


Gly 


Phe 


Ala 


Val 


Gly 


290 










295 










300 










Met Glu 


Arg 


Glu 


Tyr 


Asn 


He 


Trp 


Thr 


Arg 


Arg 


Trp 


Glu 


Thr 


His 


Tyr 


305 








310 










315 










32 0 


Phe Ala 


Tyr 


Pro 


Val 


Phe 


Tyr 


Gly 


Asp 


Lys 


Lys 


















325 










330 














t 2 } INFORMATION 


FOR 


SEQ 


ID NO:362 

















(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 9 amino acids 
IB) TYPE: amino acid 
<D> TOPOLOGY: linear 



(ill MOLECULE TYPE: protein 

(ill J HYPOTHETICAL: YF.S 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: miscfeature 

(B) LOCATION 1...329 

<xil SEQUENCE DESCRIPTION: SEQ ID HO: 362 

Glu Thr Asn Ser Trp Val Ser Ser Asp Cys Asn Sei Thr Thr Met Lys 
15 10 15 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



293/490 



Thr 


Asn 


Acg 


Arg 


Tyr 


Ala 


Phe 


Val 


Leu 


Pro 


Leu 


Leu 


Leu 


Leu 


Thr Gly 








20 










25 










30 






Leu 


Leu 


Ala 


Trp 


Gly 


Gin 


Asp 


Ser 


Ser 


His 


Gly 


Ser 


Asn 


Thr 


Ala 


Phe 






35 










40 










45 








Ala 


Thr 


Asp 


Ser 


Ser 


Ser 


Arg 


Glu 


Leu 


Pro 


Thr 


Glu 


Gin 


scr 


Ala 


Tyr 




50 










55 










60 










Arg 


lis 


His 


Ser 


Ala 


Tyr 


Met 


Val 


G]y 


Gly 


Gly 


Gly 


Ser 


He 


Thr 


Arg 


65 










70 










75 










80" 


Asp 


Thr 


i y r 


Leu 


Ser 




Leu 


A 


Tyr 


Gly Gly 


i rp 


Thr 


Leu 


Asn 


Leu 










85 










90 










95 




eu 


Gly 


Glu 


Lys 


Thr 


Phe 


Pro 


Leu 


Lys 


Ala 


Ser 


P 


Sex 


Arg 


Trp 


Met 








100 










105 










110 






e 


rg 


Thr 


oxy 


His 


Glu 


Leu 




Phe 


Ala 


Leu 


Mo' 


Asp 


Asn 


Pro 


Ala 






115 










i ->n 










125 










A"n 


Ala 


His 


Phe 


Tyr 


Ser 




Leu 


Tyr 


Asn 


J. j 


Ser 


Ala 


Ala 


Ala 




130 










135 










14 0 










Leu 


Tyr 


Arg 


Leu 


Gly 


Ala 


Lys 


His 


Leu 


Arg 


Ala 


AJ a 


Trp Met 


Asp 


Asn 


145 










150 










155 










160 


Leu 


Arg 


Leu 


Ala 


Phe 


Gly 


Pro 


Gly 


Leu 


Glu 


He 


Gly 


Leu Gly Gly lie 










165 










170 










175 




Tyr 


Her 


The 


Arg 


Asn 


Gly 


Asn 


Asn 


Pro 


Ala 


Thr 


Leu 


Lys 


Leu 


Tyr 


Thr 








180 










185 










190 






Asn 


Ala 


He 


Ala 


Gin 


Ala 


Ser 


He 


Gly Tyr 


Tyr 


Val 


Pro 


Ser 


Gl u 


Thr 






195 










200 










205 








Phe 


Pro 


Leu 


Tyr 


Phe 


Arg 


Leu 


Leu 


Ser 


Gin 


He 


Asn 


Leu 


Phe 


Gly 


He 




210 










215 










220 










Ala 


Tyr 


Gly 


Arn 


Gly 


Phe 


Gly 


Glu 


Ser 


Tyr 


Tyr 


Glu 


Asn 


Phe 


Leu 


Leu 


225 










230 










235 










240 


Arn 


Asn 


Gly 


He 


Ala 


Gly 


Ser 


Leu 


His 


Phe 


Thr 


Tyr 


Pro 


Gly Lys 


Phe 




















250 










255 




Thr 


Arg 


Phe 


Thr 


Thr 


Leu 


He 


Thr 


Ala 


Asp 


He 


Pro 


He 


Arg 


Asn 


Phe 








260 










265 










270 






Cys 


Thr 


Leu 


Arg 


Val 


Gly 


Tyr 


Arg 


Tyr 


Ser 


His 


Leu 


Gly 


Ser 


Ser 


Leu 






275 










280 










285 








Asn 


Ala 


Leu 


Asp 


Thr 


Arg 


He 


His 


Ser 


His 


Thr 


Ala 


Phe 


lie Gly 


Phe 




290 










295 










300 










Val 


Thr 


Glu 


Phe 


Tyr Arg 


Phe 


Arg 


Gly 


Arg 


Lys 


Ala 


Met Asn Thr Gly 


305 










310 










315 










320 


Arg 


Arg 


Thr 


Ser 


Leu 


Tyr 


Tyr 


His Asp 

















325 



\?\ INFORMATION FOR SEQ ID NO: 363 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

<vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

FEATURE: 

(A) NAME/KEY: mi* c_f eature 

(B) LOCATION 1. . .319 



UD SEQUENCE DESCRIPTION: SEQ ID NO: 363 



Phe 


Thr Ser 


Gly 


Thr 


He 


Phe 


Val 


Arg 


He 


Ala 


He 


Ser 


Gly 


Arg 


Val 


1 






5 










10 










15 




Val 


Cys Arg 


Glu 
20 


Val 


Glu 


A&p 


Asn 


Gly 
25 


Arg 


Arg 


Glu 


Ala 


Val 
30 


Arg 


HI;? 


Asp Gly Met 


Val 


Val 


Thr 


Leu 


Leu 


Val 


He 


Val 


Gly 


He 


Val 


Val 


Val 




35 










40 








45 








Val 


Arg Tyr 


Ser 


Leu 


Arg 


Val 


His 


Val 


His 


Lys 


Thr 


Gly Thr Val Val 




50 








55 










60 










Ser 


Ala Ala 


He 


Phe 


Gly 


Phe 


He 


Leu 


Leu 


Gly 


Lys 


Thr 


Val 


Pro 


Cy? 


65 








70 










75 










80 


Asp 


Thr Arg 


Asn 


Phe 


Phe 


Ser 


Ser 


Glu 


Ser 


Asp 


Glu 


Pro 


Glu 


Ser 


Arg 








85 










90 










95 


Val 


Ala Thr 


Glu 


lie 


Ala 


His 


Leu 


Cys 


Glu 


He 


Gly 


Phe 


Gin 


He 


His 






100 










105 








110 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/OI023 



294/ 490 



Ala Ser S&r lis His Val Ala Val Arg Thr Asp Phe Gly Gin Ala Gly 

115 120 125 

He His Cys Pro Met Ala Thr Asp Ala Ser Ala Thr Glu Phe Asp Arg 
130 135 140 

5 Ser Ala Glu Cys Ala Glu Arg Thr Ser Ala Gin He Asp Thr Ala He 
145 ISO 155 160 

Arg Ser Gin Ser Gin He He Arg Thr His He Asp Thr Cys ?ro Lys 

165 170 1T5 

Set Sex Gly Thr He Gly Gly Ser Thr His Thr Ser Leu His Leu Lys 
10 180 135 190 

Val Phe Asp Gly Arg Gly Glu Val Gly His He His Pro Lys Asp Gly 

195 200 205 

Leu Arg Phe Gly Val Val Glu Gly Tyr Ser He Gly Ser Tyr Val Asp 
210 215 220 

15 Ala He Gly He Gly Ala Thr His Ala Lys Ala Gly He Ser Asp Thr 
225 230 235 240 

Arg Thr Gly He Ala Gly Gly Tyx His Gly Gly Ser Gin His Gin Gin 

245 250 255 

He Gly Asp Val Thr Thr He He Gly Leu Gly Lys Phe Gly Leu Ala 
20 260 265 2*70 

Asn Val Gly Val Ser Asp Arg Gly Phe Arg Arg Gly Ala Ser Ser Tyr 

275 280 285 

Asp Leu Tyr Gly Leu Glu Leu His He Ala Lys Thr He Tyr Leu Ala 
^90 295 300 

25 Val Gly Asp Gly Cys Leu Cys Arg Gin Ala Glu Arg Gin Asp Gly 
305 310 315 

(2) INFORMATION FOR SEQ ID HO: 364 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 614 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 tii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE: 
40 (A) ORGANISM: Porphyromonas gingival is 

t ix ) FEATURE : 

(A) N A! IE /KEY i miscJTeature 
IB) LOCATION 1. - .614 



45 



50 



55 



60 



65 



70 



75 



SEQUENCE DESCRIPTION: SEQ ID HO: 364 



Pro 


Tyr 


Asn 


Val 


Gin 


Ser 


He 


Ser 


Asn 


Lys 


Thr 


He 


Lys 


Lys 


Gin 


Met 


1 








5 










10 










15 




Glu 


Asn 


Leu 


Lys 


Asn 


He 


Gin 


Pro 


Arg 


Glu 


Arp 


Phe 


Asn 


Trp 


Glu 


Glu 








20 










25 










30 






Ph- 


Glu 


Ala 


Gly 


Gly 


Val 


His 


Ala 


Ala 


Val 


Ser 


Arg 


Gin 


Glu 


Gin 


Glu 






35 










40 










45 








Ala 


Ala 


Tyr 


Asp 


Lys 


Thr 


Leu 


Asn 


Thr 


He 


Lys 


Glu 


Lys 


Glu 


Val 


Val 




50 










SS 










60 










I!et 


Gly 


Arg 


Val 


Thr 


Ala 


He 


Asn 


Lys 


Arg 


Glu 


Val 


Val 


He 


Asn 


Val 


65 










70 










75 










80 


Gly 


Tyr 


Lys 


Ser 


Glu 


Gly 


Val 


Val 


Pro 


Ala 


Thr 


Glu 


Phe Arg 


Tyr 


Asn 










95 










90 














Pro 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Glu 


Val 


Glu 


Val 


Tyr 


He 


Glu 


Asn 


Gin 








100 










105 










110 






Glu 


Asp 


Lys 


Lys 


Gly 


Gin 


Leu 


Val 


Leu 


Ser 


His 


Arg 


Lys 


Gly 


Arg 


Ala 






115 










120 










125 








Ala 


Arg 


Ser 


Trp 


Glu 


Arg 


Val 


Asn 


Glu 


Ala 


Leu 


Glu 


Lys Asp 


Glu 


He 




130 










135 










140 










Val 


Lys 


Gly 


Tyr 


Val 


Lys 


Cys 


Arg 


Thr 


Lys 


Gly 


Gly 


Met 


lie 


Val 


Asp 


145 










150 










155 










160 


Val 


Phe 


Gly 


He 


Glu 


Ala 


Phe 


Leu 


Pro 


Gly 


Ser 


Gin 


He 


Asp 


Val 


Arg 










X65 










170 








175 


Pro 


He 


Arg 


Asp 


Tyr 


Asp 


Ala 


Phe 


Vol 


Glu 


Lys 


Thr 


Met 


Glu 


Phe 


Lys 








180 










185 










190 




He 


Val 


Lys 


He 


Asn 


Gin 


Glu 


Tyr 


Lys 


Asn 


Val 


Val 


Val 


Ser 


His 


Lys 






195 










200 










205 








Val 


Leu 


He 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 




Lys 


Glu 


He 


He 


Gly 




210 










215 










220 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



295 / 490 



Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Gly 


He 


Val 


Lys 


Asn 


He 


Thr 


225 










230 










235 










240 


Ser 


Tyr 


Gly Val 


Phe 


He 


Asp 


Leu 


Gly Gly Val 


Asp 


Gly 


Leu 


He 


His 








245 










250 










255 




lie 


Thr 


Asp 


Leu 


Ser 


Trp 


Gly Arg 


Val 


Ala 


His 


Pro 


Glu 


Glu 


He 


Val 








260 










265 










27 0 






Gin 


Leu 


Asp 
275 


Gin 


Lys 


He 


Asn 


Val 
280 


Val 


He 


Leu 


Asp 


Phe 
285 


Asp 


Glu 


Asp 


Axg 


Lys 


Arg 


lie 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Met 


Pro 


His 


Pro Trp 




290 










295 










300 










Asp 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Lys 


Val 


Gly Asp 


Lys 


Val 


Lys 


Gly Lys 


305 










310 










315 










320 


Val 


Val 


Val 


Mot 


Ala 

325 


Asp 


Tyr 


Gly 


Ala 


Phe 
330 


Val 


Glu 


He 


Al a 


Gin 
335 


Gly 


Val 


GlU 


Gly 


Leu 
340 


He 


His 


Val 


Ser 


Glu 
34 5 


Het 


Ser 


Trp 


Thr 


Gin 
350 


His 


Leu 


Axg 


Ser 


Ala 


Gin 


A/^p 


Phe 


Lsu 


His 


Val 


Gly Asp 


Glu 


Val 


Glu 


Ala 


Vol 






355 










360 










365 








lie 


Leu 


Thr 


Leu 


Asp Arg Glu 


Glu 


Arg Lys 


Met 


Ser 


Leu 


Gly 


Leu 


Lys 




37 0 










375 










380 










Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 


Arg 


Phe 


Pro 


Val 


305 










390 










395 










4 00 


Gly 


Ser 


Arg 


His 


His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 


Asn 


Phe 


Gly Val 










4 05 










410 










415 




Phe 


Val 


Glu 


He 


Glu 


Glu 


Gly Val 


Asp 


Gly Leu 


He 


His 


He 


Ser 


Asp 








420 










425 










430 






Leu 


Ser 


Trp 
435 


Thr 


Lys 


Lys 


He 


Lys 
440 


His 


Pro 


Ser 


Glu 


Phe 
445 


Thr 


Glu 


Val 


Gly 


Ala 


Asp 


He 


Glu 


Val 


Gin 


-Val 


lie 


Glu 


He 


Asp Lys 


Glu 


Asn 


Arg 




450 










455 










460 










Arg 


L«=> u 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Lou 


Glu 


Glu 


Asn 


Pro 


Trp 


Asp 


Val 


465 










470 










476 










4B0 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 


Gly Thr 


Val 


He 










485 










4 90 










4 95 




Glu 


Val 


Uet Asp 


Lys 


Gly Ala 


Val 


Val 


Ser 


Leu 


Pro 


Tyr 


Gly Val 


Glu 








500 










505 










510 






Gly 


Phe 


Ala 


Thr 


Pro 


Lys 


Has 


Met 


Val 


Lys 


Glu 


Asp Gly 


Ser 


Gin 


Ala 






515 










520 










S25 








Val 


Lsu 
530 


Glu 


Glu 


Lys 


Leu 


Pro 
535 


Phe 


Lys 


Val 


He 


Glu 
540 


Phe 


Asn 


Lys 


Asp 


Ala 


Lys 


Arg 


lie 


He 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 


Glu 


Asp 


GlU 


Gin 


545 










550 










555 










560 


Lys 


Het 


Ala 


Gin 


Arg 

565 


Glu 


Ala 


Asn 


Ala 


Glu 
570 


Arg 


Lys 


Ala 


Glu 


Ala 
575 


Lys 


Ala 


Ala 


Gin 


Lys 
580 


Glu 


Ala 


Ala 


Ala 


Glu 

585 


Ala 


Ala 


Asn 


Pro 


Ala 

5 90 


Gin 


Ala 


Val 


Glu 


Lys 


Ala 


Thr 


Leu Gly 


Asp 


Leu 


Gly Glu 


Leu 


Ala 


Ala 


Leu 


Lys 






5 95 










600 










605 








Glu 


Lys 
610 


Leu 


Ser 


Glu 


Asn 























(2) IN FORMATION FOR SEQ ID NO: 365 

ti) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 24 3 amino acids 
(BJ TYPE: amino acid 
(DJ TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(till HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

I XX I FEATURE: 

(A| NAME/tfeY: misc_feature 
f B) LOCATION 1. . .243 

IXi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

Ser Thr He Met Lys Lys Ala He Leu Ser Gly Ala Ala Leu Leu Leu 

15 10 15 

Gly Leu Cys Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Gly 
20 25 30 
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His 


Ser 


ils 


Tyr Asp 


Glu 


Leu Asp 


Gly 


Arg 


Pro 


Lys 


Leu 


Thr 


Thr Thr 






35 








40 










45 






Val 


GlU 


Asn 


Phe 


Thr 


Pro 


Asp Lys 


Trp 


Giy 


Ser 


Thr 


Phe 


Phe 


Phe He 




50 










55 








60 








Asp Met 


Asp 


Tyr 


Thr 


Gly 


Lys Gly 


He 


Gln 


See 


Ala 


Tyr 


Trp 


Glu He 


65 










70 








75 








80 


Ser 


Arg 


Glu 


Leu 


Lys 


Phe 


Trp Gin 


Ala 


Pro 


Val 


Ser 


He 


His 


Leu Glu 










85 








90 










95 


Tyr 


Asn 


Gly 


Gly Leu 


Ser 


Thr Ser 


Phe 


Thr 


Phe 


Gly His 


Asp Ala Leu 








100 








105 










110 




He 


Gly 


Ala 


Thr 


Tyr 


Thr 


Tyr Asn Asn 


Pro 


Ser 


Phe 


Thr Arg 


Gly Phe 






115 








120 










125 






Thr 


He 


Thr 


Fro 


Met 


Tyr 


Lys His 


Leu 


Gly Ala 


His 


Asp 


Phe 


His Thr 




130 










135 








140 








Tyr 


Gin 


lie 


Thr 


Gly Thr Trp Tyr Met 


His 


Phe 


Leu Asp 


Gly 


Leu Leu 


145 










150 








155 








160 


Thr 


Phe 


Asn 


Gly 


Phe 


Leu 


Asp Leu 


Trp 


Gly 


Phe 


Pro 


Gin 


Glu 


Asn Pro 










165 








170 










175 


He 


Gly 


Gly 


Pro 


Val 


Leu 


Lys Glu 


Gly 


Asp 


Lys 


Phe 


Val 


Phe 


Leu Ser 








100 








185 










190 




Glu 


Pro 


Gin 


Phe 


Trp 


He 


Asn Leu 


A^n 


Arg 


He 


Lys 


Gly 


He Asp Lys 






193 








200 










20S 






Asp 


Phe 


Asn 


Leu 


Ser 


He 


Gly Thr 


Glu 


Met 


Glu 


lie 


Ser 


Arg 


Asn Phe 




210 










215 








220 








Ala 


Arg 


Met 


Asp Lys 


Phe 


Ser Cys 


He 


Pro 


Thr 


Leu 


Ala 


Val 


Lys Trp 


225 










230 








235 








240 


Thr 


Ph- 


Asn 

























(2) IN FORMATION FOR SEQ ID HO: 366 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 235 amino acids 
IB) TYPE: amino acid 
( t>) TOPOLOGY : linear 

(ID MOLECULE TYPE: protein 

(Hi I HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: raiscjeaturs 
(8 ) LOCATION 1...235 



{xi! SEQUENCE DESCRIPTION: SEQ 10 NO: 366 



Met 


Lys 


Thr 


He 


Ser 


Lys 


Asn 


His 


Ala 


Ala 


Arg 


He 


Cys 


Ala 


Ala He 


1 








5 










10 








15 


Ala 


Leu 


Phe 


Ala 


Val 


Cys 


Asn 


Gly 


Arg 


He 


Ala 


Ala 


Gin 


Asp 


Phe Leu 








20 










25 










30 




Tyr 


Glu 


He 


Gly 


Gly 


Gly 


Phe 


Gly 


Ala 


Ala 


Gin 


Tyr 


Phe Gly Asp Ala 






35 










40 










45 






Asn 


Arg 


Gly 


Lsu 


Phe 


Gly 


Ser 


Ser 


Gly 


Val 


Gly 


Leu 


Glu 


Leu 


Val Gly 




50 










55 










60 








Arg 


Tyr 


Asn 


Tyr 


Asn 


Phe 


Arg 


Trp 


Ala 


Phe 


Ser 


Thr 


Met 


Leu Asp Trp 


65 










70 










75 








80 


Arg 


Thr 


Leu 


Arg 


Gly 


Asp 


Thr 


Asp 


Lys 


Ser 


Gly 


Asn 


Val 


Phe 


Pro Asp 










B5 










90 










95 


Phe 


Ala 


Gin 


Ala 


Asp 


Phe 


Lys 


Val 


Gly 


Leu 


Thr 


Gin 


Leu 


His 


Val Arg 








100 










105 










110 




Ser 


Glu 


Phe 


As ii 


Phe 


Leu 


Pro 


Tyr 


Ser 


Asp 


Gly 


Tyr 


Lys 


Tyr 


Leu Gly 






115 










120 










125 






Thr 


Ala 


Arg 


T.PU 


Ser 


Pro 


Tyr 


Val 


Ala 


Ala 


Gly 


Leu 


Ser 


Leu 


Gly Phe 




130 










135 










140 








Ala 


Ser 


Gly 


Ala 


Lye 


Gly 


Ser 


Ala 


Phe 


Ala 


Pro 


Gly 


He 


Thr 


Ala Gly 


145 










150 










155 








160 


Met 


Gly 


Val 


Lys 


Tyr 


Lys 


Leu 


Lys 


Pro 


Arg 


He 


Asn 


Val 


Gly 


He Glu 










165 










170 










175 


Tyr 


Ser 


Phe 


Thr 


Gly 


Leu 


Leu 


Thr 


Asp 


Ala 


Leu 


Asp 


Ala Leu Thr Asp 








180 










185 










190 




Lys 


Ser 


Val 


Trp 


Leu 


Glu 


Asp 


Pro 


Tyr 


Lyr 


lie 


Asn 


Asp Sor 


Trp Val 






195 










200 








205 
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10 



Lys Asm Lys A*p Ala Thr Gly Ala Leu Val L*u A rg I le Thr Tyr Asp 

215 220 
Phe Gly Leu Arg Lys Thr Phe Cys A*n Lys Gin 
225 230 2 35 

(2) INFORMATION FOR SEQ ID NO:36 7 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 
(8 J TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

15 {iil} HYPOTHETICAL: YF.S 

{vij ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
20 <1«) FEATURE; 

(A) NAME/KEY: misc feature 
IE) LOCATION a... 436 

25 Ui) SF.QUEHCE DESCRIPTION: SEQ ID NO: 367 

Asn lie Met r Lys A^p Tyr Lys Gly L *u Tyr Ala Ser Leu Arg Trp 
Tyr Ala Leu lie He Gly Leu Leu ^ JJ a Ala Asp Gly ^ ^ ^ 

30 Gin Asn A*n Asn Phe Thr Glu Ser Pro Tyr Thr Arg Phe lly Leu Gly 
Arg Leu Gly Glu Arg Thr Thr He Ser Gly , Iis Ser Met Gly Gly Leu 

35 65'' ^ % n GlY Thr Tyr Val A | n v *l A,n Pro Ala 

Ser Tyr Ser Ala Val Asp Ser Met Thr Phe lie Phe Asp Phe Gly Ala 
Ser Thr Gly lie Thr Trp Tyr Ala Glu Asn Gly Lys Ly* Asp Asn Arg 
40 Lys Het Gly Asn He Glu Tyr Phe lit Me t Leu Phe Pro lie Ser Lys 



45 



Ser lie Ala Het Ser Ala Gly Val Leu Pro Tyr Ser Ala Ser Gly Tyr 

Gin The Gly Ser Val A,p Gin Val Glu Gly Gly Ser Vai Gin Tyr Thr 

Arg Lys Tyr Leu Gly Thr Gly Asn Leu Asn A^p Leu Tyr Val Gly "° 
165 170 



Gly Ala Thr Pro Phe Lys Asn Phe Ser He Gly Ala Asn Ala Ser Ser 

50 Leu Phe Gly Arg Phe Thr Hi, Ser Arg Gin Val He Phe Ser Thr Glu 
A-** 3 200 



Ala Pro Tyr A*n Pro Val Hl« L^u S=c Thr Leu Tyr Leu Lys Ala Ala 



215 



55 22E Phe ASP Phe Gly aaS ™" Tyr " is Leu Leu "» Lys s « 



230 235 



Ser T.eu val lie Gly Ala Val Tyr Ser Pro Arg Val L ys Met His l« 

Glu Leu Thr Gin lie Lys Asn Gin Val Gin Asn Gly Val Val Val Glu 

60 Ser Glu Thr Gin Glu Tyr lie L y * "y Met Asp Tyr Tyr rhr Leu Pro 

280 ?ft s 

His Thr Leu Gly n. aly eh Ser Tyr Glu Lys ^ ^5 ^ ^ 

" 300 
65 305 ' 31y **" fJS TVr S « L ^ T 'P **» «Y Lys Phe Tyr 

Lys Ser Asp Cys Lys P„ e Gin Asp Arg II. Ar| Val Ser Leu Gly 
Glu lie lie Pro A^ He Asn Ax a Val Gly Met Trp Pro Lys Val Arg 
70 T yr Arg Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Vat Pro Thr 



75 



1*. 91y Gly Val Tyr Gin Gly Tyr His lie Val Gly Ala Val Phe Gly 

J / o 380 

«. Gly lie Pro Leu Asn Asp Arg Arg Ser Phe Val Asn Val Ser Leu 

395 400 
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Glu Tyr Asp Arg Leu lie Pro Lys Glu Gly Met He Lys Glu Asn Ala 

4 05 410 415 

Leu Lys Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys 
420 425 430 

5 Leu Lys Leu Asn 
435 

(2 J INFORMATION FOR SEQ ID UO: 368 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (Ii) IIOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A> ORGANISM; Porphyromonas gingival Is 

( ix ) FEATURE : 

iA) NAME/ KEY: miec_f eature 
(B) LOCATION 1. . .945 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Ui> SEQUEIICE DESCRIPTION: SEQ ID NO: 368 



Acn 


Thr 


He 


Ser 


Glu 


Asn 


Gly 


Asn 


Asp 


Ser 


Thr 


Tyr 


Phe 


Ser 


Phe 


Gin 


1 








5 










10 










15 




Thr 


Phe 


Leu 


Mot 
20 


Arg 


Ser 


He 


Tyr 


Gin 

25 


Leu 


Leu 


Leu 


Ser 


He 
30 


Leu 


Leu 


Ala 


Ser 


Leu 


Gly 


Phe 


Val 


Gly 


I.eu 


Glu 


Ala 


Gin 


Gin 


Ala 


Gly Val 


Ala 






35 










40 










45 








Gly 


Arg 
50 


Val 


Leu 


Asp 


Glu 


Glu 
55 


Gly 


Asn 


Pro 


net 


He 
60 


Gin 


Ala 


Asn 


Val 


Gin 


Leu 


Val 


Gin 


Ser 


Thr 


Gly 


Gin 


Val 


Ala 


Val 


Ala 


Ala 


Gly 


Ala 


Thr 


65 










70 








75 








80 


Asn 


Glu 


Lye 


Gly 


Leu 
85 


Phe 


Ser 


Leu 


Lys 


Thr 
90 


Sec 


Gin 


Glu 


Gly 


Asp 
95 


Tyr 


He 


Leu 


Arg 


Val 
100 


Ser 


Tyr 


Val 


Gly 


Tyr 
105 


Thr 


Thr 


His 


Asp 


Glu 
110 


Lys 


He 


Ser 


Leu 


Arg 
115 


Asn 


Gly 


Gin 


Thr 


He 
120 


Thr 


Leu 


Lys 


Asp 


He 
125 


Ser 


Met 


Asn 


Glu 


Asp 
130 


Ala 


Arg 


Leu 


Leu 


Gin 
135 


Ser 


Val 


Thr 


Val 


Gin 
140 


Ala 


Lys 


Ala 


Ala 


Glu 


Val 


Val 


Val 


Arq 


Asn 


Asp 


Thr 


Leu 


Glu 


Phe 


Acn 


Ala 


Gly 


Ser 


Tyr 


145 










150 










155 










160 


Thr 


Val 


Ala 


Gin 


Gly 
165 


Ala 


Ser 


He 


Glu 


Glu 
170 


Leu 


lie 


Lys 


Lys 


Leu 
170 


Pro 


Gly 


Alii 


Glu 


He 
180 


Gly 


Ser 


Asp 


Gly 


Lys 
195 


He 


Thr 


He 


Asn 


Gly 
190 


Lys 


Asp 


He 


Ser 


Lys 


He 


Leu 


Val 


Asp 


Gly 


Lys 


Glu 


Phe 


Phe 


Ser 


Lys Asp Pro 






195 










200 










2C5 








Gin 


Val 


Ala 


He 


Lys 


Asn 


Leu 


Pro 


Ala 


Asp 


Met 


Val 


Asn 


Lys 


Val 


Gin 




210 










215 










220 








Val 


Leu 


Asn 


Lys 


Leu 


Ser 


Glu 


Leu 


Ser 


Arg 


net 


Ser 


Gly Phe 


Asp Asp 


225 










230 










235 










24 0 


Gly 


Glu 


Glu 


Glu 


Thr 


Val 


He 


Asn 


Leu 


Thr 


Val 


Lys 


Pro 


Glu 


Lys 


Lys 










245 










250 








255 




Lys 


Gly 


Leu 


Phe 


Gly 


Thr 


Leu 


Gin 


Ala 


Gly 


Tyr 


Gly 


Thr Asp 


Gin 


Arg 








260 










265 










270 






Tyr 


Met 


Ala 
275 


Gly 


Gly 


Asn 


Val 


Asn 
2B0 


Arg 


Phe 


A^p 


Gly 


Asn 


Lys 


Gin 


Trp 


Thr 


Leu 


lis 


Gly 


Ser 


Ala 


Asn 


Asn 


Thr 


Asn 


A3n 


Met 


Gly 


Phe 


Ser 


Glu 




290 










2 95 










300 








Met 


Asp 


ser 


Glu 


Met 


Gly 


Ser 


Met 


Thr 


Phe 


Phe 


Ser 


Pro 


Gin 


Gly Gly 


305 










310 










315 










320 


Gly Arg 


Arg 


Gly 


Phe 


Gly 


Asn 


Ser 


Gly 


Gly 


Val 


Thr 


Ser 


Ser 


Ser 


Met 










325 










330 










335 




Leu 


Gly 


Gly 


Asn 


Phe 


Ser 


Val 


Glu 


Phe 


Ser 


Ser 


Ala 


Leu 


Asn 


Thr 


Gly 








340 










345 










350 




Gly Asp 


Ala 


Arg 


Tyr 


Gly 


Tyr 


Asn 


Asp 


Lys 


Ala 


He 


Glu 


Thr 


Thr 


Lys 






355 










360 










365 








Arq 


Val 


Glu 


Asn 


He 


Leu 


Ala 


Glu 


Gly 


Asn 


Thr 


Tyr 


Met 


Asp 


Glu 


Asn 




370 










375 










380 
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Tie 


Leu 


GlU 


Arg 


Ser 


Phe 


Ser 


His 


Asn 


Gly 


Gin 


Ala 


Arg 


Phe 


Arg 


Met 


385 










390 










395 










400 


Gin 


Trp 


Lys 


Pro 


Ser 


Glu 


Arg 


Thr 


Glu 


Val 


Val 


Phe 


Glu 


Pro 


Asp 


Leu 










405 










410 










415 




Ser 


He 


Ser 


Lys 


He 


Asp 


Gly 


Phe 


Phe 


Asn 


Asp 


Thr 


Tyr 


Glu 


Thr 


Lys 








4 20 










425 










430 






Asp 


Ala 


Thr 


Gly 


He 


Ser 


He 


Asn 


Lys 


GLy 


Scr 


Tie 


His 


Gin 


Thr 


Thr 






43b 










440 










4 4 5 








Gin 


Gly 


Asn 


Asn 


Phe 


Arg 


Leu 


Asn 


Gly 


Glu 


Leu 


Asp 


He 


Ser 


His 


T.yj? 




4 50 










455 










4 60 










Leu 


Asn 


Asp 


Glu 


Gly Arg 


Thr 


He 


Ser 


Ala 


Sor 


Val 


Ser 


Gly Gly Leu 


465 










470 










475 










460 


Thr 


Asp 


GlU 


Asp 


Gly Asp Gly 


lie 


Tyr 


Gin 


Ala 


Val 


Leu 


Gin 


Ser 


Va] 










485 










490 










495 




Glu 


Thr 


Asn 


Gin 


Lys 


Gin 


Phe 


Asn 


Atp Asn 


Ser 


Asn 


Leu 


Gin 


Tyr Arg 








500 










505 










510 






tiSU 


Arq 


Leu 


Ser 


Tyr 


Val 


GlU 


Pro 


Leu 


Gly Lys 


Asn 


Tyr 


Phe 


Ala 


Gin 






51 5 










520 










525 








Ala 


He 


Leu 


Asn 


Arg 


Arg 


Phe 


Ser 


Pag 


Arg 


Asn 


Ser 


Asp 


Arg 


Glu 


Val 




530 










535 










540 










Tyr 


Arg 


Leu 


Gly 


Asp 


Asp 


Gly 


Gin 


Tyr 


Ser 


He. 


Leu 


Asp 


Ser 


Gin 


Tyr 


545 










550 










555 










560 


Gly 


Leu 


Ser 


Tyr 


Ser 


Asn 


Glu 


Phe 


Thr 


Gin 


Tyr 


Arg 


He 


Gly 


Leu 


Asn 










565 










57 a 










57 5 




Leu 


Lys 


Lys 


He 


Ala 


Lys 


Thr 


Trp Asp 


Tyr 


Thr 


Val 


Gly 


Phe 


Asn 


Val 








580 










585 










590 






Asp 


Pro 


Asn 


Arg 


Thr 


Val 


Ser 


Tyr Arg 


Ser 


Val 


Ala 


Gly 


Val 


Glu 


Gin 






595 










600 










605 








Asp 


Lys 


Leu 


Ala 


Phe 


Asn 


Arg 


Val 


Asn 


Leu 


Ser 


Pro 


Met 


Leu 


Arg 


He 




610 










615 










620 










Asn 


Tyr 


Lys 


Pro 


Ser 


Arg 


Thr 


Thr 


Asn 


Leu 


Arg 


Val 


Asp 


Tyr 


Arg 


Gly 


625 










630 










635 










640 


Arq 


Thr 


Thr 


Gin 


Pro 


Ser 


He 


Asn 


Gin 


He 


Ala 


Pro 


Val 


Gin 


ASp 


He 










645 










650 










655 




Thr 


Asn 


Pro 


Leu 


Phe 


Val 


Thr 


Glu 


Gly Asn 


Fro 


Gly 


Leu 


Lys 


Pro 


Ser 








660 










665 










670 






Tyr 


Ser 


Asn 


Asn 


Val 


Met 


Ala 


Met 


Phe 


Ser 


Asp 


Phe 


Asp 


Ala 


Lys 


Ser 






675 










680 










685 








Gin 


Arg 


Ala 


Phe 


Asn 


He 


Val 


Phe 


Phe 


Gly Asn 


Tyr 


Thr 


Phe 


Asp Asp 




690 










695 










700 










lie 


Val 


Pro 


Asn 


Thr 


His 


Tyr 


Asp 


Pro 


Ser 


Thr 


Gly 


Ho 


Arg 


Thr 


Thr 


705 










710 










715 










720 


Arg 


Tyr 


Glu 


Asn 


Ala 


Ser 


Gly Thr 


Trp 


Gin 


Ala 


Asn 


Leu 


His 


Gly Thr 










725 










730 










7 35 




Leu 


Ser 


Leu 


Pro 


Leu 


Lys 


Asn 


Arg 


Ala 


Phe 


Ser 


Phe 


Arg 


Met 


Ser 


Leu 








74 0 










745 










750 






Phe 


Asn 


f 


Leu 


Ala 


Glu 


Gly Gin 


Ser 


Phe 


Tie 


Asn 


Asp 


Asp 


Lys 


Asn 






7 55 










760 










765 








Lys 


Ala 


Leu 


Ser 


Phe 


Arg 


Thr 


Arg 


Glu 


Arg 


Leu 


Thr 


Leu 


Thr 


Tyr 


Arq 




770 










775 










700 










Asn 


Asn 


Trp 


He 


Asp 


Thr 


Ser 


He 


Gly Gly Asn 


He 


Gly 


Phe Tyr Met 


7B5 










790 










795 










BOO 


Ala 


Asn 


Asn 


Ser 


Leu 


Ser 


Gly Gin 


Lys 


Asp 


Ser 


Arg 


Thr 


Tyr Asp 


Phe 










805 










810 










815 




Gly Gly 


Asn 


Tyr 


Gin 


Val 


Ala 


Leu 


Thr 


Leu 


Pro 


Tyr 


Gly 


Phe Arg 


He 








820 










825 










830 






Asp 


Ser 


Asp 


Val 


Glu 


Tyr 


Asn 


Thr 


Asn 


Ser 


Gly 


Tyr 


Ser 


Gly Gly 


Phe 






835 










840 








845 








Ser 


Leu 


Asp 


Glu 


Trp 


Leu Trp Asn 


Ala 


Ser 


Leu 


Ser 


Tyr 


Ser 


Phe 


Leu 




850 










855 










860 








Arg 


Asp 


Lys 


Ala 


Gly 


Thr 


Leu 


Arg 


Val 


Asn 


Gly 


Tyr 


Asp 


lie 


Leu 


Gly 


865 










8^0 










875 










880 


Gin 


Arg 


Ser 


Ser 


He 


Ser 


Arg 


Ser 


Ala 


Ser 


Ala 


He 


Asm 


He 


Glu 


Glu 










865 










890 










895 




Ser 


net 


Ser 


Asn 


Thr. He 


Gly Arg 


Tyr 


Val 


Met 


Val 


Asp 


Phe 


He 


Tyr 








900 










905 










910 






Arg 


Phe 


Asn 


Ala 


Phe 


Ser 


Gly Gly Gly 


Ser Arg 


Ser 


Asp 


His 


Gin 


Arg 






915 










920 










925 








Gly Asn 


Met 


Asn 


Arg 


Pro 


Gly 


Pro 


Pro 


Phe 


Gly 


Gly 


Gly 


Arg 


Arg 


Pro 




930 










935 










940 










Sor 
































945 
































(2) 


INFORMATION 


FOR 


SEQ 


ID NO:369 
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(i] SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 
(B> TYPE: amino acid 
t D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

tiil) HYPOTHETICAL: YES 

(Vi) ORIGINAL. SOURCE: 

(A) ORGANISM: Porphyrojnonas gingivalis 

(ix) FEATURE: 

IA) NAME/KEY: misc feature 
(B> LOCATION 1 412 



(xi) SEQUENCE DESCRIPTION: SZQ ID UO;3(S9 



Gly 


Glu 


Tyr 


Pro 


Ala 


Asn 


Ser 


Asn 


Asp 


Lys 


Lys 


Glu 


Met 


Val 


Met 


Lys 


1 








5 










10 










15 




Leu 


lie 


Lys 


Arg 
20 


S<=L 


Leu 


Leu 


Leu 


Leu 
25 


Gly 


Ala 


Val 


Leu 


Leu 
30 


He 


Thr 


Leu 


Pro 


Ala 


Tyr 


Ser 


Gin 


Asn 


Asp 


Asp 


He 


Phe 


Glu 


Asp Asp 


He 


Tyr 






35 










40 










45 








Thr 


Ser 


Arg 


Lys 


Glu 


He 


Arg 


Lys 


Gin 


Asn 


Gin 


Val 


Lys 


Asp Trp 


Gin 




50 










55 










60 










Asn 


Gin 


Glu 


Asp 


Gly 


Tyr 


Gly 


Asp 


Asp 


Thr 


Glu 


Tyr 


Thr 


Val 


Ala 


Ser 


65 










70 










75 










B0 


Asp 


Arg 


Asp 


lie 


Asp 
85 


Ala 


Tyr 


Asn 


Arg 


Arg 

90 


Asp 


Gly 


Gin 


Ser 


Tyr 
95 


Asp 


Gly 


Lys 


Lys 


Leu 


Ser 


Lys 


Asp 


Lys 


Lys 


Arg 


Asp 


Ssr 


Thr Arg 


Ser 


Ser 








100 










105 










110 






Val 


Pro 


Gly 


Arg 


Tyr 


Ser 


Arg 


Arg 


Leu 


Ala 


Arg 


Phe 


Tyr Lys 


Pro 


Asn 






115 










120 










125 








Thr 


lie 


Val 


He 


Ser 


Gly 


Ala 


Asp 


Asn 


Val 


Tyr 


Val 


Thr 


Asp Asp 


Gly 




130 










135 










140 








Giu 


Tyr 


Phe 


Val 


Tyr 


Gly 


Asp 


Glu 


Tyr 


Tyx 


Asp 


Asp 


Ala 


Ser 


Ser 


Val 


145 










150 










155 










160 


Asn 


Tie 


Tyr 


He 


Asn 

165 


Ser 


Pro 


Trp 


Cys 


Asp 
170 


Pro 


Phe 


Pro 


Tyr 


Thr 
175 


Ser 


Trp 


Tyr 


Pro 


Ser 
100 


Phe 


Ser 


Gly 


Trp 


Tyr 
185 


Asn 


Tyr 


Thr 


Trp 


Asn 
190 


Tyr 


Pro 


Trp 


Phe 


Tyr 
195 


Tyr 


Gly 


Ser 


His 


He 
200 


Gly 


Trp 


Gly 


Gly 


Tyr 
205 


Tyr 


Pro 


Gly 


Tyr 


Asn 


Trp 


Tyr 


Trp 


Ser 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Phe 


Tyr Asn 


Pro 


Tyr 




210 










215 










220 










Gly 


Tie 


Gly 


Met 


Gly 


Trp 


Gly 


Tyr 


Pro 


Tyr 


Gly 


Trp 


Gly 


Ser 


Tyr 


Tyr 


225 










230 










235 










240 


Gly 


Trp 


Gly 


Gly 


Tyr 
245 


Pro 


Gly 


Val 


He 


His 
250 


His 


Tyr 


His 


His 


Tyr 
255 


Pro 


Lys 


Lys 


Thr 


Tyr 


Ser 


Asn 


Gly 


Gin 


His 


Ser 


Gly Ala 


Tyr 


Tyr 


Ser 


Tyr 








260 










265 










270 




Gly 


Arg 


Pro 


Asn 


Arg 


He 


Lys 


Gly 


Gly 


Thr 


Ser 


Gly Ala 


Lys 


Leu 


Gly 






275 










280 










285 








Thr 


Gly 


Arg 


Tyr 


Asp 


Arg 


He 


Gin 


Asn 


Ser 


Ser 


Ser 


Gin 


Lys Asn 


Lys 




290 










295 










300 








Phe 


Gly 


Leu 


Gin 


Ser 


Asn 


Lys 


Pro 


Asn 


Asn 


Asn 


Leu 


Gin 


Asn 


Val 


Lys 


305 










310 










315 










320 


Ser 


Gly 


Arg 


Thr 


Gly 
325 


Arg 


Ala 


Asn 


Arg 


Asp 
330 


Arg 


Asn 


He 


Glu 


Thr 

335 


Val 


Thr 


Pro 


Asn 


Asn 

340 


Gly 


Gin 


Lys 


Gin 


Asn 
34S 


Arg 


Pro 


Val 


Phe 


Gin 
350 


Gin 


Asn 


Gin 


Ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


Gly 


Arg 


Asn 


He 


Arg 


Ser 


Glu 


Arg 






355 










360 










365 






Gin 


Gly 
3/0 


Glu 


Asn 


Asn 


Asp 


Arg 
375 


Thr 


Phs 


Ser 


Thr 


Pro 
360 


Ser 


Arg 


Ser 


Asn 


Ser 


Asn 


Gly 


Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 


Ser 


Gly Ser Het 


385 










3 90 








395 










400 


Ser 


Gly 


Gly 


Gly 


Gly 
4 05 


Arg 


Ser 


Gly 


Arg 


Gly 
410 


Arg 


Asn 











(2) INFORMATION i'OK SEQ ID NO: 37 0 



|i) SEQUENCC CHARACTERISTICS: 

(A) LENGTH: 601 amino acids 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



301/490 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival!* 
(ix) FEATURE: 

(A) NAMK/KEY; mlsc feature 

(B) LOCATION 1. , ,601 

(xi) SEQUEIICE DESCRIPTION: SEQ ID UO:370 

Ser Asn S,r Ser Ser Hi, Ly* Trp Leu He Tyr Tyr His He Glu Lyr 

Thr Lys Ser XI, net He Arg Lys Leu iL Leu Leu Leu Ala Leu Met 

Pro Vol Ala Ser Val Ala Phe Ala Val Pro Thr Asp Ser Thr Glu Ser 

Ly, A^p Asn Arg u« Leu Thr Ser HeL Gin Ser Ser Ser Leu Asn Ary 

& 60 
Arp Asp Ala Pro Asp Lys Trp Gin Pro Met His Ala Asn Phe Ser He 

Gin Ser A sp Met L=„ Leu Ser Thr Al a Gin Lys Ser Lys Asn Thr Trp 
Ph. Gly Asn ser Tyr Ils Met ay „ ly- ^ ^ ^ 9^ ^ 

Phe Gly Ala Arg Phe Glu Asp Leu Tyr Lys Pro Leu Pro lly His Glu 
Pro Glu Met Gly ^ Gly Val Pro His Met Tyr Val Lys Gly Ser Tyr 
His Trp Ala Glu Leu Thr Met Gly Asp Phe Tyr Asp Gin Phe Gly Ser 



Gly Met Val Phe Arg T^ Tyr G!u Glu Arg a" Leu Gly He Asp ^ 

Ala Val Arg Gly Gly Arg lie Val Leu Thr Pro Phe Asp Gly Val Arg 

Val Lys Gly He Ala Gly Gin Gin Jrg Asn Tyr Phe Asp Arg Thr Gly 

" ~ 200 20s 

Lys Val Phe Asn Ser Gly Arg Gly Tyr Leu L.u Gly ser Asp Leu Glu 

Leu Asn Val Glu Arg Trp Ser Ser Ala „et Arg A^p Arn Asp Tyr His 

Leu Ala He Gly oiy Ser Phe Val Ser Lys nil Glu Ala Asp Glu £p 

lie Phe Val Gly Val Gly Glu Asp Arg Lys Arg Leu Asn Leu Pro Leu 

Asn Val Pro He Met Gly Leu Arg Thr Asn Phe Gin Lys Zly Gly Leu 

Ala Leu Tyr Ala G,u Tyr Gly Jyr Lys Tyr Asn Asp III Ser Ala Asp 

Asn Asp Tyr He Tyr His Asp 61 y Gin Ala Ala Leu Leu Ser Ala Ser 

Tyr Ser Lys Lys Gly Met Sec He Leu Leu «n Ala Lys Arg Cys llu 

330 

Asn Phe Ala Phe Arg Ser Lys Arg Ser Ala Gin Leu Thr Pro Lau Met 
He Asn Tyr Met Pro Ala Phe Thr Gl'n Ala His Thr Tyr Thr Leu Ala 
Ala lie Tyr Pro Tyr Ala Thr sin Pro Gin Gly Glu Ala Phe Gin 

Gly Glu Leu Arg Tyr Asn Phe Ala Arg Arg Thr m! Leu Gly Gl y Arg 

39S .jot) 



Tyr Gly Thr Gly Le u Arg He Asn Val Ser Hi's Val Arg Gly Leu As'p 
Lys Lys Met Leu Lys Clu Asn Pro Asp G }S Leu He Gly Thr Aso Gly 
Tyr Thr Val 5 « eha Phe Gly ^ ^ ^ „ yr «0 ^ ^ 

Asp Val Glu He Thr Lys Lys Val Ser Pro , ly Phe Asn Phe Thr Leu 
Thr Tyr Leu Asn Gin He Tyr Asn Asn Lys Val Leu „„ Gly Ala Ala 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



WO 99/29870 



PCT/AU98/01023 



302/490 



465 










470 










475 










400 


Gly Glu 


Lys 


Pro 


Glu 


Lys 


He 


Tyr 


Ala 


Asn 


He 


Phe 


Val 


Tyr 


Asp 


Gly 










485 










490 










4 95 






Tyr 


Lys 


Leu 
500 


Ser 


Asn 


Lys 


Val 


Ala 

505 


Leu 


Arg 


Thr 


Glu 


Leu 

51 0 


Gin 


Tyr 


Leu 


Ills 


Thr 


Lys 


Gin 


Asp 


Gin 


Gly 


Asp 


Trp 


He 


Tyr Gly Met 


Ala 


Glu 






515 










520 










525 








Leu 


Ser 
530 


lie 


L~u 


Pro 


Ser 


Leu 

535 


Met 


Leu 


Ser 


Leu 


Ser 
540 


Glu 


Gin 


Tyr 


Asn 


He 


Gly 


Glu 


Thr 


Lys 


Lys 


His 


Tyr 


Val 


Met 


Gly 




Val 


Thr 


Tyr 


Thr 


615 










550 










555 










560 


His 


Gly 


Ala 


His 


Arg 

565 


Val 


Ala 


Phe 


Ser 


Ala 

570 


Gly 


Lys 


Thr 


Arg 


Ala 

57 5 


Gly 


net 


Asn 


Cys 


Ser 
580 


Gly 


Gly 


Val 


Cys 


Arg 
585 


Val 


Val 


Pro 


Glu 


Thr 
S90 


Gin 


Gly 


Phe 


Tyr 


Leu 
595 


Ser 


Tyr 


Ser 


Thr 


Asn 
600 


Leu 

















(2) I N FORMAT I OM FOR SEQ ID NO: 371 

(l) SEQUENCE CHARACTERISTICS: 

{AJ LENGTH : 252 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui> 110LECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE : 

(A) NAME/KEY: mlsc_f eature 
{B) LOCATION 1. . .252 



(XI) SEQUENCE DESCRIPTION: SEQ ID HO: 371 



Arg 


Gly 


Ser 


Ser 


Ser 


Gly 


He 


Ser 


Ala 


Arq 


Gly 


Arg 


Asp Met 


Arg 


Ser 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Ser 


Ala 


Leu 


Arg 


Ser 


Ser 


Ser 


Leu 


His 


Gly 


Ser 


Glu 


Arg 








20 










25 










30 






Arg 


Ser 


Arg 


He 


Ser 


Ser 


Ser 


Val 


Val 


Met 


Ser 


He 


Arg 


Gin 


Lys 


11 = 






35 










40 










45 








Arq 


Leu 


Phe 


His 


Leu 


Ser 


Val 


Cys 


Ala 


Gin 


Thr 


His 


Asp 


His 


Leu 


He 




so 










55 








60 








Glu 


He 


His 


Leu 


Val 


Cys 


He 


Glu 


Phe 


Gly 


Ala 


He 


Asp 


Thr 


Asp 


Glu 


65 










70 










75 










80 


Phe 


Arg 


Leu 


Ser 


Ser 


His 


Ala 


Tyr 


Thr 


Thr 


Ser 


Pro 


Thr 


His 


Thr 


Gly 










85 








90 










95 




Ala 


He 


His 


His 


Asn 


Cys 


He 


Glu 


Arq 


Ser 


Tyr 


Gly 


Arg 


Tyr 


Leu 


Val 








100 










105 










110 






Thr 


Phe 


Gly 


Gin 


Glu 


Arg 


Asn 


Glu 


Leu 


His 


His 


His 


Ser 


Arg 


Pro 


Asp 






115 










120 










125 








Arg 


Asn 


Ala 


Glu 


Val 


Tyr 


Arg 


Phe 


Pro 


Phe 


Asp 


Asn 


Ala 


Phe 


His 


Ser 




130 










135 










140 










He 


Arg 


Tyr 


Glu 


Ala 


Phe 


Arg 


Pro 


He 


Arg 


Pro 


lis 


Val 


Cys 


His 


Asp 


145 










150 










155 










160 


Asp 


His 


Phe 


He 


Ala 


He 


Gly 


Ser 


His 


Leu 


Phe 


Phe 


Lys 


Asp 


Asn 


Gin 










165 










170 










175 




He 


Phe 


Ser 


Ser 


Gly 


Ser 


Gin 


Tyr 


Asp 


Asn 


Tyr 


Thr 


Val 


Ala 


Cys 


Phe 








180 










185 










190 






Va3 


Glu 


Ser 


Leu 


His 


Asp 


Arg 


Glu 


Gin 


Arg 


Ser 


His 


Thr 


His 


Thr 


Ala 






5 95 










200 










205 








Ser 


Gly 


Thr 


Asn 


His 


Cys 


Ala 


Asp 


Leu 


Phe 


Asp 


Met 


Arg 


Thr 


Leu 


Ser 




210 










215 










220 










Gin 


Arg 


Thr 


Tyr 


His 


Tie 


Arg 


Asp 


He 


Val 


Ala 


Asp 


Phe 


Glu 


Phe 


Gly 


225 










230 










2 35 










240 


Gin 


Phe 


Leu 


Gly 


Arg 


Phe 


Ala 


Hie 


Gly 


Leu 


Asn 


His 











245 250 



(2) INFORMATION FOR SEQ ID MO: 372 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 amino acids 



SUB STITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



303/490 

(B) TYPE: amino acid 
{D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii ) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gxngivalis 

< ix J FEATURE : 

(A) MAI IE /KEY: misc feature 

(B) LOCATION 1. . .790 

Ui> SEQUENCE DESCRIPTION; SEQ ID MO:372 

Leu Leu Phe Ser Ser Pro Leu Pro Arq Met Asp Arg Pro Lys Pro Ser 

5 10 15 

Tyr Tie Val Arg He Ala Ala He Leu Cys Leu Phe Val Gly Arg Pro 

20 25 3Q 

Leu Phe Al d Gin Ser Tyr Val Asp Tyr Val Asp Pro Leu He Gly Thr 



' fn r PhS GlU L ' eU Ser Ala G1 >' Asn Thr T Vr Pro Val He Gly 

50 55 60 

Leu Pro Trp Gly Met Asn Ser Trp Thr Pro Met Thr Gly Val Pro Gly 

Asp Gly Trp Gin Tyr Thr Tyr Ser Ala Hi* Lys He Arg Gly Phe Lys 



90 05 
Gin Thr His Gin Pro Ser Pro Trp He Asn Asp Tyr Gly Gin Phe Ser 

100 105 110 

Leu Leu Pro Leu Thr Ala Pro Gin Lys Pro Ser Ser Asn Asp Ser He 

115 120 12 S 

Ala Leu Thr Lys Trp Cys Lys Gin Leu Phe Ser Asp Glu Gin Thr Ser 

1JU 135 140 

Trp Phe Ser His Lys Ala Glu Thr Ala Thr Pro Tyr Tyr Tyr Ser Val 
150 155 icq 

Tyr Leu Ala Asp Tyr Asp Thr Arg Val Glu Met Ala Pro Thr Glu Arq 

165 I 70 175 

Ala Ala He Phe Arg He Arg Tyr Ser Gly Asn Thr Glu Ser Gly Ser 

180 185 190 

Gly Arg Trp Leu Arg Leu Asp Ala Phe Thr Gly Gly Ser Glu He Ser 

195 200 205 

lie Val to p Pro Hi* Thr Val Val Gly He Ser Arg Lys Asn Ser Gly 

* XKm 215 220 

Gly Val Pro Ala Asn Phe Ala Cys Tyr Phe He Leu Gin Ser Asp Thr 

235 o a c\ 

Pro Met Ala Asp Val Leu Leu Glu Thr Asp Thr Gly Lys Ser Asp Glu 
245 250 



Gly Thr Arg Ala Trp Ala Ala Cys Arg Phe Asp Ser Gin Glu Val Thr 

260 265 

Ala Ser Ser Phe 

Val Lys Gly Gin 
290 295 " 3oo 



270 

Val Arg Val Ala Ser Ser Phe He Ser Val Glu Gin Ala Glu Arg Asn 

275 280 285 

Leu Ala Glu Val Lys Gly Gin Ser Phe Asp Arg He Arg Leu Ala Gly 



Arg Glu Ala Trp Asn Lys Val Leu Gly Arg He Hi* Val Glu Gly Gly 

Thr Lys Asp Glu Arg Thr Thr Phe Tyr Ser A^a Leu Tyr Arg Cys Leu 

Leu Phe Pro Arg Arg Phe Tyr Glu Glu Asp Ala Ser Gly Asn Phe Val 

His Tyr Ser Pro Tyr Asn Gly Glu Val Leu Pro Gly Tyr l!u Tyr Thr 

- 360 365 

ASP 370 I5f Ar9 Ala LeU Phe Leu Leu Asn 

Leu Leu Tyr Pro Asp Glu Asn He Lys He Gin Glu Gly Leu Leu Asn 
390 395 don 

Val Tyr Arg Glu Ser Gly eh* Phe ero Glu Trp Ala Ser Pro Gly His 
405 4\0 41 | 



Arg Asp Cy S M*t lie Gly Asn Asn Ss r Ala Ser V*l Leu Ala Asp Ala 

420 4 2S j 30 

Tyr Leu Ly* Gly Val Arg Val Glu Asp Thr Arg Thr Leu Met Asn Gly 

LS " £0 Thr L5fS A if His pr ° 116 Ser Ser Thr Gly 

455 46O 
Arg Lys Gly Trp Glu Trp Tyr Asn Ser Leu Gly Tyr Val Pro Ala Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



304/ 490 



465 










470 










475 










480 


Ala 


Gly 


lie 


Asp 


Glu 


Ser 


Ala 


Ala 


A a 


Thr 


Leu 


Glu 


Tyr 


Ala 


Tyr 


Asn 










485 








4 90 










4 95 




ASD 


Trp 


Cys 


He 


Leu 


Arg 


Leu 


Glv 


Arg 


Thr 


Leu 


Gly 


Trp Asp 


Arg 


Ala 








500 










505 










510 






Ala 


L*»u 


Asp 


Thr 


Leu 


Ala 


Hi 3 


Arg 


Ser 


Met 


Asn 


Tyr 


Arg 


His 


Leu 


Phe 






515 










520 










525 








Asp 


Pro 


Glu 


Thr 


Lys 


Leu 


Met 


Arg 


Gly 


Arg 


Asn 


Gin 


Asp 


Gly Ser 


Phe 




530 










535 










540 










Arg 


Thr 


Pro 


Phe 


Ser 


Pro 


Phe 


Lys 


1 rp 


uxy 


Asp 


Val 


Phe 


Thr 


Glu 


Gly 


545 










550 










555 










560 




Ala 


Trp 


Hi.-? 


Tyr 


Thr 


l rp 


Ser 


Val 


Phe 


His 


Asp Val 


Gin 


Gly 


Leu 










565 










570 










575 




Tie 


A 


Leu 


Met 


Gly Gly 


Asp 


Arg 


Pro 


Phe 


Val 


Ser 


Met 


Leu 


Asp 


Ser 








580 




















590 






Val 


Phe 


A"n 


Thr 


Pro 


Pro 


Met 


Phe 


Asp 


Glu 


Ser 


Tyr 


Tyr 


Gly 


Phe 


Val 






595 










600 










605 








He 


His 


Glu 


He 


Arg 


Glu 


Met 


Gin 


He 


Ala 


Asp 


llet 


Gly 


Asn 


Tyr Ala 




610 










615 










620 










Hie 


Gly 


Asn 


Gin 


Pro 


He 


Gin 


His 


Met 


He 


Tyr 


Leu 


Tyr 


Asn 


His 


Ala 












630 










635 








640 


Gly 


His 


Pro 


Trp 


Lys 


Ala 


Gin 


Glu 


Arg 


Leu 


Arg 


Glu 


Val 


Met 


Gly Arg 










645 










650 










655 




Leu 


Tyr 


Arg 


Pro 


Thr 


Pro 


Asp 


Gly 


Tyr 


Cys 


Gly Asp 


Glu 


Asp 


Asn 


Gly 








660 










665 










67 0 






Gin 


Thr 


Ser 


Ala 


Trp 


Tyr 


Val 


Phe 


Ser 


Ala 


Leu 


Gly 


Ph» 


Tyr 


Pro 


Val 






67 5 










680 










665 








Thr 


Pro 


Ala 


Thr 


Asp 


Gin 


Tyr 


Val 


Leu 


Cly 


Ser 


Pro 


He 


Phe 


Ser 


Lys 




690 










695 








700 










Val 


He 


Leu 


Ser 


Phe 


Pro 


Asp 


Gly 


His 


Lys 


Thr 


Val 


Leu 


His 


Ala 


Pro 


705 










710 










715 










720 


Ala 


Acn 


Ser 


Ala 


Asp 


Thr 


Pro 


Tyr 


He 


Arg 


Ser 


He 


Ser 


Val 


Glu Gly 










725 










730 










735 




Lys 


Glu 


Trp 


Ser 


Cys 


Asn 


Tyr 


Leu 


Thr 


His 


Glu 


Gin 


Leu 


Arg 


Ser 


Ser 








740 










745 










750 






Ala 


Scr 


He 


Gin 


Trp Met 


Met 


ASp 


Thr 


Lys 


Pro 


Asn 


Tyr 


Asn Arg 


Gly 






755 










760 










765 








Met 


Lys 


GlU 


Ser 


Asp Arg 


Pro 


Tyr 


Ser 


Phe 


Ser 


Thr 


GJu 


Gin 


Gin 


Arg 




770 










775 










780 










Arg 


Ala 


Asn 


His 


Ser 


Asn 






















785 










7 90 






















(?) 


INFORMATION 


FOR 


SEQ 


ID NO:373 

















fi! SEQUENCE CHARACTERISTICS: 

(A} LEUGTH: 286 amino acids 
(BJ TYPE; amino acid 
(Dt TOPOLOGY: linear 



Ui] MOLECULE TYPE: protein 

(tli) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroroonas gingivalls 

(ix) FEATURE: 

<A) NAME /KEY : aiscfeature 
(8) LOCATION 1 . . -286 



(Ai) SEQUENCE DESCRIPTION: SEQ ID NO:373 



He 


Cys 


Gly 


Ser 


Lys 


Met 


Asn 


Leu 


Ser 


Gly 


Leu 


Gin 


Ser 


Phe 


Thr 


Met 


1 








5 










10 










15 




Met 


Lys 


Ser 


Her 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Ehe 


Pro 


Leu 


Ser 


Leu 








20 










25 










30 






He 


Thr 


Ala 


Leu 


Gly 


Cys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Ly* 


Ser 






35 










40 








45 








Val 


Ser 


Phe 


Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Axg 


Tyr 


lie 


Pro 


Leu 


Arg 


Ala 




50 










55 










60 










Asp 


He 


Asp 


Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


Tie 


Ser 


Tyr Val 


Tyr 


Pro 


65 










70 










75 








80 


Ser 


Gly 


Asp 


Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Ph^s 


Gly 










85 










90 










95 




Asp 


Ser 


Leu 


Met 


Asp 


Ser 


Ser 


Scr 


Pro 


Glu 


Asn 


Ala 


Met 


Giu 


Gly Tyr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



305/490 









100 










105 








110 






Ala 


Gin 


Met 


Leu 


Gly 


Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn Ala 


Glu 


Ala 


Asn 






115 










120 








125 








Leu 


Gin 


Gly 


Leu 


Pro 


Sar 


Asp 


Leu 


I.eu 


Asp 


Tyr 


He Tyr 


Lys 


Gin 


Glu 




130 










135 










140 








Asn 


Thr 


He 


Ala 


Tyr 


Cys 


Asp 


Thr 


Gly 


Leu 


He 


Ser Thr 


Arg 


He 


Asn 


145 










150 










155 








160 


Thr 


Tyr 


Thr 


Tyr 


Glu 


Gly 


Gly 


Ala 


Hi a 


Thr 


Glu 


Asn Thr 


Val 


Arg 


Phe 










165 










170 








175 




Ala 


Asn 


He 


Leu 


Arg 


Thr 


Thr 


Gly 


Lys 


Val 


Leu 


Glu Glu 


Arg 


Asp 


He 








ISO 










185 








190 






Phe 


Lys 


lie 


A^p 


Tyr 


Ala 


Glu 


Arg 


Leu 


ser 


Ala 


Leu He 


He 


Gly Gin 






135 










200 








205 








Leu 


Val 


His 


Asp 


Phe 


Gly 


Lys 


Thr 


Thr 


Pro 


Ala 


Glu Leu 


Asp Ala 


He 




210 










215 










220 








Gly 


Phe 


Phe 


Asn 


Ala 


Glu 


Glu 


He 


Gin 


Pro 


Asn 


Gly Asn 


Phe 


Met 


He 


225 










230 










235 








240 


Asp 


Asp 


Lys 


Gly 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asn 


Glu 


Tyr Gin 


He 


Ala 


Ala 










245 










250 








255 




Tyr 


Ala 


Arg 


Gly 


Ala 


Val 


Tyr 


Val 


Arg 


Leu 


Gly, 


Tyr Asp Val 


Leu 


Ala 








260 










265 








270 






Pro 


r,ou 


Leu 


Arg 


Asp 


Asp 


Ser 


Pro 


Leu 


Lys 


Arg 


Tyr Leu 


Pro 










275 










280 








285 








(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 37 4 















(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 37 8 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<lii) HYPOTHETICAL: YES 

{vi} ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

Ux) FEATURE: 

(A) NAME/ KEY : misc_feature 
IB) LOCATION 1. . .37 8 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 374 



Gly 


He 


He 


Ser 


Val 


Leu 


Ser 


His 


Val 


Val 


Asp 


Arg 


Pro 


Gin 


Trp Gly 


1 








5 










10 










15 




Ala 


Ser 


Pro 


Glu 


Ala 


Ala 


Gly 


Thr 


His 


Ser 


Val 


Tyr 


Ser 


He 


Leu 


His 








20 










25 








30 






Pro 


Ser 


Ala 


Gly 


He 


He 


Arg 


He 


Arg 


Ser 


Het 


Gly 


He 


He 


Ser 


Ala 






35 










40 








45 








Cys 


Arg 


He 


Ala 


He 


Leu 


Ala 


Gly 


Lys 


His 


Pro 


Arg 


Ser 


Gly 


Leu 


Ser 




50 










55 










60 










Arg 


Ala 


Asn 


Val 


Gly 


He 


Leu 


Ser 


Tyr 


Asn 


Pro 


Glu 


Asn 


Thr 


Pro 


Glu 


65 










70 










75 










80 


Lys 


Lys 


Arg 


Lys 


Leu 


Gin 


Glu 


Lys 


Asn 


Val 


Phe 


Leu 


Gin 


He Arg 


Leu 










85 










90 










95 




Arg 


Gin 


Ser 


Phe 


Asn 


Asn 


Leu 


lie 


Pro 


Ser 


Leu 


Pro 


Phs 


Arg 


He 


Asp 








100 










105 










110 






Asn 


Thr 


Lys 


Lys 


He 


Thr 


Glu 


Met 


Lys 


Lys 


Thr 


Thr 


Leu 


Thr 


Gly Ser 






115 










120 










125 








He 


Cys 


Ala 


Leu 


Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Ser 


Ala 


Asn 


Ala 


Gin 


Ser 




130 










135 










140 










Lys 


Leu 


Lys 


He 


Lys 


Ser 


He 


Glu 


Ala 


Ala 


Thr 


Thr 


Phe 


Ser 


Ser 


Ala 


14b 










150 










155 










160 


Thr 


Ala 


Gly 


Asn 


Gly 


Phe 


Gly 


Gly 


Asn 


He 


Phe 


Gly 


Met 


Asp 


Met 


Ser 










165 










170 










175 




He 


Arg 


Met 


Arg 


Val 


His 


His 


Ser 


He 


r.eu 


Pro 


Glu 


Gly Leu Asp 


Phe 








180 










IBS 










190 






Ser 


Val 


Gly 


He 


His 


Glu 


Arg 


Arg 


Ala 


His 


Trp 


Glu 


Glu 


Ala 


Gly 


Ser 






195 










200 










205 






Pro 


Lys 


Leu 


Met 


Tyr 


Thr 


Asn 


Val 


Pro 


Ser 


He 


lie 


Gly He 


Val 


Glu 




210 










215 










220 










Lys 


Val 


He 


Val 


Phe 


Glu 


Asp 


Ala 


Glu 


Asp 


Phe 


Phe 


Asp Lys 


Lys 


Ala 


225 










230 










235 








240 


Leu 


Gly 


Arg 


Phe 


Leu 


He 


Ser 


Leu 


Gly 


He 


Ser 


Tyr 


Thr 


Lys 


Ills 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



306/490 



245 250 255 



Gly 


Ala 


Tyr 


Trp 
260 


Gly 


Trp 


Thr 


Asn 


Asp 
265 


Ala 


His 


lie 


Leu 


Phe 
270 


Ser 


Pro 


lie 


Pro 


Lys 
275 


Ser 


Lys 


Val 


His 


Tyr 
280 


Asp 


Thr 


Tyr 


Thr 


Arg 
285 


Ala 


Gly 


Ser 


Asp 


Leu 
290 


Val 


Leu 


Gin 


Ser 


Glu 

295 


Asp 


Val 


Ala 


Thr 


Val 

300 


Ser 


Asn 


Gly 


Phe 


Ser 


Pro 


Gl> 


He 


Gly 


Leu 


Lys 


Ser 


Ser 


lie 


Trp 


Trp 


Lys 


Met 


Pro 


He 


305 










310 










315 










320 


Lys 


Ser 


Lys 


Tyr 


Asp 
325 


Phe 


Arg 


Leu 


Gly 


Phe 
330 


Ser 


Leu 


Gly 


Tyr 


Glu 
335 


Tyr 


Leu 


Asn 


Leu 


Leu 
340 


Tyr 


Pro 


Tyr 


Arg 


Asn 
345 


Ph« 


Lys 


Leu 


Asp 


Gly 
350 


Asn 


Lys 


Pro 


Leu 


Ser 
355 


Ala 


Leu 


Ser 


Pro 


Arg 
360 


Met 


Asn 


His 


lie 


Gly 
365 


His 


Val 


Gly 


Phe 


Asn 


Phe 


Thr 


Val 


Gly 


Leu 


Trp 


Thr 


Asn 















370 375 



(2) IMFOPJtATIOU FOR SKQ ID MO: 375 

(1} SEQUENCE CHARACTERISTICS! 

(A) LENGTH; 1269 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) ltOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl } ORIGINAL SOURCE: 
* (A) ORGANISM: Porphyromonas gingivalis 

lix) FEATURE : 

<A> NAME/KEY: miscjeature 
(Bi LOCATION 1...1269 



Ui) SEQUENCE DESCRIPTION: SEQ ID HO: 375 



Val 


Lys 


Cys 


Wet 


Gly 


Lys 


Tyr 


Lys 


Arg 


Ala 


Lys 


Tyr 


Arg 


Tyr 


Trp Leu 


1 








5 










10 










15 


Phe 


Pro 


Phe 


Cys 


Ser 


Asp 


Tyr 


Tyr 


Thr 


Phe 


Glu 


Gly 


Val 


Thr 


Phe Leu 








20 










25 










30 




Cys 


Ala 


Ser 


Asp 


Asp 


Met 


Thr 


Thr 


Lys 


Lys 


Pro 


Gin 


Ala 


He 


Leu Asp 






35 










40 










45 






Leu 


Glu 


Lys 


Ala 


Tyr 


Asn 


He 


GlU 


He 


Pro 


Asp 


Leu 


Ser 


Ser 


Gin Glu 




50 










55 










60 








Gly 


He 


Ser 


Trp 


Ser 


Val 


Asn 


Arg 


Tyr 


Phe 


Lys 


Gin 


Asp 


Ser 


Ser Gly 


65 










70 










75 








80 


Ala 


Val 


Val 


Glu 


Leu 


Cys 


Leu 


Arg 


Glu 


Cys 


Gin 


He 


Glu 


Ser 


Met Thr 










85 










90 










95 


Trp 


Leu 


He 


Asp 


Phe 


Pro 


Ala 


Lcm 


Lys 


Lys 


Leu 


Asp 


Leu 


Ser 


Tyr Asn 








100 










105 










110 




Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Axg 


Leu 


Thr 


Ser 


Leu 


Thr Lys 






115 










120 








125 






Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly Leu 


Asp Ser 




130 










135 










140 








Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Ser 


Leu 


Ser 


Asp 


A3n 


Gin 


He 


Ser Lys 


145 










150 










155 








160 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu Leu 










165 










no 










175 


Asp 


Asn 


Gin 


He 


Sor 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser Leu 








180 










185 










190 




Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gl y Leu 






195 








200 










205 






Glu 


Alo, 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly Asn 


Gin He 




210 










215 










220 








Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


,Ser 


Leu 


Thr 


Lys 


Leu Arg 


225 










230 










235 








240 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu Thr 










245 










250 










255 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu Glu 








260 










265 










270 




Gly 


Leu 


GlU 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly Asn 






275 










2B0 










285 




Gin 


He 


Ser 


I.ys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Ser 


Ser 


Leu 


Thr Lys 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



307/ 490 



290 295 30O 

Leu Arg Leu Arg Ser Asn Gin lie Ser Lys Leu Glu Gly Leu Glu Arq 
305 310 315 320 

Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He Ser Lys 

325 330 335 

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu 

340 345 350 

Asp Asn Gin He Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu 

355 360 365 

Thr Lys Leu Axg Leu Arg Ser Asn Gin He Ser Lys Leu Glu Gly Leu 

370 375 380 

Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He 
385 390 395 400 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 

405 410 415 

Leu Leu A~p Asn Gin He Arg Lys Leu Glu Gly Leu Asp Gly Leu Ala 

420 425 430 

Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gin He Ser Lys Leu Glu 

435 440 445 

Gly Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser Gly Asn 

450 455 4 6 o 

Asp He Gin Ser He Asp Asp He Lys Leu Leu Ala Pro He Leu Glu 
4*5 470 475 480 

Gin Thr Leu Glu Lys Leu Arg He His Asp Asn Pro Phe Val Ala Ser 

4fi5 4 go 495 

Ser Gly Leu He Leu Ser Pro Tyr Asp Asn His Leu Pro Glu 11° L"s 

500 505 510 

Ala Leu Lou Glu Lys Glu Lys Glu Lys Gin Lys Lys Thr Ser Val Glu 

515 520 525 

Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser S»r Glv 

530 535 540 

Lys Thr Thr Phe Leu Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr Gin Lys 
545 550 555 560 

Asn Thr His Val Leu Ser He Hi a Arq Ser Asn Asn Pro Asn Ala He 

565 570 575 

Phe Tyr Asp Phe Gly Gly Gin Asp Tyr Tyr His Gly He Tyr Gin Ala 

580 565 590 

Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Trp Asp Ala Lys Lys 

595 600 605 

Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr Leu Asn 

<510 615 620 

Phe Asn Arg Pro Tyr Trp Leu Gly Gin He Ala Tyr Ala Cys Asn Arg 
€25 630 635 640 

Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro Gin Thr 

645 650 655 

Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr Gly Ala 

660 665 670 

Lys Gin Gin Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu Glu Glu 

€75 680 685 

He Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His Ala Leu 

690 695 700 

Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arq Ser Lvs 
■705 710 715 720 

Ser He Gin He Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro 

725 730 735 

Thr He Ala Gly Asp Asn Lys His He Pro He Ser Leu Glu Ala Leu 

7 40 745 750 

Ala Ala Gin Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr He 

7 55 760 765 

Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ssr Leu Arq Glv Glu Val 

77n 775 780 

Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp Leu Asp 
785 790 795 800 

Pro Ala Ala Phe Val Gin Met He His Gly Glu He Leu Gin Lys Asp 

805 810 ei5 

Asn lie Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys Lys Leu 

820 825 830 

His Asn Leu Ser Ser Gly Ser He Phe Glu Glu Asp Gly Gin Asn Glv 

835 840 ... 845 

Asn Met He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp 

950 855 e60 

Lys Asp Cys Tyr Val He Pro Gly Tyr Leu Pro Leu His Ser Asp Asp 
865 670 8-75 P 880 

Glu Ala Tyr Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asu Phe Val 
885 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



308/ 490 



60 



Leu 


Lys 


Phe 


Glu 


Arg Phe 


He 


Pro 


Phe 


Gly 


Leu 


He 


Asn 


Gin 


He 


He 








900 








905 










910 






Ala 


Tyr 


Tyr 


Gly Arg Glu 


Glu 


Gly 


Ala 


Leu 


Lys 


Arg 


Tyr 


Trp 


Arg 


Asp 






91b 








920 










925 








Gin 


Val 


He 


Phe 


Thr Ala 


Gly 


Arg 


Glu 


Met 


Asp 


Arg 


Gin 


Thr 


Leu 


Glu 




930 








935 










94 0 










Gin 


Glu 


Glu 


Glu 


Lys Glu 


Gly 


Leu 


Pro 


Lys 


Thr 


Asn 


Ala 


Glu 


Asp 


Tyr 


945 








950 










955 










960 


Gin 


He 


Trp 


He 


Lys Leu 


Asp 


Phe 


Thr Asp 


Leu 


Ala 


He 


Ser 


Val 


Phe 










965 








970 










97 5 




lie 


Ly3 


Glu 


Gin 


Arg Lys 


Thr 


Ser 


Ala 


Lys 


Asp 


Met 


Gin 


Arg 


Lys 


Glu 








980 








985 










990 






Ala 


Thr 


He 


Leu 


Ser Asp 


Met 


Leu 


Asp 


Met 


Tyr 


Trp 


Asn 


Asn 


He 


Pro 






995 








1000 








1005 






Pro 


Arg 


Glu 


Gin 


He Gly 


Asp 


Lys 


Asp 


Thr 


Glu 


Gin 


Thr 


Arg 


Ser 


Thr 




1010 






1015 








1020 








lie 


Arc/ 


Glu 


Thr 


Asn Arg 


I.ys 


Lys 


Arg 


Pro 


He 


Gin 


Asp 


Leu 


Tyr 


Leu 


1025 






1030 








1035 










Ser 


Cys 


JU a 


Gin 


Ala Asp 


Lys 


Asp 


Leu 


Thr 


Glu 


Ser 


His 


Tyr 


He 


Mis 










1045 








1050 






1055 


Leu 


Gly Thr 


Leu 


Asp Asp 


Glu 


Ser 


Lys 


Thr 


Thr 


Ala 


Arg 


He 


Ala 


Ala 



10 



15 



20 

Leu Gly Thr 

1060 1065 1070 

Tyr Pro I.eu Lys Asn Gly Val He Asp Lys Glu Arg Val Arg Glu Val 
1075 1080 1085 

25 Ser Thr Arg Pre Tyr Lys His Leu Ser Val Asn Ly3 Asn Leu Ala Thr 
1090 1095 1100 

Ala Lys Gin He Phe He Ser Tyr Ser Lys Glu Asp Gin Thr Glu Leu 
1105 1110 1115 1120 

Glu Thr Cys Leu Gin Phe Phe Lys Pro Leu Glu Lys Asn Gly Gin He 
30 1125 1130 1135 

Glu He Tyr Tyr Asp Lys Leu Thr Lys Phe Glu Thr Pro He His Pro 

1140 1145 1150 

Glu He Arg Lys Arg He Val Glu Ala Asp Cys He He Ala Leu He 
1155 1160 1165 

35 Ser Gin Arg Tyr Leu Ala Thr Asp Tyr He Leu Asp His Glu Leu Pro 
1170 1175 1180 

Val Phe Arg Glu Tyr Asn Lys Thr He Val Pro He Leu He Lys Pro 
1185 1190 1195 1200 

Cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe Ala Gin 
40 1205 1210 1215 

Lys Ala Gin He He Asn Leu Gly Lys Glu Gly Lys Thr He Lys Ala 

1220 1225 1230 

Tyr Asp Ser He Thr Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala 
1235 1240 1245 

45 Val Val Arg Glu Phe Lys Glu Lys He Leu Arg He Thr Lys Gin Glu 
1250 1255 1260 

Val Asn Thr Asp Glu 
1265 

50 {2) INFORMATION FOR SEQ ID HO: 376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 
(D) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii> HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

fA) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
65 (A) NAI IE/KEY: misc_feature 

(B ) LOCATION 1. . .231 

{Xi» SEQUENCE DESCRIPTION: SEQ ID NO: 37 6 

70 Lys Phe Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu 
15 10 15 

Phe Ser S-r Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg 

20 25 30 

Ser Tyr Phe Thr Gly Arg He Glu Lys Val Ser Lou Asn Leu Gly Val 

75 35 40 45 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



309/ 490 



45 



50 



55 



60 



65 



70 



75 



Pro 


Pro 


Val 


Ser 


Thr 


Glu 


Val Trp 


Gly 


Met 


Thr 


His 


Asp 


Ala 


Asn 


Gly 




50 










55 








60 










Leu 


Pro 


Phe 


Glu 


He 


Pro 


He Ser 


Phe 


Ser 


Arg 


Phe 


Asn 


Ser 


Gin 


Gly 


65 










TO 








75 










80 


Asp 


He 


Ala 


Thr 


Thr 


Tyr 


Tyr He 


Ala 


Asn 


Ser 


Glu 


Ala 


Thr 


Leu 


Asn 










85 








90 










95 




Glu 


Trp 


Cys 


hsp 


Tyr 


Ala 


His Pro 


Gly Gly 


He 


Val 


Arg 


Val 


Glu 


Gly 








100 








105 










110 






Arg 


Phe 


Trp 


Lys 


Met 


Thr 


Tyr Asn 


He 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


Cys 






115 








120 










125 








Thr 


Acq 


Tie 


Thr 


Phe 


Glu 


Asn Gin 


Glu 


Tie 


Glu 


Gly 


Thr 


He 


Val 


Leu 




130 










135 








140 










lie 


ero 


tys 


Pro 


Lys 


Val 


Ser Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


Pro 


145 










150 








155 










160 


Cys 


He 


Arg 


Thr 


Glu 


Ala 


Gly Arg 


Glu 


Phe 


He 


Leu 


Cys 


Glu 


Glu 


Asp 










165 








no 










175 




Asp 


Thr 


Phe 


Val 


Ser 


His 


Asp Gly Asn 


Glu 


Val 


Thr 


He 


Gly Gly 


T.ys 








180 








185 










190 






Pro 


Phe 


Leu 


Leu 


Asn 


Thr 


Asn Val 


Lys 


He 


Val 


Gly 


Asp 


Val 


Ser 


Gin 






195 








200 










205 








Lys 


Tyr 


Ala 


Val 


Gly 


Val 


Gly Glu 


He 


Arg 


Phe 


Leu 


Glu 


lis 


Cys 


Ala 




210 










215 








220 










Gin 


Thr 


Val 


Ser 


Gin 


Gin 


Lys 


















225 










230 




















(2) 


IH FORMAT! ON 


FOR 


SEQ 


ID NO:377 

















10 



15 



20 



25 



ti> SEQUENCE CHARACTERISTICS: 

(A J LEMGTH: 563 amino acids 
30 (E) TYPE: amino acid 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

40 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 
{B| LOCATION 1...563 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:377 



Ser 


Leu 


He 


Asn Asn 


Ser 


Arg 


Asp 


Trp 


Arg 


Ala 


Pro 


Val 


Arg 


Leu 


ser 


1 






5 










10 










15 




Leu 


Lys 


Lys 


Lys Thr 


Lys 


Thr 


Met 


I,y& 


Thr 


Lys 


Val 


Leu 


Arg 


Lys 


Phe 








20 








25 










30 






Val 


Val 


Ala 


Ala Phe 


Ala 


Val 


Ala 


Thr 


Leu 


Cys 


Pro 


Leu 


Ala 


Gin 


Ala 






35 








40 










45 








Gin 


Thr 


Met 


Gly Gly Asp 


Asp 


Val 


Lys 


Val 


Val 


Gin 


Tyr Asn 


Gin 


Glu 




50 








55 










60 










Lys 


Leu 


Val 


Gin Thr 


Arg 


Met 


Ser 


Val 


Ala 


Asp 


Asn 


Gly Trp 


He 


Tyr 


65 








70 










75 










80 


Val 


Met 


Thr 


His Ser 


Gly 


Tyr 


Asp 


Thr 


Gly 


Asn 


ser 


Asn 


Val 


Lys 


He 








85 










90 










95 




Phe 


Arg 


Ser 


Lys Asp 


Gin 


G"ly 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu Arg 


Asp 


Trp 








100 








105 










110 






Aep 


Pro 


Ser 


Asp Asp 


Tyr 


Gin 


Phe 


Gin 


Asp 


Ph- 


Asp 


He 


Val 


Val 


Thr 






115 








120 










125 








Gly Lys 


Asn 


Glu Ser 


Asp 


He 


Lys 


He 


Trp 


Ser 


Val 


Glu 


Leu 


Met 


Asn 




130 








135 










140 










Lys 


Pro 


Gly 


Gly Tyr 


Lys 


Ser 


Arg 


Val 


Ala 


Val 


Phe 


Ser 


Arg 


Asp 


Ala 


145 








150 










155 










160 


Asn 


Ala 


Gin 


Asn Ala 


Ly3 


Leu 


Val 


Tyr 


Lys 


Glu 


Asp 


Phf> 


Ser 


Asn 


Val 








165 










170 










175 




Gin 


Leu 


Tyr 


Asp Val 


Asp 


He 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ser 


Pro 


Ser 


Scr 








180 








185 










190 






Leu 


Asn 


Asn 


Gly Gly Asn 


Pro 


Phe 


Ala 


Leu 


Ala 


Phe 


Ala 


Tyr 


Thr 


Gly 






195 








200 










205 








Phe 


Asn 


Asn 


Thr His 


Lys 


He 


Ser 


Phe 


Val 


Asp 


Tyr 


Val 


Phe 


Ser 


Leu 




210 








215 








220 










Asn Gly Gly 


Gin Asn 


Phe 


Asn 


Lys 


Asn 


Leu 


Leu 


Phe 


Ser 


Gin 


Asp Gly 


225 








230 










235 










240 
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Glu 


Lys 


Lys 


He 


Asp 
245 


Lys 


Val 


Asp 


Leu 


Ser 
250 


Leu 


Gly 


Ser 


Thr 


Ser 
255 


Glu 


Ser 


Met 


Gly 


His 
Z60 


Asn 


Ala 


Trp 


Pro 


Leu 

265 


Met 


Gly 


Val 


Val 


Phe 
270 


Glu 


Met. 


Asn 


Lys 


Gin 


Gly Gly 


Lys 


Ser 


Asp 


He 


Gly 


Phe 


Leu 


Ser 


Asn 


Phe 


Val 






27 5 










230 










285 








Asp 


Asn 
290 


Asp 


Pro 


Glu 


Phe 


Gin 
295 


Trp 


Ser 


Gly 


Pro 


He 
300 


Lys 


Val 


Ser 


Glu 


Sex 


Asp 


Met 


S»r 


Phe 


Ser 


Pro 


Lys 


He 


Gin 


Met 


Leu 


Leu 


Asp 


Glu 


Asp 


305 










310 










315 










320 


Asn 


Asn 


Thr 


He 


Asn 


Gly 


Glu 


Ser 


Cys 


His 


Asn 


Phe 


Het 


He 


Thr 


Tyr 










325 










330 










335 


Ser 


Asp 


Tyr 


Asp 


Ser 


Glu 


Tyr 


Ser Asp 


Trp Asp 


He 


Arg 


Tyr 


Val 


Tyr 








340 










345 










350 






Pro 


Lys 


Lys 
355 


Ser 


Phe 


Lys 


Tyr 


Glu 
360 


Lys 


Gly 


Lys 


Thr 


Pro 
365 


Thr 


Het 


A**p 


Asp 


Leu 
370 


Val 


Glu 


Ala 


Phe 


Leu 
37 5 


Thr 


Ala 


Ser 


Tyr 


Gin 
380 


Ser 


Glu 


Thr 


Asn 


Ser 


Gly 


Leu 


Gly Tyr 


Asp 


Lys 


Asn 


Ala 


Asn 


His 


Tyr 


Leu 


He 


Thr 


Tyc 


385 










390 










395 










400 


Ala 


Lys 


Lys 


Glu 


Glu 


Asn 


Gly Thr Asn Thr 


Leu 


Lys 


Tyr Arg 


Trp 


Ala 










405 










410 










415 




Asn 


Tyr 


Asp 


Lys 


He 


Hie 


Asn 


Lys 


Asp 


Leu 


Trp 


Ser 


Asp Thr 


Phe 


Thr 








420 










425 










430 






Tyr 


Thr 


Ser 
4 35 


Ser 


Ala 


Asn 


Ala 


Leu 
440 


Tyr 


Thr 


Pro 


Gin 


Val 
445 


Asp 


He 


Asn 


Pro 


Thr 


Ly- 


Gly Leu 


Val 


Cys 


Trp 


Ser 


Trp 


Val 


Glu 


Tyr 


Leu 


Pro 


Cli- 




4 50 










4 55 










460 










Lys 
465 


Arg 


lie 


Val 


Trp 


Ser 


Asp 


Thr 


Gin 


Trp 


Thr 


His 


Ala 


Asn 


Gly 


val 










470 










475 










480 


Glu 


A^p 


He 


Val 


Met 


Gin 


Glu 


Gly Ser 


Met 


Lys 


Leu 


Tyr 


Pro 


Asn 


Pro 










485 










4 90 










4 95 




Ala 


Gin 


Glu 


Tyr 


Ala 


Val 


He 


Ser 


Leu 


Pro 


Thr 


Ala 


Ala 


Asn 


Cys 


Lys 








500 










505 










510 




Ala 


Val 


Val 


Tyr 


Asp Met 


Gin 


Gly Arg 


val 


Val 


Ala 


Glu 


Ala 


Ser 


Phe 






515 










S20 










525 








Ser 


Gly 


Asn 


Glu 


Tyr 


Arg 


Leu 


Asn 


Val 


Gin 


His 


Leu 


Ala 


Lys 


Gly Thr 




530 










535 










540 








Tyr 


lie 


Leu 


Lys 


Val 


Val 


Ser 


Asp 


Thr 


Glu 


Arg 


Phe 


Val 


Glu 


Lys 


Leu 



545 550 555 560 

He Val Glu 



(2) INFORMATION FOR SEQ ID NO: 37 8 

ti> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 86 atnino acids 
<B) TYPE; amino acid 
<D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

<iii) HYPOTHETICAL: YES 

(vil ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas qingivalis 

(ix) FEATURE: 

{A} NAME /KEY: misc_feature 
IB) LOCATION 1. . .786 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 

He Val Tyr L«u Cye Hts Cys Met Asn His Arg Arg Ser Lys Thr Met 

1.5 10 15 

Leu Thr He Arg Asn Phe Leu Leu Phe Cys Cys Leu Ser Leu He Ala 

20 25 30 

Phe Ala Ala Asp Ala Gin Ser Ser Val Ser Ser Gly Arg Arg Leu Thr 

35 40 45 

Glu Tyr Val Asn Pro Phe He Gly Thr Ala Asn Tyr Gly Tnr Thr Asn 

50 55 60 

Pro Gly Ala Val Leu Pro Asn Gly Leu Met Ser Val Thr Pro Phe Asn 
65 70 75 SO 

Val Ser Gly Ser Thr Glu Asn Arg Phe Asp Lys Asp Ser Arg Trp Trp 
85 90 95 



SUBST ITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



311/490 

Ser Ala Pro Tyr Ser Ala Asp Asn Ser Tyr Cys He Gly Phe Ser His 

1°0 105 no 

Val Asn Leu Ser Gly Val Gly Cys Pro Glu Leu Ser Gly Tie Leu L»u 

115 120 125 

Het Ala Thr Ser Gly Thr Ph- Asp Pro Asp Tyr Cys Cys Tyr Gly Ser 

13° 135 140 

Ser Leu Ser Arg Glu Tyr Ala Arg Pro Gly Glu Tyr l.ys Ala Val Leu 
145 150 155 3 60 

Asp Lys Tyr Gly tie Asp Ala Ala Val Thr Val Thr Glu Arg Thr Ala 

165 170 i7b 

Leu Thr Glu Phe Ala Phe Pro Glu Gly Glu Gly His He Leu Leu Asn 

ISO 185 190 

Leu Gly Gin Ala Leu Ser Asn Glu Ser Gly Ala Ser Val Arg Phe Leu 

200 205 
Asn Asp Ser Thr Val Val Gly Ser Arg Leu Met Gly Thr Phe Cys Tyr 

210 215 220 

Asn Pro Gin Ala Val Phe Arg Gin Tyr Phe Val Leu Gin Val Ser Ara 
2 ^ S 230 235 2 40 

Arg Pro He Ser Ala Gly Tyr Trp Lys Lys Gin Pro Pro Met Thr Val 

245 250 255 

Glu Ala Gin Trp Asp Ser Thr Ala Gly Lys Tyr Lys Gin Tyr Asp Gly 

260 265 270 

Tyr Lys Arg Glu Met Ser Gly Asp Asp He Gly Val Arg Phe Sec Ph- 

2*75 280 285 

Asn Cys Asp Gin Gly Glu Lys He Tyr Val Arq Ser Ala Val Ser Phe 

290 295 300 

Val Ser Glu Ala Asn Ala Leu Tyr Asn Leu Glu Ala Glu Gin Glu Glu 
305 310 31s 32o 

Val Phe Lys Ser Val Gly Gly Asn Pro Ala Lys Ala Phe Ser Ala lie 

325 330 335 

Arg Ser Arg Ala He Glu Arg Trp Glu Glu Ala Leu Gly Thr Val Glu 

340 345 350 

Val Glu Gly Gly Thr Pro Asp Glu Lys Thr He Phe Tyr Thr Ala L-u 

355 360 365 

Tyr His Leu Leu He His Pro Asn He Leu Gin A^p Ala Asn Gly Glu 

370 375 3eo 

Tyr Pro Met Met Gly Ser Gly Lys Thr Gly Afin Thr Ala His Asp Arg 
305 390 40Q 

Tyr Thr Val Phe Ser Leu Trp Asp Thr Tyr Arg Asn Val His Pro Leu 

„ T 410 415 

Leu Cys Leu Leu Tyr Pro Glu Lys Gin Leu Asp Met Val Arg Thr Leu 

420 4 25 430 

He Asp Met Tyr Arg Glu Ser Gly Trp Leu Pro Arg Trp Glu Leu Tyr 

435 440 445 

Gly Gin Glu Thr Leu Thr Met Glu Gly Asp Pro Ser Leu He Val Ho 

450 455 46Q 

Asn Asp Thr Trp Gin Arg Gly Leu Arg Ala Phe Asp Thr Ala Thr Ala 

465 470 475 480 

Tyr Glu Ala Het Lys Lys Asn Ala Ser Ser Ala Gly Ala Thr His Pro 

^ » 485 490 495 

He Arg Pro Asp Asn Asp Asp Tyr Leu Thr Leu Gly Phe Val Pro Leu 

500 505 510 

Arg Glu Gin Tyr Asp Asn Ser Val Ser His Ala Leu Glu Tyr Tyr Leu 

515 520 525 

Ala Asp Trp Asn Leu Ser Arg Phe Ala His Ala Leu Gly His Lys Glu 

fi 30 535 540 

Asp Ala Ala Leu Phe Gly Lys Arg Ser Leu Gly Tyr Arg Hie Tyr Tyr 
545 s 50 555 560 

Asn Lys Glu Tyr Gly Met Leu Cys Pro Leu Leu Pro Asp Gly Ser Phe 

565 570 575 

Leu Thr Pro Phs Asp Pro Lys Gin Gly Glu Asn Phe Glu Pro Asn Pro 

580 S85 590 

Gly Phe His Glu Gly Ser Ala Tyr Asn Tyr Ala Phe Phe Val Pro His 

595 600 60S 

Asp He Gin Gly Leu Ala Arg Leu Met Gly Gly Ala Lys Val Ph- Ser 

610 61S 620 

Glu Arg Leu Gin Lys Val Phe Asp Glu Gly Tyr Tyr Asp Pro Thr Asn 
625 6 35 640 

Glu Pro Asp He Ala Tyr Pro Tyr Leu Phe Ser Tyr Phe Pro Lys Glu 

645 650 ™ 655 

Ala Trp Arg Thr Gin Lys Leu Thr Arg Glu Leu He Asp Lys His Ph- 

660 665 6 70 

Cys Asn Ala Pro Asn Gly Leu Pro Gly Asn Asp Asp Ala Gly Thr M-t 

675 680 685 

Ser Ala Trp Leu Val Tyr Ser Met Leu Gly Phe Tyr Pro Asp Cys Pro 
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WO 99/29870 



312/490 





690 










695 










7 00 










Gly 


Ser 


Pro 


Thr 


Tyr 


Thr 


Leu 


Thr 


Ser 


Pro 


Val 


Phe 


Pro 


Arg 


Val 


Arg 


705 










710 










715 










7 20 


He 


Arg 


Leu 


Asn 


Pro 


Gin 


Tyr 


Tyr 


Pro 


Gin 


Gly 


Glu 


Leu 


He 


He 


Thr 










72S 










730 










735 




Thr 


Asn 


Thr 


Glu 


Asn 


Gin 


Pro 


Thr 


Asp 


Ser 


He 


Tyr 


He 


His 


Thr 


Val 








740 










745 










750 






Ser 


Leu 


Gly Asn 


Lys 


Thr 


Leu 


Pro 


His 


Gly Thr 


Arg 


His 


He 


Ser 


His 






755 










760 










765 








Ala 


Asp 


Leu 


Val 


Arg 


Cys 


Gly His 


Leu 


Arg 


Tyr 


Glu 


Leu 


Ser 


Asn 


Arg 




770 










775 










780 










Pro 


Arg 































765 

12) INFORMATION FOR SEQ ID NO: 37 9 

(i> SEQUENCE CHARACTERISTICS*. 

(A) LENGTH: 814 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vt) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingival is 

Ux) FEATURE: 

(A) NAME/ KEY: miscfeature 

(B) LOCATION 1 . . .614 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 9 



Pro 


Arg 


Phe 


His 


Pro 


Gly 


Gly 


Thr 


His 


Ala 


Gin 


Lsu 


Cys 


Arg 


Aen 


Gly 


1 








5 










10 










15 




Arg 


Arg 


Asn 


Gin 


Lys 


Ser 


Asn 


Gin 


Ser 


Ser 


Cys 


Phe 


Gly 


Gly 


Gly 


Thr 








20 










25 










30 






Ala 


Pro 


Arg 


Phe 


Phe 


He 


Met 


Cys 


Lys 


He 


Arg 


Phe 


Ser 


Leu 


Leu 


Gin 






35 










40 










45 








Ala 


Leu 


Val 


Val 


Cys 


Leu 


Leu 


Phe 


Thr 


Ser 


Phe 


Ser 


Leu 


Gin 


Ala 


Gin 




50 










55 










60 










Glu 


Glu 


Gly 


He 


Trp 


Asn 


Thr 


Leu 


Leu 


Ala 


He 


His 


Lys 


Thr 


Glu 


Lys 


65 










70 










75 










B0 


Ala 


Val 


Glu 


Thr 


Pro 


Lys 


Lys 


Val 


Phe 


Ala 


Val 


Ala 


Asn 


Gly 


Val 


Leu 










85 










90 










95 




Tyr 


Ser 


Val 


Gly 


Lys 


Glu 


Ala 


Pro 


His 


GlU 


Ala 


Lys 


He 


Phe 


Asp Arg 








100 










105 










110 






He 


Ser 


Gly 


Leu 


Ser 


Asp 


Thr 


Ser 


Val 


Ser 


Ser 


He 


Ala 


Tyr 


Ser 


Glu 






115 










120 










125 








Gin 


L-2U 


Lys 


Ser 


Leu 


Val 


He 


Tyr 


Tyr 


Ala 


Ser 


Gly 


Asn 


He 


Asp 


He 




130 










135 










140 










Leu 


Asp 


Glu 


Ala 


Gly 


Arg 


Val 


Thr 


Asn 


Val 


Pro 


Ala 


Leu 


Lys 


Asp 


Asn 


145 










150 










155 










160 


He Asp 


Leu 


He 


Asp 


Lys 


Thr 


Leu 


Asn 


Arg 


Leu 


Leu 


He 


Val 


Gly Arn 










165 










170 










175 




Arg 


Ala 


Tyr 


Leu 


Ala 


Gly 


Gly 


Phe 


Gly 


Leu 


Ser 


Val 


Leu 


ASp 


Val 


Ala 








180 










IBS 










190 






Glu 


Ala 


Arg 


He 


Pro 


Ala 


Thr 


Tyr 


Ala 


Lys 


Gly 


Thr 


Lys 


Val 


Thr 


Asp 






195 










200 










205 








Val 


Ala 


Lys 


Leu 


Asp 


Asn 


Asp 


Arg 


Leu 


Leu 


Met 


Leu 


Lye 


Glu 


Gly Gin 




210 










215 










220 










Leu 


Phe 


He 


Gly 


Lys 


Glu 


Thr 


Aep 


Asn 


Leu 


Gin 


Asp 


Pro 


Ala 


Ala 


Trp 


225 










230 










235 










240 


Thr 


Ala 


Leu 


Ser 


Leu 


Asn 


Leu 


Pro 


Met 


Gly 


Ser 


Val 


Thr 


Gly 


Leu 


Gly 










245 










250 










255 




He 


Val 


Gly 


Glu 


Asp 


He 


Cys 


Phe 


Leu 


Leu 


Ala 


Asp 


Gly 


Arg 


Val 


Tyr 








260 










265 










270 






Val 


Ala 


Ala 


Asn 


Gin 


Ser 


Phe 


Glu 


Pro 


Glu 


Leu 


Leu 


.Leu 


Ser 


Ser 


Ser 






275 










280 










285 








Ala 


Asp 


Ser 


Arg 


Leu 


Tyr 


Val 


Thr 


Asp 


Arg 


Gly 


Leu 


Phe 


lie 


Cys 


Ala 




290 










2 95 










300 










Glu 


Asn 


Arg 


He 


Tyr 


Phe 


He 


Glu 


Lys 


Gly 


Arg 


Lys 


Thr 


Thr 


Gin 


Phe 


305 










310 










315 










320 


Pro 


He 


Ala 


Asp 


Val 


Leu 


Gly 


Val 


Gly 


Ala 


Met 


Asn 


Glu 


Ser 


Asn 


Thr 
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325 330 335 



Ala 


Tyr 


He 


Ala 


Leu 


Gly 


Glu 


Glu 


Gly 


Leu 


Ala 


Ser 


Leu 


Leu 


Leu 


Ala 








340 










345 










350 






Glu 


Giy 


Ser 


Thr 


Ala 


Glu 


Ala 


Met 


Pro 


Val 


Ala 


Phe 


Asp 


Gly 


Pro 


Gly 






355 










360 










365 








Asp 


Asn 


Asp 


Phe 


Tyr 


Glu 


Met 


Arg 


Phe 


Ser 


His 


Gly 


Arg 


Leu 


Tyr 


Ala 




37 D 










375 










380 










Ala 


Ser 


Gly 


Leu 


Trp 


Gly 


Thr 


Asn 


Leu 


Met 


Gly 


His 


Ala Gly Met 


Val 


385 










390 










3S5 










400 


Lys 


Leu 


Tyr 


Asp Gly 


Asn 


Arg 


Trp 


Thr 


Asn 


Phe 


Asp 


Lys 


Lys 


Thr 


Val 










4 05 










410 










415 




Gin 


Glu 


Gin 


Leu 


Gly 


Gly 


Gly 


Phe 


Ser 


Phe 


Asn 


Asp 


Ala 


He 


Asp 


He 








420 










425 










4 30 






Ala 


Val 


Ser 


Asn 


Gly 


Asp 


Pro 


Asp 


His 


Phe 


Phe 


Val 


Gl y 


Thr 


Trp 


Gly 






•135 










440 










445 








Asa 


Gly 


Leu 


e 




Phe 


Lys 


Asp 


Gl y 


Lys 


Ala 


He 


Ala 


Arg 


Tyr 


Ser 




450 










455 










460 










Gly Asn 


Glu 


Thr 


Al a 


He 


Ala 


Glu 


Cys 


Asn 


Pro 


Gly 


Asp 


Ala 


Arg 


Val 


465 










470 










47 5 










480 


Lys 


Ala 


lie 


a 


Ph« 


Asp 


A&n 




Gly 


Asn 




Trt> 


Gly 


Thr 


Leu 


Gly 










4 85 










4 90 










495 




Ala 


Val 


Gly 




Asn 


He 


Phe 


Met 


Tvr 


Asp 


Pro 


Gin 


Ser 


Ser 


Thr 


Trp 


















505 










510 






His 


Ser 


Phe 


er 


iyr 


Pro 




Val 


Ala 


Asn 


Leu 


Ala 


Ser 


Phe 


Gly Asn 






515 










520 










525 








Met 


He 


He 


Leu 


Pro 


Asn 


Gly 


A-p 


i -«=• 

-"jr — 


- 

i rp 


Val 


Asn 


He 


Leu 


His 


Arg 




530 










535 










54 0 










Ser 


Gly 


Gly 


Ser 


Thr 


Arg 


Lys 


Gly 


Val 


Leu 


He 


Phe 


Asn 


Asp 


Arg 


Gly 


545 










550 










555 










560 


Thr 


Pro 


Glu 


Thr 


Thr 


Ser 


Asp 


Asp 


Ser 


His 


Leu 


Tyr 


Val 


Glu 


Gin 


Phe 










565 










57 0 








57 5 




Val 




Arg 


Leu 


Gly 


Ala 


Ala 


He 


Gly 


His 




Thr 


He 


Tyr Ala 


Met 








580 










585 










590 






Ala 


Val 


P 


His 


Asn 


Gly 


Ser 


Val 


Trp 


Met 


Gl v 


Ser 


Asp 


He 


Gly 


He 






595 










600 










605 








Phe 


Gly 


Val 


Tyr 


Asn 


Ala 


Ala 


Gly 


Val 


Leu 


Ser 


Ser 


Thr 


Ser 


Thr 


Pro 




610 










615 








620 










He 


Ala 


Val 


Arg 


Pro 


Val 


Gly 


Gly 


Glu 


Glu 


Pro 


Asn 


Leu Tyr 


Tyr 


Val 


625 










630 










635 










640 


Leu 


Asp 


Lys 


Val 


Thr 


Val 


Thr 


Asp 


He 


Val 


Val 


Asp 


Lys 


Lou 


Asn 


His 










645 










650 










655 




Lys 


Trp 


Val 


Ala 


Thr 


Gin 


Gly 


Thx 


Gly 


Leu 


Tyr 


Leu 


Leu 


Ser 


Glu 


Asp 








660 










665 










67 0 






Cys 


Ser 


Lys 


He 


Leu 


Ala 


Gin 


Phe 


Thr 


Val 


Glu 


Asn 


Ser 


Pro 


Leu 


Leu 






67 5 










680 










685 








Ser 


Asn 


Asn 


He 


Leu 


Ser 


Leu 


Ala 


Leu 


Asn 


Asp 


Asp 


Asn 


Gly 


Leu 


Leu 




690 










695 










7 00 










Tyr 


lie 


Gly 


Thr 


Ala 


Asp 


Gly 


Leu 


Met 


Thr 


Phe 


Gin 


Thr 


Gly Thr 


Gly 


705 










710 










715 










720 


Set 


Gly 


Ser 


Ala 


Ser 


Glu 


Leu 


Asp 


Gly 


Val 


Tyr 


Val 




Pro 


Asn 


Pro 










725 










730 










735 




Leu 


A eg 


Pro 


Glu 


Tyr 


Pro 


Aep 


Gly 


Val 


Thr 


He 


Ala 


Gly Leu 


Gin 


Ala 








740 










745 










750 






Gly Cys 


Ser 


Val 


Lys 


He 


Thr 


Asp 


Thr 


Thr 


Gly 


Arg 


Leu 


Leu 


Tyr 


Glu 






755 










760 










765 








Thr 


Glu 


Ser 


Val 


Thr 


Thr 


Glu 


Val 


Lys 


Trp 


Asn 


Ala 


Arg Gly Ala 


Asp 




770 










775 










7 90 










Gly Asn 


Arg 


Val 


Ala 


Ser 


Gly 


Val 


Tyr 


Ala 


Val 


Ala 


Val 


Tyr 


Asp 


Pro 


785 










790 










795 










800 


Val 


Ser 


Lys 


Lys 


Ser 


Lys 


Leu 


He 


Arg 


Phe 


Ala 


Val 


He 


Arg 







BOS 810 



(2) INFORMATION FOR SEQ ID NO: 380 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1162 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(ii) IIOLECULt TYPE: protein 

(ili) HYPOTHETICAL: YES 

(vU ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginqivalis 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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fix) FEATURE: 

(A) HAHE/KEY: miscjeature 
< 8 ) LOCATION 1 ... 11 62 

Ui) SEQUENCE DESCRIPTION: SEQ ID HO:3flO 





lis 


Ser 


Gin 


Met 


Lys 


Arg 


He 


Leu 


Pro 


He 


Val 


Ala 


Phe 


Leu 


Ser 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Ala 


Leu 


Ala 


Leu 


Pro 


Ala 


Lys 


Ala 


Gin 


Arg 


Ala 


Met 


Gly 








20 










2b 










30 






Lys 


Thr 


Ala 


Asp 


Arg 


Ser 


Leu 


Met 


Ala 


Ser 


Gly 


His 


Trp 


Val 


Lys 


He 






35 










40 










45 








Arg 


Val 


Asp 


Ala 


Ser 


Gly 


Val 


Tyr 


Arg 


Leu 


Thr 


Asp 


Glu 


Gin 


Leu 


Arg 




50 










55 










60 










Ala 


Asn 


Gly 


Phe 


Ser 


Asp 


Pro 


Ser 


Lys 


Val 


Gly 


Val 


Phe 


Gly Tyr 


Gly 


65 










70 










75 










80 


Gly 


Gly 


Val 


Leu 


Pro 


Glu 


Asp 


Leu 


Ser 


Arg 


He 


Thr 


Thr 


Asp 


Asp 


Leu 










85 










90 










95 




Pro 


Pro 


Val 


Pro 


Val 


Leu 


Arg 


Gin 


Gly 


Asn 


Ala 


Leu 


Tyr 


Phs 


Tyr AIa 








100 










105 










110 






Val 


Gly 


Pro 


Val 


Thr 


Trp 


Phe 


Tyr 


Asn 


Pro 


Ala 


Lys 


Thr 


Thr 


Met 


Glu 






115 










120 










125 








His 


Thr 


Val 


Asn 


Thr 


Tyr 


Ser 


Thr 


His 


Gly 


Tyr 


Tyr 


Phe 


Leu 


ser 


Asp 




130 










135 










140 










Ala 


Al,", 


Gly 


Ala 


Pro 


Leu 


Gin 


Met 


Ser 


Gin 


Tyr 


Thr 


Gly Gly Gly Ala 


145 










150 










155 










160 


Ser 


Ala 


niu 


Ala 


Leu 


He 


Asp 


Tyr 


Tyr 


Asp 


Glu 


Leu 


Met 


Leu 


His 


Glu 










165 










170 










175 




Gin 


Glu 


Leu 


Tyr 


Ser 


Pro 


Lys 


Glu 


Ser 


Gly 


Arg 


Asp 


Leu 


Tyr 


Gly 


Glu 








100 










185 










190 






Ser 


Phe 


Ser 


Ala 


Val 


Asn 


Thr 


Arg 


Thr 


Val 


Lys 


Phe 


Pro 


Leu 


Arg 


Gly 






195 










200 










205 








Asa 


Thi 


Arg 


Scr 


Ser 


Gly 


Glu 


Leu 


Gly 


Thr 


Val 


Phe 


Ser 


Tyr 


He' 


Ala 




210 










215 










220 










Lys 


Ala 


Arg 


Ser 


Ala 


Gly 


Gly 


Gly 


Arg 


Glu 


Ilet 


Ser 


Leu 


Ser 


Ala 


Asn 


225 










230 








235 










240 


Gly 


lie 


Leu 


TlG 


Phe 


Ser 


Asp 


Pro 


Phe 


Ser 


Met 


Thr 


Ser 


Asn 


Glu 


Val 










245 










250 










255 




Ser 


Asn 


Ser 


Tyr 


Leu 


Ala 


Gly 


Lys 


Lys 


Arg 


Arg 


Leu 


Tyr 


His 


Ser 


Thr 








260 










265 










27 0 






Pro 


(let 


Asn 


ser 


Leu 


Val 


Asn 


Glu 


Leu 


Arg 


Leu 


Asp 


Ala 


Asn 


Tyr 


Ser 






2*75 










2B0 










285 








Het. 


Thr 


Gly 


Asp 


Ala 


Val 


Asn 


Leu 


Asp 


Phe 


lie 


Glu 


Val 


Ala 


Thr 


Gin 




290 










235 










300 










Asn 


Asp 


Leu 


Arg 


Tyr 


Asp 


Gly 


Ala 


Pro 


Met 


H\3 


lie 


Arg 


Arg 


Phe 


Ser 


305 










310 










315 










320 


Asn 


Leu 


Pro 


Val 


Leu 


Gly 


Gly 


Glu 


Ser 


Cys 


Arg 


Plie 


Val 


He 


Ser 


Glu 










325 










330 










335 




Val 


Pro 


Glu 


Ser 


Leu 


Val 


Val 


Leu 


Gin 


Ala 


Asn 


Ser 


Ser 


Leu 


Thr 


Ala 








340 










345 










350 






Ser 


Leu 


Val 


Pro 


Val 


Lys 


Thr 


Val 


Gly 


Asp 


Lys 


Thr 


He 


Glu 


Phe 


Val 






355 










360 










365 








Ala 


Pro 


Pro 


Lys 


Gly 


Gin 


Asp 


Arg 


Arg 


Thr 


He 


Asn 


Thr 


Phe 


Tyr Ala 




370 










37 5 










380 










Val 


Asp 


Leu 


Ser 


Gin 


Ala 


Ser 


Ala 


Pro 


Glu 


He 


Leu 


Gly Air, 


Val 


Pro 


385 










390 










395 










400 


Asn 


Gin 


Asn 


Leu 


His 


Gly 


Glu 


Glu 


He 


Pro 


Asp 


Leu 


He 


He 


Val 


Se.r 










4 05 










410 










415 




Thr 


Gin 


Ala 


Leu 


Leu 


Leu 


Glu 


Ala 


Asp 


Arg 


Leu 


Ala 


Thr 


Tyr 


Axg 


Arg 








420 










425 










430 






Glu 


Lys 


Asn 


Gly 


Leu 


Lys 


Val 


Leu 


Val 


Val 


Leu 


Gin 


Glu 


Gin 


Val 


Phe 






435 










440 










445 








Asn 


Glu 


Phe 


Ser 


Gly 


Gly 


Thr 


Pro 


Asp 


Ala 


Thr 


Ala 


Tyr 


Arg 


Leu 


Phe 




450 










4 55 










460 










Ala 


Ly? 


tlet 


Phe 


Tyc 


Asp 


Arg 


Trp 


Lys 


Ala 


Asn 


Ala 


Pro 


Val 


Gly 


Glu 


465 










470 










475 










480 


Thr 


Phe 


Pro 


Met 


Gin 


Met 


Leu 


Leu 


Phe 


Gly 


Asp 


Gly 


Ala 


His 


Asp 


Asn 










485 










490 










4 95 




Arg 


Lys 


Val 


Ser 


Val 


Ala 


Trp 


Gin 


Lys 


Pro 


Tyr 


Leu 


Gin 


Gin 


Thr 


Glu 








500 










505 










510 






Phe 


L*u 


Leu 


Thr 


Phe 


Gin 


Ala 


Val 


Asn 


Ser 


Thr 


Asn 


Val 


Asn 


Ser 


Tyr 






515 










520 










525 






Val 


Thr 


Asp 


Asp 


Tyr 


Phe 


Gly 


Leu 


Leu 


Asp 


Asp 


Gin 


Pro 


Ala 


Ser 


Val 




530 










535 








540 
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Asn 


He 


Gly 


Trp Arq 


Asn Tyr 


Asn 


Met 


Ala 


Val 


Gly 


Arg 


Phe 


Pro 


Val 


545 










550 








555 










560 


Arg 


Thr 


Pro 


Ala 


Glu 


Ala Arg 


He 


Ala 


Val 


Asp 


Lys 


Thr 


He 


Arg 


Tyr 










565 








57 0 










575 




Glu 


Glu 


Asp 


Arg 


Glu 


Ser Gly 


Ala 


Trp 


Arg 


He 


Arg 


Ala 


Cys 


Phe 


Ala 








580 








585 










590 






Ala 


Asp 


Asn 


Gly 


Asp 


Lys His 


Ala 


Thr 


GlU 


Thr 


Ser 


Arg 


Leu 


He 


Asp 






595 








600 










605 








Thr 


Val 


Lys 


Arg 


Tyr 


Ala Pro 


Ala 


He 


Met 


Pro 


Val 


Arg 


Ala 


Phe 


Gin 




610 








615 










620 










Asp 


Val 


Tyr 


Pro 


His 


Val He 


GlU 


Asn 


Gly 


Leu 


His 


Set 


He 


Pro 


Gly 


625 










630 








635 










640 


Ala 


Vys 


Lys 


Lys 


Met 


Leu Glu 


Thr 


Leu 


Gin 


Ser 


Gly 


He 


He 


LeU 


Leu 










645 








650 










655 




Asn 


Tyr 


Ala 


Gly His 


Gly Gly 


Pro 


Ala 


Gly Trp 


Ser Asp 


GlU 


His 


Leu 








660 








665 










670 






Leu 


Thr 


Leu 


Asn 


Asp 


He His 


Lys 


Phe 


Asn 


Tyr 


Lys 


His 


Met 


Pro 


He 






67 5 








680 










685 








Trp 


He 


Thr 


Ala 


Thr 


Cys Asp 


Phe 


Ala 


Asn 


Tyr 


Asp 


Ser 


Gin 


Thr 


Thr 




690 








695 










700 










Ser 


Ala 


Gly 


Glu 


Glu 


Val Phe 


Leu 


His 


Glu 


Lys 


Ser 


Gly 


Thr 


Pro 


He 


705 










710 








715 










720 


Met 


Phe 


Ser 


Thr 


Thr Arg Val 


Val 


Tyr 


Aan 


Thr 


Gin 


Asn 


Glu 


Lys 


He 










725 








730 










735 




Asn 


Gly 


Phe 


Met 


Leu 


Arg Arg 


Met 


Phe 


Glu 


Lys 


Ala 


Lys 


Asp 


Gly 


Arg 








740 








745 










750 






Tyr 


Arq 


Thr 


tlct 


Gly Glu He 


He 


Arg 


Ser 


Ala 


Lys 


Gin 


Gly 


Met 


Lsu 






755 








760 








765 








Ser 


Thr 


Val 


Phe 


Pro 


Asp Ser 


He 


Asn 


Gin 


Leu 


Ser 


Phe 


Phe 


Leu 


Met 




770 








77S 










780 










Gly 


Asp 


Pro 


Scr 


Val 


Arg Met 


Asn 


Leu 


Pro 


Thr 


His 


Lys 


Val 


Gin 


Leu 


705 










790 








795 










800 


Thr 


Ala 


lie 


Asn 


Gly Gin Asp 


Pro 


Glu 


Gly Gin 


Tyr Gly 


Thr 


He 


Met 










805 








810 










B15 




Leu 


Lys 


Ser 


Leu 


Glu 


Arg Val Ala 


Leu 


Lys 


Gly 


Lys 


Val 


Thr 


Asp Glu 








620 








825 










830 






Lys 


Gly 


Thr 


Phe 


Asp 


Glu Thr 


Phe 


Ser 


Gly 


Lys 


Val 


Phe 


Leu 


Thr 


Val 






83S 








840 










845 








Phe 


Aep 


Gly 


Arg 


Lys 


Lys Met 


Thr 


Ala 


Leu 


Glu 


Glu 


Glu 


Gly 


Asn 


Asp 




850 








855 










860 










Leu 


Ser 


Leu 


Val 


Tyr 


Tyr Asp Tyr 


Pro 


Asn 


Val 


Met 


Tyr 


Ala 


Gly He 


865 










870 








875 










080 


Ala 


Glu 


Val 


Lys 


Asp 


Gly Leu 


Phe 


Glu 


Thr 


Ser 


Phe 


lie 


Val 


Pro 


Lys 










885 








890 










8 95 




Asp 


Val 


Asn 


Tyr 


Ser 


Glu His 


Glu Gly Arg 


He 


Asn 


Leu 


Tyr 


Ala Tyr 








900 








905 










910 






Asn 


Glu 


Scr 


Thr 


Lys 


Ala Glu 


Ala 


Met 


Gly Val 


Asp 


Phe 


Ser 


He 


Arg 






915 








920 








925 








Val 


Gin 


Pro 


Gly 


He 


Pro Asp 


Glu 


Val 


Thr 


Glu 


Asp 


Asn 


Thr 


Pro 


Pro 




930 








935 










940 










Glu 


He 


He 


Scr 


Cys 


Phe Leu 


Asn 


Asp 


Ser 


Thr 


Phe 


Arg 


Ser 


Gly Asp 


945 










950 








955 








960 


Glu 


Val 


Asn 


Pro 


Thr 


Pro Leu 


Phe 


Met 


Ala 


Glu 


Val 


Phe 


Asp 


Leu 


Asn 










965 








970 








975 




Gly 


He 


Asn 


He 


Thr 


Gly Ser 


Gly Val 


Gly His 


Asp 


He 


Thr 


Leu 


Cys 








980 








98S 










990 




He 


Asp 


Gly 


Arg 


Ala Asp Leu 


Thr 


Tyr 


Asn 


Leu 


Asn 


Ala 


Tyr 


Phe 


Thr 






995 








1000 








1005 






Ser 


Ser 


Ala 


Thr 


Asp 


Ala Gly Val 


Gly Thr 


He 


Leu 


Phe 


Met 


He 


Pro 




1010 






1015 








1020 








Ala 


Leu 


Ala 


Glu 


Gly Asp Hi3 


Thr 


Ala 


Arg 


Leu 


Thr 


Val 


Trp 


Asp 


He 


1025 








1030 








1035 




1040 


Phc 


Asn 


Asn 


Ala 


Val 


His His 


Asp 


Phe 


Ser 


Phe 


Arg 


Val 


Val 


Asp 


Gly 










1045 






1050 








1055 


He 


Ala 


Pro 


Asp 


Val 


Ala Asp 


Val 


He 


Leu 


Phe 


Pro 


Asn 


Pro 


Val 


Arg 








1060 






1065 








1070 


Glu 


Ser 


Ala 


Thr 


Phe 


Arg He 


Phe 


His 


Asn 


Arg 


Pro Gly Ser Asp 


Leu 



1075 10BO 1085 



Asn Val Ala Val Glu He Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser 

1090 1095 HOO 

Leu Pro Val Lys Thr Tyr Ser Ser Ser Tyr Gly Glu Pro He Glu He 

1105 1110 1115 1120 

Lys Trp Asp Leu Thr Ser Lys Tyr Gly Val Lys lie Gly Asn Gly Phe 

1125 1130 H35 

Tyr Lsu Tyr Arg Cys Val Val Asn Ser Pro Gly Gly Gin Thr Ala Ser 
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N5t M » Ly^Het He Vol Val Gly^Gln 115 ° 

11S5 1160 

(2) INFORMATION FOR SEQ ID MO: 381 

U> SEQUENCE CHARACTERISTICS- 

(A> LENGTH: 973 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
15 HYPOTHETIC AL: YES 

(Vi) ORIGINAL SOURCE- 

•A) ORG AH ISM: Porphyrons* gingival i s 
(ix) FEATURE; 



20 



(Aj NAME /KEY: misc feature 
(E) LOCATION 1. . .973 



25 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:3Dl 
Ph. Tyr Thr Gly xie A.p Leu Hi, Il e Glu Ssr Gltl Met Lya ^ ^ 
Phe Pro Leu Leu Leu L.u He Leu Se r H Val Gly Cy£ £ y ^ 

3Q Lys Glu I.y s His ser Val Thr Glu He Ala Arg Glu Lys lys Arg He 
Thr Ala Leu Leu Tyr Glu Ly 3 Giu Leu Pro Thr Asp Ser Val Ly S Gln 
Leu Tyr Glu Ann Ser Val Gin Asn Lys A,n Leu Val Gly Gin Met Leu 
35 Phe Ala He Glu Val Gly Lys Arg „ et Arg Met Se, Gin Tyr tL 



A3 P Ala Met Leu Tyr Hi, Gin Ly S Gly Leu A.„ Ala Ala Leu Leu 



4Q Arg Asp Thr He Val Ala Ala Gin Ala Trp A*n Hi, Leu Gly Thr A,p 

Ala 
Phe 

45 ill Ly S Ala Arg s »° ^ Leu Asn Gly J" ^ ne ^ 160 

ioa 170 



ser Arg Ar g Ile Gly Ala ^ Ala M- ^ ^ 1« ^ ^ ^ 

Ala Leu Ser Leu Ile G i u Sef phe S£r ^ ^ q J° ^ ^ ^ ^ 



Glu L *u Gly Tye His „ p Glu Ala ^ ^ ^ s 175 

50 "J ^ G1 " A-P »-r Pr' Leu Gly Gln Ala As „ Tyr 

*1. Asn t eu 81y Arg «. Ty , Ar g Gln ^ Lys Glu ^ Lys ^ 

Ar f Thr Tyr Phe ^ ser Leu Glu 220 m ^ ^ ^ 

55 LBU Het aly lla GJy cys ^ ^ 235 v ^ ^ „o 

Glu L>, Sly Asp Tyr Gln Lys Ala Gln oiu Tyr Ala Thr Al" Tyr 

60 Ll ' S 2? S G1 " G1 " £ g Ar 9 Trp His Trp Uu Asn Ser Cys 

lie Pro Met Al a A r g He ^ ^ a „ Gly As „ ^ Arg ^ ^ 

Cln Hi. Ph9 Ile Ser Glu wy 300 ^ u ^ 

Thr S er „ is ,, eu „. olu „. Tyr ^ 3» t ^ A ^ ^ 320 

AT, L„ Ly5 G J U Xyr Lys G1 „ M , 330 A ^ c ^ ^ 335 ^ 

7Q Thr Leu Ser A, P Ser Met Ser lie „, His Lys Val Ser III n e Gln 
Glu Thr Ar, Phe A,n Tyr oi u Arg A, n ly .%« GXn 01 „ Leu Giu 

«u Ile «„ G l„ Val 3„ Lys Ala L y s alB Glu ^ Ser Lyg p „ e 

75 Leu teu ser Thr L6U phe JUa ^ 395 L ^ L ^ ^ , 00 
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405 










410 


Val 


Leu 


Ihr 


1 yr 


Ala 


Tyr 


Arg 


Gin 


Gly 


Lys 








420 










425 




Lys 


Glu 


Thr 


At-p 


Lys 


Leu 


Arg 


Ser 


Gly 


Phe 






435 










440 






Glu 


Phe 


Arg 


Thr 


Pro 


He 


Thr 


Val 


He 


Gin 




■150 










455 








Ser 


Ser 


Ser 


Pro 


Asp 


Leu 


Gin 


Ala 


Ser 


Asp 


465 










470 










11c 


He 


Afp 


Arq 


Gin 


Ser 


Ser 


His 


Met 


I.eu 










4B5 










4 90 


Leu 


Asp 


11* 


Cys 


Lys 


He 


Arq 


Ser 


Gly 


Val 








500 










505 




Aen 


Gly 


A.^p 


He 


Val 


Ser 


Phe 


Val 


Gin 


He 






515 










520 






Plo 


Tyr 


Ala 


Gin 


Ala 


Gin 


Asp 


He 


Thr 


Leu 




S30 










535 








Lys 


Pro 


He 


Val 


Val 


Asp 


Phe 


Val 


Pro 


Ser 


5<15 










550 










Ser 


Am 


Leu 


Leu 


Ser 


Asn 


Ala 


He 


Lys 


Tyr 










565 










510 


Val 


Val 


He 


Ser 


Leu 


Ala 




Thr 


Lvs 










6B0 










585 




Arg 


Val 


Ala 


A^"p 


Asn 


Gly 


He 


Gly 


He 


Asd 






r Of 










600 






He 


Phe 


A.?p, 


He 


Phe 


Tyi. 


Arg 


G3 y 


Gin 


Ser 




610 










615 








S-i 


Gly 


Val 


Gly 




Ser 


Phe 


Thr 


Asn 


He 


625 










630 










Gly 


Thr 


He 


Lys 


Val 


Glu 


Ser 


Gin 


Pro 


Gly 










645 










650 


He 


Ser 


He 


Pro 


Thr 


Gl n 


Asn 


Gin 


Ser 


Ser 








660 










665 




Trp 


Lew 


Pro 


Ser 


Ser 


Asp 


Asp 


lie 


Va) 


Jlet 






67 5 










680 






Asp 


Asp 


Ser 


Fro 


Thr 


Ser 


Pro 


Met 


Val 


Ala 




690 










6 95 








Glu 


Asp 


Glu 


Arq 


Pro 


Thr 


He 


Leu 


Leu 


Val 


705 










710 










Asn 


Leu 


Leu 


Val 


Lys 


Leu 


Leu 


Leu 


Cys 


Asp 










725 










730 


A] n 


Ala 


Asn 


Gly 


Lys 


Glu 


Gly 


He 


Ala 


Leu 








740 










745 




Asp 


He 


He 


He 


Thr 


Asp 


He 


Met 


Mel- 


Pro 






155 










7 60 






Met 


Thr 


He 


Arq 


Met 


Lys 


Gin 


Ser 


Pro 


Leu 




770 










775 








Val 


Ala 


Leu 


Thr 


Ala 


Lys 


Ser 


Thr 


Glu 


Gin 


785 










790 










I.ys 


Ser 


Gly 


Val 


Val 


Ser 


Tyr 


Leu 


Cys 


Lys 










805 










810 


Leu 


Leu 


Met 


Arq 


He 


Glu 


Gin 


Leu 


Leu 


Lys 








820 










825 


Lys 


Phe 


Tyr 


Met 


Gin 


Lys 


Leu 


Met 


Leu 


Asp 






83S 










840 




Pro 


Gin 


Pro 


He 


Asp 


Asp 


Ser 


Ser 


Met 


Gin 




850 










855 








Asp 


Ala 


Val 


S&r 


Gly 


Gly 


He 


Lys 


Gin 


Asn 


865 










870 










Asp 


Leu 


Ala 


•31u 


Lys 


Met 


Cys 


Her 


Ser 


Pro 










8R5 










890 


Leu 


Thr 


Ser 


Val 


Val 


Gly 


Cys 


Ser 


Thr 


He 








900 










905 




Lys 


He 


Lys 


Lou 


Al a 


C\'s 


Lys 


Leu 


Leu 


Ala 






915 










920 






Ser 


Asp 


He 


Ser 


He 


Glu 


Ala 


Gly 


Phe 


Ser 




930 










935 








Arg 


Thr 


Phe 


L**S 


Arg 


Tyr 


Met 


Asn 


Cys 


Ser 


945 










950 










Lys 


Leu 


Leu 


Ala 


Met 


Pro 


Gly 


Ser 


Asp 


Lys 










965 










970 



(2) IIIFORItATIOH FOR SEQ XD NO; 382 
(il S£QUF.t!CE, CHARACTERISTICS: 
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415 



Lys 


His 


Asn 


Lys 


Leu 


He 








430 






Phe 


Thr 


Gly 


He 


Thr 


His 






445 








Gly 


Leu 


Asn 


Glu 


Lys 


Met 




460 










Arg 


Thr 


Glu 


Leu 


His 


Ly.' 


475 










•180 


Asn 


Leu 


Val 


Asn 


Gin 


Leu 










495 




Ser 


Thr 


Pro 


GlU 


Trp 


Arg 








510 






Leu 


He 


Asp 


Ser 


Phe 


Ala 






525 








Glu 


Leu 


Gl n 


Pro 


Glu 


Ser 




54 0 










i yr 


Leu 


Gin 


Lys 


He 


He 


555 










560 


Ser 


Leu 


Ala 


Gly Gly 


Arg 










O » 3 




Glu 


Lys 


Asn 


L^u 


lie 


11 e 








590 






L**s 


Thr 


Asp 


Gin 




Kl £ 






605 








Ala 


Thr 


Glu 


Lys 


His 


Gl v 




620 










Leu 


Val 


Gl U 


Asn 


L^u 


Arg 


635 










640 


Lys 


Gly 


Ser 


Ala 


Phe 


Thr 










655 




Ser 


Ala 


Glu 


He 


Leu 


Pro 








670 






Pro 


Val 


His 


He 


Ala 


Pro 






685 








Ala 


Leu 


Asn 


His 


Arg 


Phe 




700 










Glu 


Asp 


Asn 


Lys 


Asp 


He 


715 










720 


Arg 


Tyr 


Asn 


Val 


Leu 


Ser 










7 35 




Ala 


Thr 


Glu 


His 


He 


Pro 








750 






He 


Met 


Asp 


Gly 


He 


Glu 






765 








Leu 


Cys 


His 


He 


Pro 


He 




780 










Asp 


Arg 


Leu 


Glu 


Gly 


He 


7 95 










800 


Pro 


Phe 


Ser 


Pro 


Glu 


Glu 










815 




Asp 


Arg 


Glu 


Leu 


Leu 


Lys 








830 






Arg 


Lys 


Pro 


Glu 


Clu 


Glu 






845 








Phe 


Leu 


Leu 


Ala 


Ala 


Lys 




860 










Pro 


Asp 


Phe 


Ser 


Ala 


Gin 


875 










RfiO 


Ser 


Gin 


Leu 


Asn 


Arg 


Lys 










895 




Gly 


Tyr 


He 


Gin 


Gl n 


He 








910 






Asp 


Glu 


Ser 


Lys 


Asn 


He 






925 








Asp 


Pro 


Ala 


Tyr 


Phe 


Ser 




940 










Pro 


Ser 


Gin 


Tyr 


Arg 


Gin 


955 










960 


Glu 


Thr 


Val 
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(A) LENGTH: 563 amine addr 
(B> TYPE: amino acid 
t D) TOPOLOGY: linear 



< i i > MOLECULE TYPE: protein 

Uii} HYPOTHETICAL: YES 

(Vl) ORIGINAL SOURCE: 

I A} OR HAM ISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION 1. . .563 



ixi) SEQUENCE DESCRIPTION: 



Hi - 


His 


Lys 


Thr 


Tyr 


Gin 


Thr 


Met 


1 








5 








Leu 


lie 


Leu 


Ala 


Leu 


Gly 


Leu 


Phe 








20 








Gin 


Met 


Gin 


Ala 


Asp 


Arg 


Thr 


Asn 






35 










40 


Gin 


Arg 


Ala 


Phe 


Gin 


Glu 


Thr 


Asn 




50 










55 




11* 


Ala 


Glu 


Tyr 


Glu 


Arg 


Ser 


Ala 


65 










70 








lie 


Pro 


Met 


Glu 


Lou 


He 


Lys 


lie 


Thr 


lie 


Val 


85 
Ala 


Ser 


Glu 


Ser 








100 










Tyr 


Thr 


Gin 


Ser 


Gly 


Val 


Asn 


Leu 






115 










120 


Lys 


Thr 


Asp 


Ser 


Tyr 


Trp 


Thr 


Arg 




130 










135 






A~sp 


Thr 


Asn 


Lys 


Val 


Gly 


Leu 


145 










150 






Arg 


Pro 


Ar?n 


Asp 


Asp 


Glu 


Phe 


Pro 










165 








He 


Glu 


ttet 


Phe 


Gly 


Met 


Lys 


Leu 








100 










Thr 


Arp 


Gly 


Tyr 


Gly 


Ser 


Ala 


Val 






195 










200 


Asn 


Ser 


Ser 


Leu 


Ser 


Cln 


Ala 


Gin 




210 










215 




Leu 


Gl y 


lie 


Thr 


His 


His 


Asp 


Val 


225 










230 






lie 


Asn 


His 


Val 


Asp 


Cys 


Trp 


Gly 










245 








Leu 


lie 


Arg 


Lys 


Val 


Pro 


Asp 


Asn 








260 










Asp 


Met 


Ala 


Ala 


Tyr 


Phe 


Ala 


Ala 






275 










280 


Tyr 


Glu 


Val 


Tyr 


Arg 


Ala 


Leu 


Ala 




290 










295 




Ser 


Leu 


He 


Leu 


Asn 


Asn 


Arg 


Val 


305 










310 






Ser 


Val 


Asp 


Asn 


Asp 


Ala 


Leu 


Asn 










32b 








Tyc 


Glu 


Ik 


He 


Gly 


Val 


Lys 


Gly 








340 










Thr 


Asp 


Ala 


Leu 


His 


Cys 


Arg 


Thr 






355 










360 


Leu 


Tyr 


He 


Lys 


His 


Tyr 


Pro 


He 




37 0 










375 




Tyr 


J.ys 


He 


Glu 


Ala 


Asp 


Val 


Val 


385 










390 






Pro 


Val 


Gin 


Cys 


Tyr 


Tyr 


Arg 


He 










405 








Ala 


A^p 


Met 


Thr 


Met 


Glu 


Ser 


Thr 








420 










Gly 


Leu 


Am 


Lys 


Asn 


Asp 


Lys 


Val 






435 










440 


Asn 


Ser 


Gl> 


Arg 


Lys 


Glu 


Thr 


Tyr 




450 










455 





ID 110:302 



Lys 


Lys 


Leu 


Leu 


Gin 


Ala 


Lys 


Ala 




10 










15 




Gin 


Leu 


Pro 


Ala 


He 


Ala 


Gin 


Thr 


25 










30 






Gly 


Gin 


Phe 


Ala 


Thr 


Glu 


Glu 


Met 










45 








Pro 


Pre 


Aia 


Gly 


Pro 


Val 


Arg 


Ala 








60 








Ma 


Val 


L~u 


Val 


Arg Tyr 


Pro 


Phe 






75 










80 


Glu 


Leu 


Ala 


Lys 


Asn 


Asp 


T.ys 


Val 




90 










95 




Gin 


Lys 


Am 


Thr 


Val 


He 


Thr 


Gin 


10b 










110 






Ser 


Asn 


Cys 


Asp 


Phe 


He 


He 


Ala 










125 








Asp 


Tyr 


Thr 


Gly 


Trp 


Phe 


Ala 


Met 








140 










Val 


Asp 


Phe 


He 


Tyr Asn Arg 


Pro 






155 










160 


Lye 


Tyr 


Glu 


Ala 


Gin 


Tyr 


Leu 


Gly 




170 










175 




Lys 


Gin 


Thr 


Gly 


Gly 


Asn 


Tyr 


Met 


185 










190 






Gin 


Ser 


HiF 


He 


Ala 


Tyr 


Thr 


Glu 










205 








Val 


Asn 


Gin 


Lys 


Met 


Lys 


Asp 


Tyr 








220 










Val 


Gin 


A-^p 


Pro 


Asn 


Gly Glu 


Tyr 






235 










240 


Lys 


Tyr 


Leu 


Ala 


Pro 


Asn 


Lys 


He 




250 










255 




His 


Pro 


Gin 


His 


Gin 


Ala 


Leu 


Glu 


265 










270 






Gin 


Thr 


Cys 


Ala 


Trp 


Gly Thr 


Lys 










285 








Thr 


Asn 


Glu 


Gin 


Pro 


Tyr 


Thr 


Asn 








300 










Pits 


Val 


Pro 


Val 


Asn 


Gly 


Pro 


Ala 






315 










320 


Val 


Tyr 


Lys 


Thr 


Ala 


Mot 


Pro 


Gly 




330 










335 




Ala 


Ser 


Gly 


Thr 


Pro 


Trp 


Leu 


Gly 


345 










350 




His 


Glu 


Val 


Ala 


Asp 


Lys 


Gly Tyr 










365 








Leu 


Gli 


Glu 


Gin 


Ala 


Gly 


Pro 


Asp 








380 










Ser 


Cys 


Ala 


Asn 


Ala 


Thr 


He 


Ser 






3 95 










400 


Asn 


Gly 


Ser 


Gly 


Ser 


Phe 


Lys 


Ala 




410 










415 




Gly 


His 


Tyr 


Thr 


Tyr 


Ser 


Phe 


Thr 


4 25 










430 






Glu 


Tyr 


Tyr 


He 


ser 


Ala 


Ala 


Asp 










445 






Pro 


Phe 


He 


Gly 


Glu 


Pro 


Asp 


Pro 



460 
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Phe Lys Phe Thr Cys 
4 65 

Ala Ala Lys Ala Leu 
485 

Ala Val Ser Val Ser 
500 

Tyr Asn Thr Ala Gly 

515 

Ala Gly Thr Ser Val 

530 

Thr Tyr Val Leu Val 
545 

lie Leu l.yn 



Met Asn Glu Thr Asn Thi 
470 475 
Arg Ala Trp Phe Asn Ala 
4 90 

Leu Asn lie Ala Gly Thr 
505 

Glu Glu Val Ala Ala Met 
520 

Phe Ser Met Asp Val Tyr 
535 

Val Glu Gly Asn Gly lie 
550 555 



Cys Thr Val Thr Cly 
480 

Gly Arg Ser Glu Leu 
4 95 

Tyr Arg He Lys Leu 
51C 

Thr Lys Glu Leu Val 

525 

Ser Gin Ala Pro Gly 
54 0 

Arg Glu Thr Met Lys 
560 



(2) INFORMATION FOR SEQ ID MO: 383 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{lii] HYPOTHETICAL: YES 



£vi) ORIGINAL SOURCE: 

(A J ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME/KEY: mlsc_feature 
(D) LOCATION 1. . .437 

ixi) SEQUENCE DESCRIPTION : SEQ ID NO: 383 

Thr Thr Asn Arg Lys Pro Asn Thr Asn Met Lys Leu Ser Ser Lys Ly- 

1 5 io is 

He Leu Ala He lie Ala Leu Leu Thr Met Gly His Ala Val Gin Ala 

20 25 30 

Gin Phe Val Pro Ala Pro Thr Thr Gly He Arg Met Ser Val Thr Thr 

^ 5 40 45 

Thr Lys Ala Val Gly Glu Lys He Glu Leu Leu Val His Ser II- Glu 

50 ss 60 

Lys Lys Gly He Trp He Asp Leu Asn Gly Asp Ala Thr Tyr Gin Gin 
65 70 75 80 

Gly Glu Glu Tlf» Thr Val Phe Asp Glu Ala Tyr His Glu Tyr Thr He 

B5 90 9 s 

Gly Thr Gin Thr Leu Thr He Tyr Gly Asn Thr Thr Arg Leu Gly Cys 

100 105 no 

Arg Ser Thr Gly Ala Thr Ala Val Asp Val Thr Lys Asn Pro Asn Leu 

115 120 125 

Thr Tyr Leu Ala Cys Pro Lys Asn Asn Leu Lys Ser Leu Asp Leu Thr 

13° 135 i«jo 

Gin Asn Pro Lys Leu Leu Arg Val Trp Cys A.-?p Ser Asn Glu lis Glu 
145 ISO 155 leo 

Ser Leu Asp Leu Ser Gly Asn Pro Ala Leu He He Leu Gly Cys Asp 

165 170 175 

Arg Asn Lys Leu Thr Glu Leu Lys Thr Asp Asn Asn Pro Lys L-u Ala 

ISO 185 100 

Ser Leu Trp Cys Ser Asp Asn Asn Leu Thr Glu Leu Glu Leu Sec Ala 

195 200 205 

Asn Pro Arg Leu Asn Asp Leu Trp Cys Phe Gly Asn Arg He Thr Ly* 

210 215 220 

Leu Asp Leu Ser Ala Asn Pro Leu Leu Val Thr Leu Trp Cys Ser Asp 
225 2 30 2 35 240 

Asn Glu Leu Ser Thr Leu Asp Leu Ser Lys Asn Ser Asp Val Ala Tyr 

245 250 255 

Leu irp Cys Ser Ser Asn Lys Leu Thr Ser Leu Asn Leu Ser Gly Va] 

260 265 270 

Lys Gly Leu Ser Val Leu Val Cys His Ser Asn Gin He Ala Gly Glu 

275 280 285 

Glu Met Thr Lys Val Val Asn Ala Leu Pro Thr Leu Ser Pro Gly Ala 

290 2 05 300 

Gly Ala Gin Ser Lys Phe Val Val Val Asp Leu Lys Asp Thr Asp Glu 
305 310 315 F K 32n 
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Asn 


lie 


Cys 


Thr 
325 


Val 


Lys 


Asp Val 


Glu 

330 


Lys Al a 


Lys 


Ser 


Lys 
335 


Asn 


Trp 


Arq 


Val 


Phe 


Asp 


Phe 


Asn 


Gly Asp 


Ser 


Asp Asn 


Ilet 


Leu 


Pro" Tyr 








340 








34 5 








350 






Glu 


Gly 


scr 
355 


Pro 


Thr 


Ser 


Asn 


Leu Ala 
360 


Val 


A3p Ala 


Pro 

365 


Thr 


Val 


Arg 


11* 


Tyr 


Pro 


Asn 


Pro 


Val 


Gly Arg Tyr 


Ala 


Leu Val 


Glu 


He 


Pro 


Glu 




370 










37 5 






380 










Set 


Leu 


Leu 


Gly Gin, 


Glu 


Ala 


Ala Leu Tyr 


Asp llet 


Asn Gly Val 


Lys 


385 










390 
















400 


Val 


Tyr 


Sc-i 


Phe 


Ala 
4 05 


Val 


Glu 


Ser Leu 


Axg 
410 


Gin Aen 


lie 


Asp 


Leu 
415 


Thr 


HLs 


Leu 


Pro 


Asp 
420 


Gly 


Thr 


Tyr 


Phe Phe 
425 


Arg 


Leu Asp 


Asn 


Tyr 
430 


Thr 


Thr 


Lys 


Leu 


He 

435 


Lys 


Gin 





















(2) InrORllATION FOR SEQ ID MO: 384 

{D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

Ui) IIQLCCULE TYPE : protein 

tin) HYPOTHETICAL: YEr 

(vi) ORIGINAL SOURCE; 

(A) ORGANISH: Porphyromonas gingiva lis 

tlx) FEATURE: 

(A) H At IE /KEY : mis^reature 
(B| LOCATION 1. • -31 B 



(X-i) SEQUENCE DESCRIPTION: SEQ ID NO: 384 



Asn 


His 


Leu 


Lys 


Thr 


A.<?n 


He 


Lys 


Met 


Arg 


Lys 


Thr 


He 


He 


Phe 


Cys 


1 








5 










10 










15 




Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Gly 


Cys 


Ser 


Trp 


Ala 


Gin 


Glu 


Arg 


Val 


Asp 








20 










25 










30 






Glu 


Lys 


Val 


Phe 


Ser 


Ala 


Gly 


Thr 


Ser 


He 


Phe 


Arg 


Gly 


He 


Leu 


Glu 






35 










40 










4 5 








T.ys 


Val 


Lys 


Ala 


Pro 


Leu 


Met 


Tyr 


Gly 


ASp 


Arg 


Glu 


Val 


Trp 


Gly Met 




50 










55 










60 










Ala 


Arg 


Ala 


Ser 


Glu 


Asp 


Phe 


Phe 


Phe 


Tie 


Leu 


Pro 


Val 


Thr 


Asp Acp 


65 










70 










75 










80 


Leu 


Thr 


Pro 


Val 


Lou 


Phe 


Tyr 


Asn 


Arg 


Leu 


Thr 


Asn 


Glu 


Pro 


Cys 


Phe 










95 










90 










95 




Val 


Ser 


hsp 


Gin 


Gly 


He 


Thr 


Glu 


Tyr 


Phe 


Lys 


Phe 


Ala 


Gin 


G1 u 


Gly 








100 










105 








110 






Asp 


Tyr 


He 


Glu 


Val 


Glu 


Gly 


Ser 


Ser 


Val 


Phe 


M«t 


Ala 


Asn 


Leu 


Leu 






115 










120 










125 








Tyr 


Tyr 


Arg 


Phe 


Phe 


Pro 


Thr 


Arg 


He 


Thr 


Ser 


Tyr 


Asn 


Ala 


Pro 


He 




130 










135 










140 










Glu 


Gly 


Val 


Val 


Sor 


Lys 


Thr 


Gly 


Asn 


Pro 


Ala 


Phe 


Thr 


He 


Pro 


Met 


145 










150 










155 










160 


Leu 


Pro 


Gly 


Val 


Ser 


Asp 


Cys 


He 


Glu 


He 


Ser 


Asn 


Asn 


Arg 


Lys 


Val 










165 










170 








175 




Phe 


Leu 


Thr 


Asn 


Gin 


Leu 


Gly 


Val 


Val 


Asn 


He 


Thr 


Asp Gly 


Met 


Glu 








100 










185 










190 






Pro 


Pro 


lis 


He 


Ala 


Gly 


Val 


Ser 


Ala 


Ser 


Tyr 


Gly 


Ser 


Ser 


Val 


Arg 






195 










200 










205 








Val 


Tyr 


Gly 


His 


Val 


Ser 


Gin 


Arg 


Trp 


Asp 


He 


He 


Gly 


His 


Cys 


Tyr 




210 










215 










220 










Leu 


Asp 


He 


Tyr 


Pro 


Thr 


Asn 


Cys 


Tyr 


Pro 


Levi 


Ser 


Thr 


Lys 


Pro 


Val 


225 










230 










235 










240 


Ala 


Gly 


Asp 


Asp 


Glu 


Val 


Phe 


Val 


Lys 


Gin 


Gin 


Gly 


Arg 


Gin 


He 


Glu 










245 










250 










255 




He 


Asp 


Ser 


Asn 


Ser 


Pro 


lie 


Val 


Gin 


Val 


Val 


Val 


Tyr 


Asc 


Leu 


Glu 








260 










265 










270 






Gly 


Lys 


Ser 


Val 


Phe 


Arg 


Lys 


Arg 


Met 


Thr 


Glu 


Asn 


Ala 


Tyr 


Thr 


Leu 






275 










280 










205 






Ser 


Phe 


Arg 


Ala 


Pro 


Met 


Leu 


Gly 


Phe 


llel 


Thr 


He 


Met 


He 


Glu 


Thr 




290 










295 










300 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



321/ 490 



Gin A£Ti Ser He lie Rsn Lys Lys Leu f<sn Val Thr Gin Leu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO; 385 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: -161 aiftino acids 
( B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalio 

lix) FEATURE: 

(A) NAME /KEY: migcje^ture 

(B) LOCATION 1* . .4 61 

(r.i) SEQUENCE DESCRIPTION: SEO ID NOs385 



Arg 


Arg 


Al s. 


Val 


Lys 


He 


Arg 


Ser 


Pro 


Pro 


iris 


He 


rUs 


Ser 


Leu 


Phe 




















10 










15 




Val 


Arc/ 


Lys 


Cys 
20 


Leu 


Phe 


Ser 




Phe 
25 


Lys 


Tvr 


Leu 


Tyr 


Leu 
30 


Ser 




Lys 


He 


Thr 

35 


•Jin 


Glu 


Ary 


Leu 


Gly 
40 


Arg 


Leu 


Ser 


He 


Aiy 
45 


Leu 


Lys 


Ser 


Tyr 


Asn 
50 


Pro 


He 


He 


lie 


He 
55 


Glu 


Met 


Lys 


Lys 


Thr 

60 


Thr 


He 


lie 


Ser 


Leu 


Tl- 


Val 


Phe 


Gly 


Ala 


Phe 


Phe 


Ala 


Ala 


Val 


Gil- 


Gin 


Thr 


Lys 


App 


65 










70 










75 










80 


Asn 


Ser 


Ser 


Tyr 


Lys 
85 


Pro 


Phe 


Ser 


Lys 


Glu 
90 


Asp 


lie 


Ala 


Gly 


Gly 
95 


Val 


Tyr 


Ser 


Leu 


Pro 
100 


Thr 


Gin 


Asn 


Arg 


Ala 
105 


Gin 


Lys 


Asp 


Asn 


Ala 
110 


GlU 


Trp 


Leu 


Leu 


Thr 
115 


Ala 


Thr 


Val 


Ser 


Thr 
120 


Asn 


Gin 


Scr 


Ala 


Asp 
125 


Thr 


His 


Phe 


lie 


Phe 
130 


Asp 


Glu 


Asn 


Asn 


Arg 
135 


Tyr 


He 


Ala 


Arg 


Asp 

140 


He 


Lys 


Ala 


Asn 


Gly 


Val 


Arg 


Lys 


Ser 


Thr 


Asp 


Ser 


He 


Tyr 


Tyr 


Asp 


Ala 


Asn 


Gly 


Arg 


145 










150 










155 










160 


Tie 


Ser 


ULs 


Vai 


Asp 
165 


Leu 


Tyr 


He 


Ser 


Phe 
170 


Ser 


Gly 


Gly 


Glu 


Pro 
175 


Ala 


Leu 


Ak'p 


Thr 


Arg 
180 


the 


Lys 


Tyr 


Thr 


Tyr 
185 


A^p 


Asp 


Glu 


Gly 


Lys 
190 


Mel- 


Thr 


Val 


Arg 


Glu 
195 


Val 


Phe 


Met 


Leu 


Val 

200 


I let 


Asp 


Pro 


A-n 


Thr 
205 


Pro 


He 


Ser 


Ary 


Leu 
210 


Glu 


Tyr 


Hi? 


Tyr 


Asp 
215 


Ala 


Gin 


Gly 


Arg 


Leu 
220 


Thr 


His 


Trp 


He 


Ssr 


Phe 


Ala 


Phe 


Gly 


Ala 


Glu 


Ser 


Gin 


Lys 


Asn 


Thr 


Tyr 


His 


Tyr 


Asn 


225 










230 










235 










240 


Glu 


Lye 


Gly 


Leu 


Leu 
245 


Val 


Ser 


Glu 


Val 


Leu 
250 


Ser 


Asn 


Ala 


Met 


Gly 
255 


Thr 


Thr 


Tyr 


Ser 


Asp 


Thr 


Gly 


I*ys 


Thr 


Glu 


Tyr 


Ser 


Tyr 


Asp Asp 


Ala 


Asp 








260 










265 










270 






Asn 


Met 


Val 


T.yy 


Ala 


Glu 


Tyr 


Phe 


Val 


Val 


Gin 


Gin 


Gly Lys 


Ala 


Trp 






27 5 










280 










285 






G) n 


Val 
2 90 


Leu 


Lys 


Arg 


Glu 


Glu 

295 


Tyr 


Thr 


Tyr 


Glu 


Asp 
300 


Asn 


Tie 


Cys 


He 


Gin 


Tyr 


Leu 


Ala 


He 


Asn 


Gly 


Thr 


Asp Thr 


Lys 


Val 


Tyr 


Lys 


Arg 


Asp 


305 










310 










315 










320 


lie 


Glu 


Ser 


Asp 


Lys 


Scr 


Ha 


Ser 


Ala 


A&'n 


Val 


11 - 


A?p 


He 


Fro 


Ser 










32b 










330 








335 




Met 


Pro 


Glu 


G1 n 
340 


Thr 


Trp 


Pro 


Afn 


Met 
34 5 


Tyr 


Gly 


Phe 


Asn 


Ala 
350 


Lys 


Arg 


Leu 


Lys 


Glu 
355 


Thr 


Tyr 


Ser 


Ser 


Tyr 
360 


Glu 


Gly 


Asp 


Val 


Ala 

36S 


Thr 


Pro 


He 


Phe 


ACp 
370 


Tyr 


He 


Tyr 


Thr 


Tyr 
315 


Lys 


Ala 


Leu 


Thr 


Ser 
380 


IltiL 


Ala 


Thr 


Pro 


Ser 


Thr 


Glu 


Ala 


Gin 


Val 


Ala 


Val 


Tyr 


Leu 


Asn 


Pro 


Ser 


Thr 


Asp 


Arg 


3fl5 










390 










395 








400 


Leu 


Val 


He 


Leu 


Ala 
-405 


Asn Gly 


lie 


Thr 


His 
410 


Leu 


Ser 


Met 


Tyr 


Asp 
415 


Leu 
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Gin Gly I.ys Leu lie Arg Asp Cys Ala Leu Set Gly Asp Lys Val Glu 

■120 425 430 

Met Gly Val Gly Ser Leu Thr Lys Gly Thr Tyr Leu Leu Lys Val Asn 

435 440 445 

Thr Asp Gin Gly Aid Phe Val Axg Lys Val Val He Arg 
450 455 460 



(2) INFORMATION FOR SEQ ID MO: 386 

(j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acide 
(P) TYPE: amino acxd 
(D) TOPOLOGY: linear 

(HI MOLECULE TYPE: protein 

(illl HYPOTHETICAL: YES 

(vl ] ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

tlx) FEATURE: 

(A) NAME/ KEY: misc_featuxe 
(BJ LOCATION 1. . . 451 



(>:i) SEQUENCE DESCRIPTION : SCQ ID HO: 386 



Met 


Ala 


Lys 


Vsl 


He 


Lys 


Thr 


Lys 


Lys 


Gly 


Leu 


Ala 


Leu 


Asn 


Leu 


Lys 




















10 










15 




Gly 


I.ys. 


Pro 


Leu 


Pro 


Glu 


Met 


Leu 


Ala 


Glu 


Pro 


Ala 


Gin 


Ser 


Pro 


Thr 








20 










25 










30 






Tyr 


Ala 


Val 


Val 


Pro 


Asp 


Asp 


Phe 


Glu 


Gly 


Val 


He 


Pro 


Lys 


Val 


Thr 






3t> 










40 










45 








Ala 


Arg 


Pro Gly Asp 


Lys 


Val 


Arg 


Ala 


Gly 


Ser 


Ala 


Leu 


Met 


His 


His 




50 










55 










60 










Lys 


Ala 


Tyr 


Pro 


Glu 


Met 


Lys 


Phe 


Thr 


Ser 


Pro 


Val 


Ser 


Gly 


Glu 


Val 


65 










70 










75 










80 


He 


Ala 


Val 


Asn 


Arg 


Gly Ala 


Lys 


Arg 


Lys 


Val 


Lsu 


Ser 


He 


Glu 


Val 










85 










90 










95 




Lys 


Pro 


Asp Gly Leu 


Asn 


Glu 


Tyr 


Glu 


Ser 


Fhe 


Pro 


Val 


Gly Asp 


Pro 








100 








105 










110 






Ser 


Ala 


Leu 


Ser 


Ala 


Glu 


Gin 


He 


Lys 


Glu 


Leu 


Leu 


Leu 


Ser 


Ser 


Gly 






115 










120 










125 








Met 


Trp 


Gly 


Phe 


He 


Lys 


Gin 


Arg 


Pro 


Tyr 


Asp 


He 


Val 


Ala 


Thr 


Pro 




130 










135 










140 










Asp 


He 


Ala 


Pro 


Arg 


Asp 


lie 


Tyr 


He 


Thr 


Ala 


Asn 


Phe 


Thr 


Ala 


Pro 


145 










150 










155 










160 


Leu 


Ala 


Pro 


Afp 


Fhe 


Asp 


Phe 


He 


Val 


Arg 


Gly 


Glu 


Glu 


Arg 


Ala 


Leu 










165 










170 










175 




G1 n 


Thr 


Ala 


He 


Asp 


Ala 


Leu 


Ala 


Lys 


Leu 


Thr 


Thr Gly 


Lys 


Val 


Tyr 








180 










185 










190 






Val 


Gly 


Leu 


Lys 


Pro 


Gly 


Ser 


Ser 


Leu 


Gly 


Leu 


tils 


Asn 


Ala 


Glu 


He 






195 










200 










205 








Val 


Glu 


Val 


His 


Gly 


Pro 


His 


Pro 


Ala 


Gly Asn 


Val 


Gly Val 


Leu 


He 




210 










215 










220 










Asn 


His 


Thr 


Lys 


Pro 


He 


Asn 


Axg 


Gly Glu 


Thr 


Val 


Trp 


Thr 


Leu 


Lys 


225 










230 










235 










240 


Ala 


Thr 


Asp 


Leu 


He 


Val 


He 


Gly 


Arg 


Phe 


Leu 


Leu 


Thr 


Gly Lys 


Ala 










245 










250 










255 




Asp 


Phe 


Thr 


Arg 


Met 


lie 


Ala 


Met 


Thr 


Gly 


Ser 


Asp 


Ala 


Ala 


Ala 


His 








260 










265 










27 0 






Gly 


Tyr 


Val 


Arg 


He 


Met 


Pro 


Gly 


Cys 


Asn 


Va1 


Phe 


Ala 


Ser 


Phe 


Pro 






275 










280 








285 








Gly Arg 


Leu 


Thr 


He 


Lys 


Glu 


Ser 


His 


Glu 


Arg 


Val 


He 


Asp 


Gly Asn 




290 










295 










300 










Val 


Leu 


i'hr 


Gly 


Ly- 


Lys 


L-rU 


cys 


Glu 


Lys 


Glu 


Pro 


Phe 


Leu 


Ser 


Ala 


305 










310 










315 










320 


Arg 


cys 


Asp 


Gln 


He 


Thr 


V<tl 


He 


Pro 


Glu 


Gly 


Asp 


Asp 


Val 


Asp 


Glu 










325 










330 










J3b 




Leu 


Phe 


Gly 


Trp 


Ala 


Ala 


Pro 


Axg 


Leu 


Asp 


Gin 


Tyr 


Ser 


Met 


Ser 


Arg 








340 










345 










350 






Ala 


Tyr 


Phe 


Ser 


Trp Leu 


Gin 


Gly 


Lys 


Asn 


Lys 


Glu 


Tyr 


Val 


Leu 


Asp 






355 










360 










365 








Ala 


Arg 


lie 


Lys 


Gly Gly Glu 


Arg 


Ala 


Met 


He 


Met 


Ser 


Asn 


Glu 


Tyr 




370 










375 










380 
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Asp Arg Val Phe Pro M~t Asp lie Tyr Pro Glu Tyr Leu Leu Lys Ala 
305 390 395 400 

lie He Ala Phe Asp He Asp Lys Met Glu Asp Leu Gly lis Tyr Glu 

4Q5 410 415 

Val Ala Pro Glu Asp Phe Ala The Cys Glu Phe Val Asp Thr Ser Lys 

420 425 430 

He GJ u Leu Gin Arg lie Val Arg Glu Gly Lev) Asp Met Leu Tyr Lys 

435 440 445 

Glu Het Asn 
450 



(2) IMFOR11ATION FOR SEQ ID NO: 307 



f i J SEQUEI ICE CHARACTERISTICS : 

(A) LENGTH: 195 amino acids 
(B> TYPE: amino acid 
(Dl TOPOLOGY: linear 



(11] MOLECULE TYPE: protein 

(iii] HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalls 

lix) FLATURE: 

(A) UAI IE/ KEY: miscjeature 
(8 ) LOCATIOll 1- . .195 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 307 



llet 


Asp 


Lys 


Val 


Ssr 


Tyr 


Ala 


Leu 


Gly 


Leu 


Ser 


He 


Gly Asn 


Asn 


Phe 


1 








5 










10 










15 




Lys 


Ser 


Ser 


Gly 


He 


Asp 


Ser 


Val 


VaI 


Het 


Asp 


Asp 


Phe 


Met 


Gin 


Giy 








20 










25 










30 




Leu 


Ser 


Asp 


Val 


Leu 


Glu 


Glu 


Lys 


Ala 


Pro 


Gin 


Leu 


Ser Tyr 


Asp Glu 






35 










40 










45 








Ala 


Lys 


Arg 


Glu 


lie 


Glu 


Ala 


Tyr 


Phe 


Met 


Asp 


Leu 


Gin 


Gin 


Lys Ala 




50 










55 










60 










Val 


Lys 


Leu 


Asn 


Lys 


GlU 


Ala 


Gly 


Glu 


Glu 


Phe 


T.eu 


Lys 


He 


Asn 


Ala 


65 










70 










75 










80 


His 


Lys 


Glu 


Gly 


Val 


Thr 


Thr 


Leu 


Pro 


Ser 


Gly 


Leu 


Gin 


Tyr 


Gl U 


Val 










85 










90 










95 




He 


Lys 


Met 


Gly 


Glu 


Gly 


Pro 


Lys 


Pro 


Thr 


Leu 


Ser 


Asp 


Thr 


Val 


Thr 








100 










105 










110 






Cys 


His 


Tyr 


His 


Gly 


Thr 


Leu 


He 


Asn 


Gly 


He 


Val 


Phe 


Asp 


Ser 


Ser 






115 










120 










125 








llet 


Arp 


Arg 


Gly 


Glu 


Pro 


Ala 


Ser 


Phe 


Pro 


LCU 


Arg 


Gly 


Val 


He 


Ala 




130 










135 










140 










Gly 


Trp 


Thr 


Glu 


He 


Leu 


Gin 


Leu 


M^t 


Pro 


Val 


Gly 


Ser 


Lys 


Trp 


Lys 


145 










150 










155 










160 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Leu 


Ala 


Tyr 


Gly 


Asp 


Arg 


Gly Ala 


Gly 


Glu 










165 










170 










175 




His 


He 


Lys 


Pro 


Gly 


Ser 


Thr 


Leu 


He 


Phe 


He 


He 


Glu 


Leu 


Leu 


Ser 



180 IBS 190 

He Asn L*/3 
195 



(2) IHFORI lATlON FOR SEQ ID NO: 388 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 3 amino acids 
IB) TYPE: amino anid 
(Dl TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival!*; 



IL'A) FEATURE: 

(At NAME/ KEY: miec_feature 
( B) LOCATION 1 . . .273 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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324/ 490 



(xx) S&QUEltCE DESCRIPTION: SEO ID NO: 388 



Met 


Lys 




Al a 


Leu 


eu 


Xle 




Ala 


Ala 


Leu 


Leu 


Gly Ala Val 


Ser 




















1 0 










15 






Al a 




Al a 


Gin 


or 


Leu 


er 


Thr 


11 a 




Va 1 


Gin 


Asn 


Asn 


Ser 








zo 










25 










30 






Val 


Gin 


Gin 


Pro 










Thr 


e 




V 

a 


Cys 


Gly Glu 


Leu 






3 j 










40 










45 








Ala 


Glu 


Gin 


Val 


Asp 


Cyr 


lie 


Gly 


Thr 


Gly 


Aon 


Ser 


Ala 


He 


He 


Ala 




50 










55 










60 










Ala 


Ala 


Ala 


L-y? 


Phe 


Glu 


Ser 


Asp 


Asp 


Leu 


Glu 


Ser 


Tyr 


Val 


Gly Trp 


65 










TO 




















80 


Glu 


lie 


net 


Ser 


Val 


Asp 


Phe 


Phe 


Pro 


Gly 


Tyr 


Lys 


Ala 


Cys 


Lys 


Tyr 










95 










90 










9S 




Thr 


Str 


Ala 


Val 


Trp 


Ala 


Asp 


Asp 


Ifet 


Thr 


lie 


Leu 


Gly 


Gin 


Sor 


Glu 








100 










105 










110 






AsTp 


Ser 


Asp 


Pro 


Glu 


Met 


Gin 


Tnr 


lie 


Asn 


Asn 


Leu 


Ala 


Leu 


Lys 


Thr 






; i^ 










120 










125 








Ser 


V^l 


Lyr 


He 


Glu 


Ala 


Gly 


L/£ 


Asn 


Tyr 


He 


Val 


Gly Tyr 


lie 


Al * 




1J0 










13b 










140 










A?n 


Thr 


Ala 


Gly 


Gly 


His 


Pro 


Xle 


Gly 


Cys 


Asp 


Gin 


Gly 


Pro 


Ala 


Val 


145 










150 










155 










160 


Asp 


Gly 


Tyr 


Gly 


As P 


Leu 


Val 


Ser 


He 


3er 


Glu 


Asp 


Gly 


Gly 


Ala 


Thr 










165 










170 










175 




Phe 


Pre 


:'ro 


Phe 


Glu 


Ser 


Leu 


in- 


Gin 


Ala 


Val 


Pro 


Thr 


Leu 


Asn 


Tyi 








180 










1S5 










190 






Asn 


lie 


Tyr 


Val 


Val 


Val 


His 


Leu 


Lys 


Lys 


Gly 


Glu 


Gly 


Val 


Glu 


Ala 






195 










200 










205 








Val 


Leu 


Thr 


A 6ti 


Asp 


I.y« 


Ala 


Asn 


Ala 


Tyr 


Val 


Gin 


Asn 


Gly 


Val 


He 




210 










215 








220 










Tyr 


Val 


Ala 


Gly 


Ala 


Asn 


Gly 


Arg 


Gin 


Val 


Ser 


Leu 


Phe Asp 


Met 


A.-?r\ 


225 










230 










235 










240 


Cly 


Lys 


Val 


Val 


Tyr 


Thr 


Gly 


Val 


Ser 


Glu 


Thr 


Tie 


Ala 


Ala 


Pro 


Gin 










245 










250 










25S 




Lys 


Gly 


Met 


Tyr 


lie 


Leu 


Arg 


VaJ 


Gly 


Ala 


Lys 


Ser 


He 


Lys 


Leu 


Ala 



260 265 270 

Tie 



(2) II I FORMATION FOR SEQ ID MO: 389 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEtKiTH: 554 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

<1U llOLEGULE TYPE; protein 

iiii) HYPOTHETICAL: YES 

tvi> ORIGINAL SOURCE: 

tA) ORGANISM: Porphyromonas gingiva lis 

UK) FEATURE: 

(A) NAME/KEY: misc_feature 
*B) LOCATION 1. . .554 



(Xil SEQUENCE DESCRIPTION: SEQ ID MO:3B9 



llet 


Pro 


Arg 


He 


Met 


Lys 


Leu 


Lys 


He 


Ala 


Leu 


Arg, 


Leu Leu 


Leu 


Ala 


1 








5 










10 








15 




Thr 


The 


Ala 


He 


Val 


Leu 


Phe 


Ser 


Pro 


Leu 


Ala 


Lys 


Ala Gin 


Met 


Asp 








~0 










25 








30 






lie 


Gly 


Gly 


Asp 


Asp 


Val 


Leu 


He 


Glu 


Thr 


Uet 


Ser 


Thr Leu 


Ser 


Gly 






35 










40 










45 






Tyr 


Ser 


Glu 


Asp 


Phe 


Tyr 


Tyr 


Lys 


Met 


Ala 


Val 


Ala 


Asp Asn 


Gly 


Trp 














55 










60 






He 


Tyr 


Val 


Wet 


Leu 


Asp 


Phe 


Ser 


Arg 


He 


Tyr 


Phi- 


Asp Asp 


val 


Arg 


65 










70 










75 






90 


Leu 


Tyr 


Arg 


Ser 


Lys 


Asp 


Gly 


Gly 


Ala 


Thr 


Tyr 


Gin 


Lys Leu 


Gly 


Ser 










85 










90 








95 




Leu 


Gly 


Ser 


Leu 


Val 


Pro 


Tyr 


Asp 


Phe 


Asp 


Val 


Ser 


His Cys 


Asp 


Phe 








10O 










105 








110 






He 


Val 


Thr 


Gly 


Lys 


Asp 


Glu 


Asp 


Asp 


He 


Asn 


Val 


Trp Thr 


Val 


net 
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115 120 !-><: 

Thr Ala Phe Glu Tyr Val Gly Gly Thr He Gly Asn Gly Val Leu L-u 

1J0 13S 140 

Met His Arq His Asp Ala Asp Tie A,n Asn Thr Glu Cys Val Tyr Lys 

150 155 
Lys Asp Phe Pro Asn Asn Arg Leu Met Gly Val Ala He Ala Ser A,n 

Tyr Ar, Ala Pro Ser Pro Tyr Gly Leu Gly Gly Asp Pro Phe Ma I.e., 

180 185 190 

Ala vax Ala Val Scr Gly Ser Gly Ser Asp His Ser Phe Leu Asp Tyr 

13:3 200 205 

Tie Phe Ser Leu Asp Gly Gly Val HI* Phe Glu Gin Lys Arg lie Tyr 

Thi Arg Pro Gin Lys Leu Thr He Asn Arg Val Arp Leu Ser Leu G^y 



Ser Thr Ser Pro Ser Leu Gly Phe A*n Thr ?ll Pro Leu H-t Gl» 
245 250 255 

Val Phe Glu Net Asn Lys Am Leu A,p Gly Phe Asp He Gly Phe II- 

260 265 270 

Ser Am Phe VM Asp Tyr Asp Pro Arg Tyr Ala Trp Ser Glu Pro Il~ 

280 285 
He He Glu Glu A*p Cy, Gly Trp Thr Asp Phe Asn Pro T .-u Gl- Ala 

~ 29S 300 

Leu Ser lie Glu He Gin Met Mat Leu Asp Asp Asn Ser Asp Asn Thr 

Val Gly Gly Glu Arg Scr His Asn Phe Leu He Thr Tyr Pro Gly Hi- 

325 330 -^c 

Tyr Val Tyr Pro Lys Gin Ser Phe A*n Tyr Ser Pro Gl V Hi, Thr Pro 

Thr L>« Lys Arp Leu Val Phe Lys His Cys He Gly He Pro Ala Leu 

iJ 360 365 

Ala Ty r Asp Lys Glu Gly Asp Arg Tyr Leu Thr Thr Phe Gin A*p His 

3^5 380 

Asn Le U .let Arg Tyr Arq Trp lie Lys Tyr Asp Asp He Asn Ser Phe 

390 3^5 Ann 

Tyr Gly T,p Ser Trp Pro Tyr Val Tyr Ala Ly> 31u Ala Ly S Asp iys 

40S «10 4is 

Lys Arc, Arg Arg Pre Gin Val Ala Leu Asn Pre. Thr Asn Gly Lys Ala 

420 <2S j, 0 

Cys Trp Val Trp Hi. Thr Arg Lys Ser Pro Tyr Asp Glu Thr Lys Pro 

Hi, Pro Thr Pro Val He lie Lys H is Phe L= U Trp Ser Asp Thr Glu 

455 460 
Trp Val His Ala Leu Asp Val Gly Asp Val Leu Gin Lys Glu Gly Ser 

Met Lys Leu Tyr Pro A*n Pro Ala Lys Glu Tyr Val Leu He Asn Leu 

495 4 go 405 

Pre Ly* Glu Gly Gly His Glu Ala Val Val Tyr Asp Het Gin Gly Arg 

505 si r 

^5 LVS SSC PhS G1 '' L - Glu Tyr Lys Leu Asn Val 

Gin Tyr Leu Ser Lys Gly Thr Tyr Het Leu Lys V„l Val Ala Asp Thr 

535 5d0 
Glu Tyr Phe Val Glu Lys He He Val Glu 
54 5 5S0 

(2) INFORMATION FOR SEQ ID NO: 3 90 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 550 amino acids 
(D) TYPE; amino acid 
(D» TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

IviJ ORIGI1IAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival ic 
I IX) FEATURR: 

(A) NAME/KEY: nvisc feature 

(BJ LOCATION 1. . . 550 

Ui) SEQUENCE DESCRIPTION: SCO. ID HO: 390 
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rist 
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Lys 
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Asp 


He 
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Asp 
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Glu 
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Ser 
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Tyr 


Ser 
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Asp 






35 
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Lys 


Met 
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Asp 
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Trp 


He 


Tyr 


Val 


Met 
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Phe 
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Arg 


He 


Tyr 


Phe 
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Asp 
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Arg 


Leu Tyr, 
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80 


Lys 
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Gly Ala 


Tlic 


Tyr 


Gin 


Lys 


I.eu 


Gly 


Ser 


Leu 


Gly 


Ser 


Leu 










85 










90 










95 




Val 


Pro 


Tyr 


AF P 


Phe 


Arp 


Val 


Ser 


His 


Cys 


Asp 


Phe 


He 


Val 


Thr 


Gly 








100 










105 










110 






Lys 


Asp 


Gl u 


Asp 


Asp 


He 


Asn 


Val 


Trp 


Thr 


Val 


I let 


Thr 


Ala 


Phe 


Glu 






i: c . 










120 










125 








T^r 


Val 


■31 y 


Gly Thr 


He 


Gly 


Asn 


Gly 


Val 


Leu 


Leu 


Met 


His 


Arg 


His 
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135 










140 
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Ala 


Asp 
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Asn 


Asn 


Thr 


Glu 


Cys 


Val 


Tyr 


T.ys 


Lypt 


Asp 


Phe 


Pro 


145 










150 










155 










160 


Asn 


Asn 


Arg 


L?u 


Het 


Gly Val 


Ala 


11 = 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ala 


Pro 
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170 
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Pro 


Tyr 


Gly 


Leu 


Gly Gly Asp 
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Phe 


Ala 




Ala 


Val 


Ala 


Val 








180 










185 
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Gly 


Ser 
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s~r 


Asp 


His 


Ser 


Phe 


Leu 


Asp 


Tyr 


He 


Phe 


Ssr 


Leu 






195 










200 










205 








Asp 


Gly 


Gly 


Val 


His 


Phe 


Glu 


Gin 


Lys 


Arg 


He 


Tyr 


Thr 


Arg 


Pro 


Gin 




21^ 










215 










220 










Lys 


Leu 


Thr 


He 


Asn 


Arg 


Val 


Asp 


Leu 


Ser 


Leu 


Cli- 


Ser 


Thr 


Ser 


Pro 


22b 










230 










235 










240 


Ser 


Leu 


Gly 


Phe 


Asn 


Thr 


Trp 


Pro 


T.eu 


Met 


Gly 


val 


Val 


Phe 


Glu 


Met 










245 










250 










255 




Asn 


i.ys 


A5»n 


Leu 


Asp 


Gly 


Phe 


Asp 


lie 


Gly 


Phe 


He 


Sor 


Asn 


Phe 


Val 








260 










265 










Z70 






Asp 


Tyc 


Asp 


Pro 


Arg 


Tyr 


Ala 


Trp 


Ser 


Glu 


Pro 


He 


lie 


He 


Glu 


Glu 






275 










280 










285 








Asp 


Cys 


Gly 


Trp 


Thr 


Asp 


Phe 


Asn 


Pro 


Leu 


Gly 


Ala 


Leu 


Ser 


He 


Glu 




290 










2 95 










300 










lis 


Gin 


(let 


Met 


Leu 


Asp 


Asp Asn 


Ser 


Asp 


Asn 


Thr 


Val 


Gly 


Gly 


Glu 


305 










310 










315 










320 


Arg 


Scr 


His 


Asn 


Phe 


Leu 


He 


Thr 


Tyr 


Pro 


Gly 


His 


Tyr 


Val 


Tyr 


Pro 










325 










330 








335 




Lys 


Gin 


Ser 


Phe 


Asn 


Tyr 


Ser 


Pro 


Gly 


His 


Thr 


Pro 


Thr 


Lys 


Lys 


Asp 








340 










345 










350 






Leu 


Val 


Pho 


Lys 


Hi s 


cys 


He 


Gly 


He 


Pro 


Ala 


Leu 


Ala 


Tyr 


Asp 


Lys: 






355 










360 










365 








Glu 


Gly 


Asp 


Arg 


Tyr 


Lfeu 


Thr 


Thr 


Phe 


Gin 


Asp 


lias 


Asn 


Leu 


liet 


Arg 




37 o 










37b 




















Tyr 


Arg 


Trp 


Tie 


Lys 


Tyr 


Asp 


Asp 


He 


Asn 


Ser 


Phe 


Tyr 


Gly Trp 


Ser 


385 










390 










395 










4 CO 


Trp 


Pre 


Tyr 


Val 


Tyr 


Ala 


r.ys 


Glu 


Ala 


Lys 


Asp 


Lys 


Lys 


Arg 


Arg 


Arg 










4 05 










410 










41b 




Pro 


Gin 


Val 


Ala 


Leu 


Asn 


Pro 


Thr 


Asn 


Gly 


Lys 


Ala 


Cys 


Trp 


Val 


Trp 








420 










42b 








430 






His 


Thr 


Arg 


Lys 


Ser 


Pro 


Tyr 


Asp 


Glu 


Thr 


L-ys 


Pro 


His 


Pro 


Thr 


Pro 






43b 










440 










445 








Val 


lie 


lie 


Lys 


His 


Phe 


Leu 


Trp 


SCL" 


Asp 


Thr 


Glu 


Trp 


Val 


His 


Ala 




450 










455 










460 










Leu 


Asp 


Val 


Gly 


Asp 


Val 


Leu 


Gin 


Lys 


Glu 


Gly 


Ser 


Hct 


Lys 


Leu 


Tyr 


46S 










470 










475 










480 


Pro 


Asn 


Pro 


Ala 


Lys 


Glu 


Tyr 


Val 


Leu 


He 


Asn 


Leu 


Pro 


Lys 


Glu 


Gly 










485 










4 90 








495 




Gly 


His 


Glu 


Ala 


Val 


Val 


Tyr 


Asp 


Met 


Gin 


Gly 


Arg 


He 


Val 


Glu 


Lys 








500 










SOS 










510 






Vai 


Ser 


Phe 


Ser 


Gly 


Lys 


Glu 


Tyr 


Lys 


Leu 


Asn 


Val 


Gin 


Tyr 


Leu 


Ser 






515 










520 










525 






Lys 


Gly 


Thr 


Tyr 


Met 


Leu 


Lys 


Val 


Val 


Ala 


Asp 


Thr 


Glu 


Tyr 


Phe 


Val 




53C 










535 










540 








Glu 


Lys 


He 


He 


Val 


Glu 






















545 










550 























(2) IMPORJIATIOtJ FOR SEQ ID IIO:391 
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(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 amino acids 
(B> TYPE : amino acid 
(D) TOPOLOGY: linear 

(ill MOLECULE TY PL : protein 

(Hi) HYPOTHETICAL: YES 

{ vi ) ORIGINAL SOURCE: 

(A) ORGANISI!: Porphyromonas c/ingivalie 

tlx) FEATURE : 

{A J HAM F./ KEY : misc featur" 

(B) LOCATIOIJ 1. . . 390 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO:391 

JleL Lys Arg L<--u Leu Pro Phe Leu Leu Leu Ala Gly Leu Val Ala Vai 

X 5 1 0 IS 

Gly Asn Val Ser Ala Gin Ser Pro Arg He Pro Gin Val Asp Val Hie 

20 25 30 

Thr Arg IJ* Ala Arg Asn Ala Arg Tyr Arg Leu Asp Lys He Ser Val 

Ji> 40 45 

Pro Asp Jer Arg 3-ln lie Phe Asp Tyr Phe Tyr Lys Glu Glu Thr II- 

55 60 

Pro .:ir Lys Tie Gin Thr Thr Thr Gly Gly Ala He Thr Ser lie Asp 

Ser Leu Phe Tyr Glu Asp Asp Arg Leu Val Gin Val Arg Tyr Ph- Asp 

05 90 g5 

Asn Asn Leu Glu Leu Lys Gin Ala Glu Lys Tyr Val Tyr Asp Gly Ser 

100 105 no 

Lys Leu Val Leu Arg Glu Tie Arg Lys Ser Pro Thr Asp Glu Thr Pro 

115 120 12 5 

He Lys Lys Val Ser Tyr His Tyr Lou Cys Gly Ser Asp Met Pro Phe 

Glu He Thr Thr Glu Met Ser Asp Gly Tyr Phe Glu Ser His Thr Leu 

155 160 
Asn Tyr LeU ** n Jly Lys He Ala Arg He Asp Ho Me t The Gin Gin 

Asn Pr ° Ser Glu Le " He Glu Thr Gly Arg Met Val Tyr Glu Ph- 

180 ^ 185 190 

Asp AlG Asn Asn Asp Ala Val Leu Leu Arg Asp Ser Val Phe Leu Pro 

195 200 205 

Leu Gin Asn Lys Trp Val Glu Met Phe Thr His Arg Tyr Thr Tyr Asp 

215 220 

Asn Lys Hrs Asn Cys He Arg Trp Glu Gin Asp Glu Phe Gly Thr Leu 

Thr Leu Ala Asn Asn Phe Glu Tyr Asp Thr Thr He Pro Leu Ser Ser 

, . 24S 250 255 

✓al Leu Phe Pro Thr His Glu Glu Phe Phe Arg Pro Leu Leu Pro Asn 

260 265 ?70 

Fhe Ilet Lys His Met Arg Thr Lys Gin Thr Tyr Phe Asn Asn Ser Gly 
„ 275 280 285 

Glu *ly Leu Ser Glu Val Cys Asp Tyr Asn Tyr Phe Tyr Thr Asp Met 

295 300 
Gin Gly Asn Ala Leu Thr Asp Val Ala Val Asn Glu Ser He Lys- He 

310 31 c 

Tyr Pro Arg Pro Ala Thr A*p Phe Leu Arg lie Glu Gly Ser Gin Leu 

325 330 335 

Leu Arg Leu Sor r.eu Phe Asp Met Asn Gly Lys Leu lie Arg Ala Thr 

Glu Leu Thr Gly Asp Leu Ala lie He Gly Val Ala Ser Leu Pro Arc 

355 360 365 

Gly Thi Tyr II* Ala Glu He Thr Ala Ala Asn Ser Lys Thr He Arg 
37 5 300 

Ala Ly.- Val Ser Leu Arg 
385 ago 

(2) IliFORtlATION FOR SEQ ID HO: 392 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 
<B> TYFE: amino acid 
(D> TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 
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(ii> MOLECULE TYPE: protein 
(lii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANTSM: Porphyromonas gingivals 
(i;:) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 3 . . . 41! 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 92 
Met Arg Gin Hi. Leu Ser T* u Phe Pro Phe He L*u Phe Leu Leu Leu 
Ala Phe Ser T yr Val Gly cy, Arg Thr Va\ Are, Gin Thr Pro ly S Gln 
S " T ^ Val Val Vai Jju Ser Leu Asp Gly Phe Arg Pro A*p 

Tyr The A,p Arg Ala Arg Thr Pro Ala Leu A*p Arg Met Ala Gin Glu 
Gly Leu Ser Gly Ser Leu Gin Pro Cys Phe Pro ler Leu Thr Phe Pro 
25 His Tyr Ser Met Ala Thr Gly Leu Tyr Pro A*p His Hi, Gly 



10 



15 



20 



30 



50 



75 



Val Ala 31u Phe Val Asp Ser Leu Leu Gly He Phe Arg n\ Ser 

A-p Arg Lys Ala Val Glu Thr Pro Gly Phe Tr P Gly Gly ^ Pro Val 
Trp a™ Thr Ala Ala Arg Gin Gly He Arg Thr Gly Tyr Phe Trp 

Val ciy Ser Glu Thr Ala Val Asn Gly Asn Arg Pro Trp Arg Trp Lys 



120 12 s 
val 

35 PllH sec Ser Th , £J pro phe ^ ^ iss ^ ^ leo 



Ala Trp t.„ Gly Leu Pro Glu Ly 3 ^ Pto Ar , Leu Le<J ™ Trp 
40 Tyr Ils Hi Glu Pro Met l£ "y Hi, Ser Gin Thr 11° olu ser 

Pro Leu Thr Leu Ala Met Val Glu Ar„ U U Asp Ser Val Val Gly Tyr 

Z15 2 20 

Phe Arg Ly» Ar, r.eu Asp s . e Leu Pro „. Mfl Ala ^ ^ ^ ^ 

45 Tie He v.! S er £p H1 . Gly Met Ala Thr Glu ^ gi<j ^ «0 

250 2b5 



1 Asn Leu Ser H1 , Tye Leu p ro Ala A*p S=r Phe Leu Tyr Met Ai. 
Thr My Ala Phe Thr His Leu Tyr P r^ Lys Pro Ssr Tyr ™ 
Ala Tyr „„ Ilo Arq Ile pro ne 28. ^ ^ 



205 



Gly Glu V al Pro Lys Arg £^ Ar g Cys Gly Thr Asn Pro Arg Leu Gly 
55 Glu Leu Val Val lie Pro Asp He Gly S2r £r Val Phe ph . «0 
Asn Glu Ai p Val Arg Pro Gly Ala Ala "s Gly Tyr Asp Asn "n Ala 
60 G1U Ma LeU LeU *? n * ? = V ^ Civ Pro Asp Phe Arg Pro 



Gly Ser Arg Val Glu Asn Leu Pro Asn He Thr lie Tyr Pro Leu He 
Cg Arg L,u Leu Gly lie Glu Pro Ala Pro A,n ^p Ala A, P Glu Thr 

65 Leu Leu Asn Gly Lou He Arg A=p Ly s Arg Pro 
405 41Q 

(2) It I FORI IAT I ON FOR SEQ ID NO: 393 

7 ^ <i> SEQUENCE CHARACTERISTICS - 

fA) LEIIGTH: 24 6 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY; linear 



<ii> MOLECULE TYPE : protein 
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(nil HYPOTHETICAL: YES 

(vi ) ORIGINAL SOURCE: 

iA) ORGANISM: Porphyromonas gingival is 

fix) FEATURE: 

(A) NAME/ KEY : misc_foature 

(B) LOCATION 1...Z46 



(xty SEQUENCE DESCRIPTION: SEQ ID NO : 3 93 



Met 


Lys 


Val 


Gly 


Lou 


Phe 


He 


Pro 


Cys 


Tyr 


Val 


Asn 


Ala 


Val 


Tyr 


Pro 


1 








5 










10 










15 




Glu 


Val 


Gly 


He 


Ala 


Thr 


Tyr 


T.ys 


Leu 


Leu 


Lys 


Ser 


Leu 


Asp 


He 


Asp 








20 










25 










30 






Val 


Asp 


Tyr 


Pro 


Het 


Asp 


Gin 


Thr 


Cys 


Cys 


Gl y 


Gin 


Pro 


Met 


Ala 


Asn 






35 










40 










45 








Ala 


Gly 


Phft 


Glu 


Gin 


Lys 


Al a 


Gin 


Lys 


Leu 


Ala 


Leu 


Arg 


Phe 


Glu 


Glu 




50 










55 










60 










Leu 


Phe 


Glu 


Sor 


Tyr 


Asp 


Val 


Val 


Val 


Gly 


Pro 


Ser 


Ala 


Ser 


Cys 


Val 


65 










70 










75 










80 


Ala 


Phe 


Val 


Lys 


Glu 


Am 


Tyr 


Asp 


His 


lis 


Leu 


Arg 


Pro 


Thr 


Gly 


His 










85 










90 










<*5 




Val 


Cye 


I.y.« 


-Ser 


Ala 


Ala 


Lys 


Val 


Arg 


Asp 


lis 


Cys 


Glu 


Phe 


Leu 


His 








100 










105 










110 






Asp 


Asp 


Leu 


Lys 


He 


Thr 


Ser 


Leu 


Pro 


Ser 


Arg 


Phe 


Al a 


Ills 


Lys 


V«l 






115 










120 










125 








Ser 


Leu 


His 


Arrn 


Ser 


Cys 


His 


Gly 


Val 


Arg 


Glu 


Leu 


His 


Leu 


Ser 


Thr 




1J0 










135 










140 










Pro 


Ser 


Glu 


Val 


Hie 


Arg 


Pro 


Tyr 


HI 6 


Asn 


Lys 


Val 


Arq 


Arg 


Leu 


Leu 


14S 










150 










155 










160 


Glu 


I let 


Val 


Gin 


Gly 


He 


Glu 


Val 


Phe 


Glu 


Pro 


Lys 


Arg 


He 


Asp 


Glu 










165 










170 










175 




Cys 


Cys 


Gly 


Phe 


Gly 


Gly 


Met 


Tyr 


Ser 


Val 


Glu 


Glu 


Pro 


Glu 


Val 


Ser 








100 










105 










190 






Thr 


Cys 


I let 


Gly 


liis 


Asp 


Lye 


Val 


Leu 


Asp 


His 


He 


Ser 


Thr Gly Ala 






195 










200 










205 








Glu 


Tyr 


He 


Thr 


Gly 


Pro 


Asp 


Ser 


Ser 


Cys 


Leu 


Met 


His 


Met 


Gin 


Gly 




210 










215 










220 










Val 


He 


Arp 


Arg 


Glu 


Lys 


Leu 


Pro 


11- 


Lys 


Thr 


He 


His 


Ala 


Val 


Gl u 


225 










230 










235 












Tie 


Leu 


Ala 


Ala 


Asn 


Leu 






























245 
























(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 394 

















ti) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 246 amino acid* 
(B) TYPE: anu.no acid 
(D) TOPOLOGY : linear 

t J 1 ) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vlj ORIGINAL SOURCE: 

(A) ORGA11IS11: Porphyromonas gingival is 

lixj FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1...246 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 



Met 


Asp He 


Val 


Ser 


Met 


Ala 


Asp l,y? 


Ala 


Leu 


Val 


Val 


Glu 


Met 


Arg 


1 






5 








10 










15 


Asp 


Val Thr 


Leu 


Cys 


Gin 


Glu 


Glu Asn 


Val 


He 


Phe 


Gin 


Asn 


Leu 


Asn 






20 








25 










30 






Leu 


Thr Leu 


Ser 


Ala 


Gly 


Asp 


Phe Val 


Tyr 


Leu 


He 


Gly 


Ser 


Val 


Gil- 




35 










40 








45 








Ser 


Gly Lys 


Ser 


Thr 


Leu 


Leu 


Lys Ala 


Leu 


Tyr 


Ala 


Glu 


Val 


Pro 


lie 




SO 








55 






60 










Ser 


Ala Gly 


Tyr 


Ala 


Arg 


Val 


He Asp 


Tyr 


Asp 


Leu 


Ala 


Lys 


Leu 


Lys 


65 








70 








75 










00 
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Arg 


Lys 


Gin 


Leu 


Pro 


Ti L 


Leu 


Arg 


Arg 


Asn 


Leu 


Gly 


He 


Vdl 


Phe 


Gin 










8S 










90 










95 




Asp 


Phe 


Gin 


Leu 


Leu 


Asn 


Gly Arg 


Thr 


Val 


Aid 


Glu 


Asn 


Leu 


Asp 


Phe 








100 










105 










110 






Val 


Leu 


Arg 


Ala 


Thr 


Asp 


Trp 


Lys 


Asn 


Arg 


Ala 


Asp 


Arg 


Glu 


Gin 


Arg 






115 










120 










125 








He 


Glu 


Glu 


Val 


Leu 


Thr 


Arg 


Val 


Gly Met 


Ser 


Arg 


Lys 


Ala 


Tyr 


Lys 




130 










135 










140 










Arg 


Pro 


Ills 


Glu 


Leu 


Ser 


Gly Gly Glu 


Gin 


Gin 


Arg 


Val 


Gly 


He 


Ala 


1-15 










ISO 










1SS 










160 


Arg 


Ala 


Leu 


Leu 


Ala 


Lys 


Pro 


Ala 


Leu 


He 


Lsu 


Ala 


Asp 


Glu 


Pro 


Thr 










165 










170 










175 




Gly Arn 


Leu 


Asp 


Ser 


Val 


Thr 


Gly 


Leu 


Gin 


He 


Ala 


Ser 


Leu 


Leu 


Tyr 








180 










185 










190 






Glu 


He 


Sti. 


Lys 


Gin 


Gly 


Thr 


Ala 


Val 


Leu 


Met 


Ser 


Thr 


Ills 


Asn 


Ser 






195 










200 










205 








Ser 


Lou 


Leu 


Sec 


His 


Leu 


Pro 


Ala 


Arg 


Thr 


Leu 


Ala 


Val 


Arg 


Lye 


Asn 




210 










215 










220 










Gly 


Asp 


Ala 


Ser 


Ser 


Leu 


Val 


Glu 


Leu 


Ser 


Aic* 


Asp 


Ala 


Val 


Ser 


Arg 


225 










230 




















240 


Lyc 


Acn 


Thr 


Glu 


He 


A^p 























(2) INFORMATION FOR SEQ ID HO: 395 

tl) SEQUENCE CHARACTERISTICS: 

IA> LENGTH: 241 amino acids 
(B) TYPE: amino acid 
ID) TOFOLOGY: linear 

(li) MOLECULE TYPE: protein 

iiii) HYPOTHETICAL: YES 

Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival;.? 

(ix) FEATURE: 

(A) NAME /KEY : miscjeature 

(B) LOCATION 1 . . .241 



(Xi) SEQUENCE DESCRIPTION: SEQ ID 110:395 



Met 


Ala 


Asp 


Lys 


Ala 


Leu 


Val 


Val 


Glu 


Met 


Arg 


Asp 


Val 


Thr 


Leu 




1 








5 










10 










15 




Gin 


Glu 


Glu 


Aim 


Val 


He 


Phe 


Gin 


Asn 


Leu 


A*«n 


Leu 


Thr 


Leu 


Ser 


Ala 








20 










25 










30 






Gly Asp 


Phe 


Val 


Tyr 


Leu 


He 


Gly 


Ser 


Val 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 






35 










40 










•15 








Leu 


Leu 


T.y? 


Ala 


Leu 


Tyr 


Ala 


Glu 


Val 


Pro 


He 


Ser 


Ala 


Gly 


Tyr 


Aln 




50 










55 










60 






Arg 


Val 


He 


Asp Tyr 


Asp 


Leu 


Ala 


Lys 


Leu 


r.y« 


Arg 


Lys 


Gin 


Leu 


Pro 












70 










75 










80 


Tyr 


Leu 


Arg 


Arg 


Asn 


Leu 


Gly 


lie 


Val 


Phe 


Gin 


A^p 


Phe 


Gin 


Leu 


Leu 










65 








90 








95 




Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Glu 


Asn 


Leu 


Asp 


Phe 


Val 


Leu 


Arg 


Ala 


Thr 








100 










105 










110 






Asp 


Trp 


Lys 


Asn 


Arg 


Ala 


Asp 


Arq 


Glu 


Gin 


Arg 


He 


Glu 


Glu 


Val 


Leu 






115 










120 










125 








Thr 


Arg 


Vdl 


Gly 


Met 


Ser 


Arg 


Ly £ 


Ala 


Tyr 


Ly* 


Arg 


Pro 


Hi* 


Glu 


Leu 




130 










135 










140 










Ser 


Gly 


Gly Glu 


Gin 


Gin 


Arg 


Val 


Gly 


He 


Ala 


Arg 


Ala 


Leu 


Leu 


Ala 


145 










150 










155 








160 


Lys 


Pro 


Ala 


Leu 


He 


Leu 


Ala 


Asp 


Glu 


Pro 


Thr 


Gly 


Asn 


Leu 


Asp 


Ser 










165 










170 










175 




Val 


Thr 


Gly Leu 


Gin 


He 


Ala 


Ser 


Leu 


Leu 


Tyr 


Glu 


He 


Ser 


Lys 


Gin 








180 










1 85 










190 




Gly 


Thr 


Ala 


Val 


T.eu 


Het 


Ser 


Thr 


His 


Arn 


Ser 


Ser 


Leu 


Leu 


Ser 


Hi- 






195 










200 










205 








Leu 


Pro 


Ala 


Arg 


Thr 


Leu 


Ala 


Val 


Arg 


Lys 


Asn 


Sly 


Arp 


Ala 


Ssr 


Ser 




210 










215 










220 










Leu 


Val 


Glu 


Lou 


Ser 


Ala 


Asp 


Ala 


Val 


Ser 


Arg 


Lys 


Asn 


Thr 


Glu 


He 


225 










230 










235 













Asp 
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(2) INrORJiATlOll FOR SEQ ID NO: 3 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(OJ TOPOLOGY: linear 

MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi 1 ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 

Ux) FEATURE: 

(A) NAME/ KEY : mire feature 

(B) LOCATION 1. . .232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:396 

Net Arg Asp Val Thr Leu Cyr Gin Glu Glu Asn Val He Phe Gin A,n 

Lou A*n Leu Thr Leu Ser Ala Gly Asp Phe Val Tyr Leu He c!y Ser 

Val 31y Ser Gly Ly* Ser Thr Leu Leu Ly* Ala Leu Tyr Ala Glu Val 
Jo 40 



2b 

u Lys Ala Leu 

Pro II. Ser Aln Gly Tyr Ala Arg Val He Ai? p Tyr tap Lou Ala Ly. 

Leu Ly. Arg Ly Gin Leu Pre Tyr Leu Arg Arg aL Leu Gly Tic Val 

Phe Gin Arp Ph= Gin Leu Leu Asn Gly Arg Thr Val Ala Glu Asn IL 

Asp Phe val Leu Arg Ala Thr Asp Trp Lys As„ Ar, Ala Asp Arg Glu 

Gin Arg lie Glu Glu Val Leu Thr A*"g Vol Gly net Ser Arg Lys Ala 

120 12 5 

Tyr Ly. Arg Pro Hi, Glu Leu Ser Gly Gly Glu Glll G l„ Arg Val G1 

X35 140 

lie Ala Arg Ala Leu Leu Ala Lys Pro Ala Leu He Lsu Ala A*p Glu 
ISO ice 1 

Pro Thr Gly Asa Leu Asp Sor Val Thr Gly Leu Gin II. Ala Ser L~u 

Leu Tyr Glu lie Ser Lys Gin Gly Thr M° v*l Leu Met Ser Thr Hi, 

185 loo 

Asn Ser Ser Leu Leu Sor HI 3 Leu Pro Ala Arg Thr Leu *)» y al Arg 

200 20S ' 

Ly. A.„ Gly A.p Ala Ser Ser Leu Val Glu Leu Ser Ala Asp Ala Val 

" 215 220 

Ser Arg Lys Asa Thr Glu He A*?p 
225 230 

(2) INFORMATION FOR SEQ ID NO:397 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vj J ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivals 
tlx) FEATURE : 

(A) NAIIL/KEY: mis e_f ea tur- 

(B) LOCATION 1. . . 2io 

SEQUENCE DESCRIPTION: SEQ ID NO: 397 
Met lie Glu He Ser Asn Leu Thr L y« Val Phe Arg Thr Glu Glu He 
Glu Thr Val Ala Leu A,p Gly Val Ser Leu Ly s Val A,p Ly, Gly Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



332/490 



















25 
















Phe 


He 


Ala 


He 


UeL 


Gly 


Pro 


Ser 


Gly 


Cys 


Gl v 


Lys 


Ser 


Thr 


Leu 


Leu 






35 










40 








45 








A^n 


He 


Lou 


Gly 


Leu 


Leu 


I\so 


Asn 


Pro 


Thr 


Ser 


Gly 


He 


Tyr 


Lys 


Leu 




50 










55 










60 










Acp 


Gly 


Ala 


Glu 


Val 


Gly 


ho n 


Leu 


Axg 


Glu 


Lys 


Asp 


Arg 


Thr 


Ala 


Val 


65 










?o 










75 










30 


Arg 


Lys 


Gly 


Ar.n 


Tie 


Gly 


i?he 


V<*1 


Phe 


Gin 


Ser 


Phe 


Asn 


Leu 


He 


Glu 










as 










90 










95 




Glu 


n»t 


Thr 


Val 


Ser 


Glu 


Asn 


Val 


Glu 


Leu 


Pro 


Leu 


Val 


Tyr 


Leu 


Gly 








100 










105 










HO 






Val 


Lys 


Ala 


Sc-r 


Glu 


Arg 


Lys 


Glu 


Arg 


Val 


Glu 


Glu 


Ala 


Leu 


Arg 


Lys 






115 










120 










125 








Mot 


Ser 


lis 


Ser 


His 


Arg 


Ala 


Gly 


His 


Phe 


Pro 


Asn 


Gin 


Leu 


ser 


Gly 




1 30 










135 










140 










Gly 


Gin 


Gin 


Gin 


Arg 


Val 


Ala 


He 


Al a 


Arg 


Ala 


Val 


Val 


Ala 


Asn 


Pro 


145 










ISO 










155 










160 


Lys 


Leu 


He 


Lfeu 


Ala 


Acp 


GLu 


Pro 


Thr 


Gly 


Asn 


Leu 


Asp 


Scr 


Lys 


Asn 










165 










170 










175 




Giy 


Ala 


A?p 


Val 


Met 


Glu 


Leu 


Leu 


Arg 


Gly 


Leu 


Asn 


Arg 


Glu 


Gly 


Ala 








180 










185 










190 






Thr 


He 


Val 


Met 


Val 


Thr 


His 


S«r 


Glu 


His 


Asp 


Ala 


Arg 


Ser 


Ala 


Gly 






195 










200 










205 








Arg 


He 


He 


A- c n 


Leu 


Phe 


Asp 


Gly 


Lys 


He 


Arg 













210 215 



(2) INFORMATION FOR SEQ ID 110:398 

(X) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: S95 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

HI) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE; 

(A) ORGANISM: Porphyromonas qingivalls 

t xx ) FEATURE : 

I A) NAME/ KEY: misc_f eature 
t D ) LOCATION 1 ... 5 95 



SEQUENCE DESCRIPTION ; SEQ TD NO: 398 



Met 


Ly? 


Glu 


Phe 


Phe 


Lys 


list 


Phe 


Phe 


Ala 


Ser 


He 


Leu 


Gly 


Val 


He 










5 










10 










15 




Thr 


Ala 


Gly 


11 = 


He 


Leu 


fhe 


Cys 


He 


Phe 


L-u 


Phe 


He 


Phe 


Phe 


Gly 








20 










25 










30 






He 


Val 


Ala 


Gly 


He 


Ala 


Ser 


Lys 


Ala 


Thr 


Gly 


Gly 


Thr 


He 


Pro 


Lys 






35 










40 










45 








He 


Glu 


Aln 


Asn 


Ser 


He 


Leu 


His 


Ilf 


Unfc 


A5tl 


Ser 


Ser 


Phe 


Pro 


Glu 




50 










55 










60 










Hs 


Val 


Ser 


Ala 


Asn 


Pro 


Trp 


Ser 


Met 


Leu 


Thr 


Gly 


Lys 


Asp 


Glu 


Ser 


65 










70 










75 










80 


Val 


Ser 


Leu 


Ser 


Gin 


Ala 


Val 


Glu 


Ala 


He 


Gly 


Gln 


Ala 


Lys Ann A^n 










85 










9C 










95 




Pro 


Asn 


He 


Thi 


Gly 


He 


Phe 


Leu 


Asp 


Leu 


Asp 


Acn 


J.eu 


Ser 


Val 


Gly 








100 










105 








110 




Met 


Ala 


Scr 


Ala 


Glu 


Glu 


Leu 


Arg 


Arg 


Ala 


Leu 


Gin 


Asp 


Phe 


Lys 


Hot 






115 










120 










125 








SCL- 


Cly 


Ly* 


Phe 


Val 


Val 


Ser 


Tyr 


Ala 


Asp 


Arg 


Tyr 


Thr 


Gin 


Lys 


Gly 




130 










I3S 










140 










Tyr 


Tyr 


Leu 


Ser 


Ser 


He 


Ala 


Asp 


Lys 


Leu 


Tyr 


Leu 


Asn 


Pro 


Lys 


Gly 


US 










150 










155 










160 


Met 


Leu 


Gly 


Leu 


He 


Gly 


He 


Ala 


Thr 


Gin 


Thr 


Met 


Phe 


Tyr 


Lys 


AiSp 










165 










17C 










175 




Ala 


Leu 


Asp 


Lys 


Phe 


Gly 


Val 


Lys 


Met 


Glu 


11* 


Phe 


Lys 


Val 


Gly 


Thr 








180 










185 










190 




Tyr 


Lys 


Ala 


Ala 


Val 


Gl u 


Pro 


Phe 


Met 


Leu 


Aon 


Arg 


Met 


Ser 


A-sp 


Al« 






195 










200 










205 








Asn 


Arg 


Glu 


Gin 


He 


Thr 


Thr 


Tyr 


He 


Asn 


Gly 


Leu 


Trp Arp Lys 


He 




210 










215 








220 










Thr 


Ser 


Arp 


He 


Ala 


Glu 


Ser 


Arg 


Lys 


Thr 


Ala 


Met 


Asp 


Set 


Val 


Lyr 
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225 










Z30 










235 










240 


Det 


Phe 


Ala 


Air [J 


Lys 


Gly 


Glu 


Met 


Phe 


Gly 


Leu 


Ala 


Glu 


Lys 


Ala 


Val 










Z4l> 










250 










255 




Glu 


Met 


I.y** 


Leu 


Val 


Asp 


Glu 


Leu 


Ala 


Tyr 


Arg 


The 


Asp 


Val 


Glu 


Lys 








2b0 










265 










270 






Glu 


Leu 


Lys 


Lys 


MeL 


Scr 


Gin 


Arg 


Gly 


Glu 


Lys 


Asp 


Glu 


Leu 


Arg 


Phe 






275 










290 










285 








Val 


Scr 


Leu 


Ser 


Gin 


Val 


Leu 


Ala 


Asn 


Gly 


Pro 


Met 


Asn 




Thr 


Lys 




290 










295 










300 










Gly 


Ser 


At y 


He 


Ala 


Val 


Leu 


Phe 


Ala 


Glu Gly 


Glu 


He 


Thr 


G3 U 


Glu 


305 










310 










315 










320 


I ie 


Tie 


Lys 


Lys 


Pro 


Phe 


Asp 


Thr 


Asp 


Gly 


Ser 


Ser 


He 


Thr 


Gin 


Glu 










325 










330 










335 




Leu 


Ala 


Lys 


Glu 


He 


Ly3 


Ala 


Ala 


Ala 


Asp 


Asp 


Acp 


ASp 


11 e 


Lys 


Ala 








340 










34S 










350 




Val 


Val 


Lew 


Arg 


Val 


Asn 


Ser 


Pro 


Gly 


Gly 


Ser 


Ala 


Phe 


The 


Ser 


Glu 






355 










360 








365 








Gin 


lie 


Trp 


Lye 


Gin 


Vol 


Ala 


Asp 


Leu 


Lys 


Ala 


Lys 


Lys 


Pro 


TIP 


Val 




370 










375 










380 










Val 


Ser 


Met 


Gly 


Asp 


Val 


Ala 


Ala 


Scr 


Gly 


Gly 


Tyr 


Tyr 


He 


Ala 


Cy* 


385 










390 










395 










4 00 


Ala 


Ala 


Asn 


Ser 


He 


Val 


Ala 


Glu 


His 


Thr 


Thr 


Leu 


Thr 


Cly 


Ser 


lie 










4 05 










410 








415 




Gly 


Tie 


Phe 


Gly 


Met 


Phe 


Pro 


Asn 


Phe 


Ala 


Gly 


Val 


Ala 




Lys 


He 








420 










425 










430 




Gly 


Val 


Asn 


Met 


Asp 


Val 


Val 


Gin 


Thr 


Ser 


Lys 


Tyr 


Ala 


Asp 


Leu 


Gly 
















440 










445 








Asn 


Thr 


Phe 


Ala 


Pro 


Met 


Thr 


Val 


Glu 


Asp 


Arg 


Ala 


Leu 


He 


Gin Ara 




4 50 










455 










460 










Tyr 


lie 


Glu 


Gin 


Gly Tyr 


Asp 


Leu 


Phe 


Leu 


Thr 


Arg 


Val 


S-r 


Glu 


Gly 


4 65 










470 










4 75 










480 


Arg 


Asn 


Arg 


Thr 


Lys 


Ala 


Gin 


He 


Arp 


Ser 


He 


Ala 


Gin 


Gly Arg 


Vai 










485 










490 










495 




Trp 


Leu 


Gly 


Asp 


Lys 


Ala 


Leu 


Ala 


Leu 


Gly 


Leu 


Val 


A£*p 


Gl u 


Leu 


Gly 








500 










505 










510 






Gly 


Leu 


Asp 


Thr 


Ala 


He 


Lys 


Arg 


Ala 


Ala 


Lys 


Leu 


Ala 


Gin 


Leu 


Giy 






515 










520 










525 








Gly 


Asn 


Tyr 


Ser 


Il« 


Glu 


Tyr 


Gly 


Lys 


Thr 


Lys 


Arg 


Asn 


Phe 


Phe 


Glu 




530 










535 










540 










Glu 


Leu 


Leu 


Ser 


Ser 


Ser 


Ala 


Ala 


Asp Met 


Lys 


Ser 


Ala 


He 


Leu 


Ser 


545 










550 










555 










560 


Thr 


lie 


Leu 


Ser 


Asp 


Pro 


Glu 


He 


Glu 


Val 


Leu 


Arg 


Glu 


Leu 


Arg 


Ser 










565 










57 0 










575 




Met 


Pro 


Pro 


Arg 


Pro 


Ser 


Gly 


He 


Gin 


Ala 


Arg 


Leu 


Pro 


Tyr 


Tyr 


Phe 








500 










585 










590 





MeL Pro Tyr 
595 



(2) INFORMATION hX>R SEQ ID HO: 399 

{1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 ami no acid5 

(B) TVPC: amino aciU 
(D) TOPOLOGY : linear 

(jl) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL; YES 

(vi) ORIGIHAL SOURCE: 

IA> ORGANISM: Porphyromonas grngivalis 

(IxJ FEATURE: 

(A} HA] IE/ KEY : misc_f sature 
(B) LOCATIOlI 1...58 9 

(XI 1 SEQUENCE DESCRIPTION : SEQ ID HO: 399 

Mot Phe Phe Ala Ser He Leu Gly Val He Thr Ala "Gly He lie Leu 

15 ]Q 15 

Phe Cys He Phe Leu Phe He Phe Pht Gly He Val Ala Gly He Ala 

20 25 30 

Ser Lys Ala Thr Gly Gly Thr lie Pro Lys He Glu Ala A^n Ser He 

35 40 45 

Leu His He link Asn Ser Ser Phe Pro Glu He Val Ser Ala Ann pro 
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50 










55 










60 










Trp 


Ser 


t 1st 


Leu 


Thr 


Gly 


Lys 


Asp 


Glu 


Ser 


Val 


Ser 


Leu 


Ser 


Gin 


Ala 


65 










7 0 










75 










80 


Val 


Glu 


Al s 


He 


Gly 


Gin 


Ala 


Lys 


Asn 


Asn 


Pro 


A£-n 


He 


Thr 


Gly 


He 










85 










90 










95 




Phe 


Leu 


Asp 


Leu 


Asp Asn 


Leu 


Ser 


Vai 


Gly 


Het 


Ala 


Ser 


Ala 


Glu 


Glu 








100 










105 










110 






L^u 


Arg 


Arg 


Ala 


Leu 


Gin 


Asp 


Phe 


Lys 


Hot 


Ser 


Gly 




Phe 


Val 


Val 






115 










120 










125 








Ser 


Tyr 


Ala 


A&p 


Arg 


Tyr 


Thr 


Gin 


Lys 


Gly 


Tyr 


Tyr 


Leu 


Ser 


Ser 


lie 




130 










135 










140 










Ala 


Asp 


Lys 


Leu 


Tyr 


Leu 


Asn, 


Pro 


Lys 


Gly 


llfit 


Leu 


Gly Leu 


He 


Gly 


145 










150 










155 










160 


lie 


Aid 


Thr 


Gin 


Thr 


Met 


Phe 


Tyr 


Lys 


Asp 


Ala 


Leu 


Asp Lys 


Phe 


Gly 










165 










170 










175 




Val 


Lys 


liet 


Glu 


He 


Phe 


Lys 


Val 


Gly 


Thr 


Tyr 


Lys 


Ala 


Ala 


Val 


Glu 








180 










185 










190 






Pro 


Phe 


Mel. 


Leu 


Asn 


Arg 


Met 


Ser 


Arp 


Ala 


Asn 


Arg 


Glu 


Gin 


Ho 


Thr 






195 










200 










205 








Thr 


Tyr 


He 


Asn 


Gly 


Leu 


Trp 


Asp 


Lys 


He 


Thr 


S-r 


Asp 


He 


Ala 


Glu 




110 










215 










220 










Ser 


Arg 


Lys 


Thr 


Ala 


net 


A*p 


Ser 


Val 


Lys 


ilet 


Phe 


Ala 


Asp 


Lys 


Gly 


225 










230 










235 










240 


Glu 


Met 


Phe 


Gly 


L»u 


Ala 


Glu 


Lys 


Ale 


Val 


Glu 


Met 


hyp 


Leu 


Val 


Asp 










24 5 










2S0 










255 




Glu 


Leu 


Ala 


Tyr 


Arg 


Thr 


Asp 


Val 


Glu 


Lys 


Glu 


Leu 


Lys 


Lys Met 


Ser 


















2*5 










270 






Gin 


Arg 


Gly 


Glu 


Lys 


Asp 


Glu 


Leu 


Arg 


Phe 


Val 


Per 


Leu 


Ser 


Gin 


Val 






275 










280 










285 








Leu 


Ala 


Asn 


Gly 


Pro 


Met 


Asn 


Lys 


Thr 


Lys 


Gly 


Ser 


Arg 


He 


Ala 


Val 




290 










295 










300 










Lou 


Phe 


Ala 


Glu 


Gly Glu 


He 


Thr 


Glu 


Glu 


n & 


Tie 


Lys 


Lys 


Pro 


Phe 


305 










310 










315 










320 


Asp 


Thr 


Asp 


Gly 


Ser 


Scr 


He 


Thr 


Gin 


Glu 


Leu 


Ala 


Lys 


Glu 


He 


Lys 










325 










330 










335 


Ala 


Ala 


Ala 


Asp Asp Asp Asp 


He 


Lys 


Ala 


Val 


Val 


Leu 


Arg 


Val 


Asn 








340 










345 










350 






Ser 


Pro 


Gly Gly Ser Ala 


Phe 


Thr 


Ser 


Glu 


Gin 


lie 


Trp 


Lys 


Gin 


Val 






355 










360 










365 








Ala 


A5p 


Leu 


Lys 


Ala 


Lys 


Lys 


Pro 


He 


Val 


Val 


Ser 


Met 


Gly Asp Val 




370 










37 5 










380 










Ala 


Ala 


Ser 


Gly Gl y Tyr 


Tyr 


Ha 


Ala 


Cys 


Ala 


Ala 


Asn 


Ser 


He 


Val 


385 










390 










395 










400 


Ala 


3lu 


His 


Thr 


Thr 


Leu 


Thr 


Gly 


Ser 


He 


Gly 


He 


Phe 


Gly Met 


Phe 




















410 










415 




Pro 


At-n 


Phc 


Ala 


Gly Val 


Ala 


Lys 


Lys 


He 


Gly 


Val 


Asn 


Ilet 


Asp 


Val 








420 










42J> 










430 






Val 


Gin 


Thr 


Ser 


Lys 


Tyr 


Ala 


Ar,p 


Leu 


Gly 


Asn 


Thr 


Phe 


Ala 


Pro 


Met 






4 35 










440 










445 








Thr 


Val 


Glu 


Acp Arg 


Ala 


Leu 


He 


Gin 


Arg 


Tyr 


He 


Glu 


Gin 


Gly Tyr 




450 










455 










460 










Asp 


L-eu 


Phe 


Leu 


Thr 


Arg 


Val 


Ser 


Glu 


Gly 


Arg 


Asn 


Arg Thr 


Lys 


Ala 


465 










470 










4 75 










480 


Gin 


lis 


Asp 


Scr 




Ala 


Gin 


Gly Arg 


Val 


Trp 


Leu 


Gly Asp 


Lys 


Ala 










485 










4 90 










4 95 




Leu 


Ala 


Leu 


Gly 


Leu 


Val 


Asp 


Glu 


Leu 


Gly Gly 


Leu 


Asp 


Thr 


Ala 


He 








500 










505 










510 






Lys 


Arg 


Al a 


Ala 


Lys 


Leu 


Ala 


Gin 


Leu 


Gly 


Gly 


Asn 


Tyr 


Ser 


He 


Glu 






515 










520 








525 








Tyr 


Gly 


Lys 


Thr 


Lys 


Arg 


Asn 


Phe 


Phe 


Glu 


Glu 


Leu 


Leu 


Ser 


Ser 


Ser 














535 










540 










Ala 


Ala 


Asp 


Mot 


Lys 


S-r 


Ala 


He 


Leu 


Ser 


Thr 


He 


Leu 


S?r 


Asp 


Pro 


5-15 










550 










555 








560 


Glu 




Glu 


Val 


Leu 


Arg 


GlU 


Leu 


Arg 


Ser 


list 


Pro 


Pro 


Arg 


Pro 


Ser 










565 










570 










57 5 




Gly 


lie 


Gin 


Ala 


Arg 


Leu 


Pro 


Tyr 


Tyr 


Phs 


tter 


Pro 


Tyr 









580 585 



(2) INFORMATION FOR. SEQ ID 110:4 00 

(il SEQUEttCR CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

( B) TYPE: amino acid 
(Dl TOPOLOGY: linear 
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WO 99/29870 



PCT/AU98/01023 



335/ 490 



(ii) MOLECULE TYFE : protein 

(nil HYPOTHETICAL: YES 

(vj ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva Its 

(ix) FEATURE : 

(A) NAME/ KEY: miscf eature 

(B) LOCATlOi; I. . .313 



(xi) SEQUENCE DESCRI PTIOll : SEQ ID 1 JO: 4 00 



Met 


Arg 


Ala 


Asn 


lie 


Trp 


Gin 


lie 


Leu 


Ser 


Val 


Ser 


Val 


Leu 


Phe 


The 


1 








5 










10 










15 




Phe 


Gly 


Thr 


Ala 


lie 


Gly 


Gin 


Ala 


Gin 


Ser 


Arg 


Asn 


Arg 


Thr 


Tyr 


Glu 








20 










25 










30 






Ala 


Tyr 


Vnl 


Lye 


Gin 


Tyr 


Ala 


Asp 


Glu 


Ala 


He 


Axg 


Gin 


Met 


Ser 


Arg 






35 










40 










45 








Tyr 


Asn 


11 = 


Pro 


Ala 


Ser 


lie 


Thr 


lie 


Ala 


Gin 


Ala 


Leu 


Val 


Glu 


Thr 




50 










55 










60 










Gly 


Ala 


Gly 


Ala 


Ser 


Thr 


Leu 


Ala 


Ser 


Val 


Hit; 


Asm 


Asn 


His 


Phe 


Gly 


65 










70 










15 










30 


lie 


L V S 


Cys 


His 


Lys 


Set 


Trp 


Thr 


Gly 


Lys 


Arg 


Thr 


Tyr 


Arg 


Thr 


Asp 










85 










90 














Asp 


Al d 


Fro 


Arn 


Glu 


Cys 


Phe 


Arg 


Ser 


Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Scr 








ICO 










105 










110 






Tyr 


Glu 


Asp 


Ids 


Ser 


Arg 


Phe 


L-u 


Leu 


Gin 


Pro 


Arg 


Tyr 


Arg 


Pro 


Leu 






115 










120 










125 








Phe 


Lys 


Leu 


Asp 


Arg 


Glu 


Asp 


Tyr 


Arg 


Gly 


Trp 


Ala 


Thr 


Gly 


Leu 


Gin 




130 










135 










140 










Ara 


Cys 


Gly 


Tyr 


Ala 


Thr 


Asn 


7\rg 


Gly 


Tyr 


Ala 


Asn 


Leu 


Leu 


He 


Lys 


145 










160 










15S 










160 


Met 


Val 


Glu 


Leu 


Tyr 


Glu 


Leu 


Tyr 


Ala 


Leu 


Asp Arg 


Glu 


Ly^ 


Tyr 


Pro 










165 










170 










175 




Ser 


Trp 


Phe 


Hi a 


Lys 


Ser 


Tyr 


Pro 


Gly 


Ser 


Asn 


Lys 


Lys 


Ser 


His 


Gin 








ieo 










186 










190 






Thr 


Thr 


Lys 


Gin 


Lys 


Gin 


Ser 


Gly 


Leu 


Lys 


His 


Glu 


Ala 


Tyr 


Phe 


Ser 






195 










200 










205 








Tyr 


Gly 


L«u 


Leu 


Tyr 


11* 


He 


Ala 


Lys 


Gin 


Gly 


Asp 


Thr 


Phe 


Asp 


Ser 




210 










215 










220 










Leu 


A) a 


Glu 


Glu 


Phe 


Asp 


Met 


Arg 


Ala 


Ser 


Lys 


he.u 


Ala 


Lys 


Tyr 


Asn 


225 










230 










235 










240 


Asp 


Ala 


Pro 


Val 


Asp 


Phe 


Pro 


He 


Glu 


Lys 


Gly 


Asp 


Val 


lie 


Tyr 


Leu 










24b 










2 50 










255 




Glu 


Ly.* 


Lys 


His 


Ala 


Cys 


Ser 


He 


Ser 


Lye 


His 


Thr 


Gin 


His 


Val 


Val 








260 










265 










270 






Arg 


Val 


Gly 


Asp 


Ser 


Mst 


His 


Ser 


He 


Ser 


Gin 


Arg 


Tyr 


Gly 


He 


Arg 






275 










280 










285 








Met 


Lys 


Asn 


Leu 


Tyc 


Lys 


Ltu 


Asn 


Asp 


Lys 


Asp 


GJy 


Glu 


Tyr 


He 


Pro 




290 










295 








300 








Gin 


Glu 


Gly 


Asp 


lie 


Leu 


Arg 


Leu 


Arg 
















305 










310 






















(ZJ 


IH FORMATION 


FOR 


SEQ 


ID NO:401 

















(il SE WEI ICE CHARACTERISTICS: 

{A) LENGTH: 523 amino acids 
(R) TYPE: amino acid 
4 0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

iv t) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonat yingi vails 

(ix) FEATURE: 

(A) NAME/ KEY: mis c_f eature 
<B) LOCATION 1. . .523 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

Met Asp Gly Arg Arq Tyr Ser Asp Giy Leu His Gin Ala Tic Glu Ala 
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1 








5 










10 










IS 




Lys 


Glu 


Hie 


Val 


Lys 1 


Val 


Glu 


Ala 


A.1 a 


Thr 


Gin 


Thr 


Phe 


Ala 


Thr 


He 








20 










25 










30 






Thr 


Leu 


Gin 


At n 


Tyr 


Phe 


Arg 


Met 


Tyr 


His 


Li'i 


Leu 


Ala 


Gly 


Het 


Thr 






3S 










40 










45 








Gly 


Thr 


Ala 


Glu 


Th r 


Glu 


Ala 


Gly 


Glu 


I.eu 


Trp 


Asp 


He 


Tyr 


Lys 


Leu 




50 










55 










60 










Asp 


Val 


V a 1 


Vol 


He 


Pro 


Thr 


Asn 


Lys 


Pro 


lie 


Ala 


Arg 


Lys 


Asp 


Het 


65 










70 










75 










SO 


A-Tl 


Asp 


Ara 


He 


Tyr 


Lys 


Thr 


Ala 


Arg 


Glu 


Lys 


Tyr 


Ala 


Ala 


Val 


lie 










85 










90 










95 




Glu 


Glu 


lie 


Val 


Arg 


I.eu 


Val 


Glu 


Glu 


Gly 


Arg 


Pro 


Val 


Lf u 


Val 


Gly 








1 00 










105 








110 






Thr 


Th r 


S*r 


Val 


Glu 


He 


Ser 


Glu 


Lou 


Leu 


Ser 


Arg 


Het 


Leu 


Arg 


Leu 






1 1 5 










120 










125 








Arg 


Gly 


lie 


Gin 


His 


Asn 


Val 


Leu 


Asn 


Ala 


Lyc 


Leu 


His 


Gin 


Lys 


Glu 




1J0 










135 










14C 










Ala 


Glu 


J 1 e 


Val 


Ala 


Gin 


Ala 


Gly 


Gin 


Lys 


Gly 


Thr 


Val 


?hr 


He 


Ala 


145 










150 










155 










160 


Thr 


Asn 


Me t 


Ala 


Gly 


Arg 


Gly Thr 


Asp 


He 


Lj" 


Leu 


Ser 


Ala 


Glu 


Val 










1 65 










17 0 










175 




Lys 


f.yr* 


Ala 


Gly 


Gly 


Leu 


Ala 


Tie 


He 


Gly 


Thr 


Glu 


Arg 


His 


Glu 


Ser 








1 00 










165 








190 






Arg 


Arg 


Val 


Asp 


Axg 


Gin 


Lou 


Arg 


Gly 


Arg 


Ser 


Gly Arg 


Gin 


Gly 


Asp 






1 f'S 










200 










205 








Pro 


Gly 


Scr 


Se r 


He 


The 


Tyr 


Val 


Ser 


Leu 


<^lu 


Asp 


His 


Leu 


flet 


Arg 




210 










215 










220 










Leu 


Phe 


Ala 


Thr 


Glu 


Lys 


He 


Ala 


Ser 


Leu 


Met 


Asp Arg 


Lou 


Gly 


Phe 


225 










230 










235 










24 0 


Lys 


Gl u 


Gly 


Glu 


Val 


Leu 


Glu 


Asn 


Ai»r\ 


Viet 


Leu 


Ser 


Lys 


Ser 


Val 


Glu 










2 4 5 










250 










255 




Arg 


Ala 


Gln 


Lye 


Lys 


Val 


Glu 


Glu 


Asn 


Asn 


Phe 


Gly 


He 


Arg 


Lys 


His 








260 










265 










27 0 






Leu 


Leu 


Glu 


Tyr 


Asp 


Asp Val 


I let 


Asn 


Ser 


Gin 


Arg 


Glu 


Val 


He 


Tyr 






275 










280 










265 








Thr 


Arg 


Arg 


Arg 


His 


Ala 


Leu 


Het 


Gly 


Glu 


Arg 


He 


Gly 


Met 


Asp 


Val 




290 










295 










300 










Leu 


Asn 


Thr 


He 


Tyr 


Asp 


Val 


Cys 


Lys 


Ala 


Leu 


He 


Asp Asn 


Tyr Ala 


305 










310 










315 










320 


Glu 


Ala 


Asn 


A<sp 


Phe 


Glu 


Gly 


Phe 


Lys 


Glu 




Leu 


Met 


Arg 


Ala 


Leu 










325 










330 










335 




Ala 


lie 


Glu 


Ser 


Pro 


He 


Thr 


Gin 


Glu 


He 


Phe 


Arg 


Gly 


Lys 


Lys 


Ala 








340 










345 










350 






Glu 


Glu 


Lou 


Thr 


Asp 


Met 


Leu 


Phe 


Asp 


G3 u 


Ala 


Tyr 


Lys 


Ser 


Phe 


Gin 
















360 










365 








Arg 


Lys 


list 


Asp 


Leu 


He 


Ala 


Glu 


Val 


Ala 


Hl5 


Pro 


Val 


Val 


HI G 


Gin 




370 










375 










300 










Val 


Phe 


Glu 


Thr 


Gin 


Ala 


Ala 


Val 


Tyr 


Gl u 


Arg 


He 


Leu 


He 


Pro 


He 


305 










390 










395 










400 


Thr 


A.~p 


Gly 


Lys 


Arg 


Val 


Tyr Asn 


He 


Gly 


Cys 


Asn 


Leu 


Arg 


Glu 


Al a 










405 










410 








415 




Asp 


Glu 


Thr 


Gli\ 


Gil- 


Lys 


Ser 


He 


He 


Lys 


Glu 


Php 


Glu 


Lys 


Ala 


He 








420 










425 










430 






Val 


Leu 


His 


Thr 


lie 


Asp 


Glu 


Scr 


Trp 


Lys 


Glu 


His 


Leu Arg 


Glu 


Met* 
















440 










445 








Asp 


Glu 


Leu 


Arg 


Asn 


ser 


Val 


Gin 


Asn 


Ala 


Ser 


Tyr 


Glu 


Asn 


Lys 


Asp 




45C 










455 










4 60 








Pro 


Leu 


Leu 


He 


Tyr 


Lys 


Leu 


Glu 


Ser 


Tyr 


Glu 


Leu 


Phe 


Arg 


Lys 


Met 


465 










4 7 0 










475 










430 


Val 


Glu 


Ala 


Met 


Asn 


Arg 


Lys 


Thr 


Val 


Ala 


He 


Leu 


Het 


Arg 


Ala Arg 










485 










490 










4 95 




lie 


Pro 


Val 


Pro 


Glu 


Ala 


Pro 


Ser 


Gin 


Glu 


Glu 


Leu 


Glu 


His 


Arg 


Arg 








500 










505 










510 




Gin 


lie 


Glu 


lie 


Arg 


His 


Ala 


Thr 


Gin 


Cln 


Arg 













515 520 



<2J inrORIIATIOri fcOR SEQ ID NO: 402 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

Hi) MOLECULE TYPE: protein 



SUBSTITlJTE SHEET (tool* 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



337/ 490 



(ui ) HYPOTHETICAL: YES 

(vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyrons nas gingival! s 

tlx J FEATURE: 

I A) NAME/ KEY: mlscfeature 
(B> LOCATION 1-..37S 



Cxi I SEQUENCE DESCRIPTION: SEO ID HO: 402 



Met 


Am 


Phe 


Leu 


Lys 


Lys 


Glu 


Pro 


Phe 


I.Y3 


He 


Che 


Ser 


Het 


He 


Tyr 


1 








5 










10 










IS 




Leu 


Leu 


Leu 


Asp 
20 


Thr 


He 


Thr 


Asn 


Arg 
25 


Ala 


Gly 


Thr 


Glu 


Arg 
30 


Ala 


Val 


lis 


Ash 


I.CU 
35 


A\a 


Asn 


Asn 


Leu 


His 
40 


Ala 


Am 


Gly 


HIS 


Arg 
45 


Val 


Ser 


Leu 


Val 


Ser 
50 


Val 


Cys 


TUc 


Lyz 


Glu 
55 


Gly 


Glu 


Pro 


Ser 


Phe 
60 


Gin 


Val 


G] u 


Lys 


cu- 


lis 


bl U 


Val 


His 


Ills 


L-u 


Gly 


He 


Arg 


Leu 


Tyr 


Gly Asn 


Ala 


Leu 


es 










70 










75 










80 


Ala 


Arg 


Lys 


Thr 


Val 
05 


Tyr 


Phe 


Lys 


Ala 


Tyr 
90 


Arg 


Arg 


He 


Lys 


Ala 


Leu 


Tyr 


Lys 


Lye 


Arg 
100 


Glu 


Pro 


Val 


Leu 


Leu 
105 


He 


Gly 


Thr 


Asn 


He 
110 


Phe 


He 


Asn 


Thr 


lie 


Leu 


Ser 


Gin 


Tie 


Ser 


Asn 


Arg 


Gly Arg 


He 


Phe 


Thr 


He 






115 










120 










125 








Gly 


Cys 
1 JG 


Glu 


His 


lis 


Ser 


Tyr 
135 


Asp 


He 


Ala 


Arg 


Pro 
140 


He 


Thr 


Lys 


Arg 


lie 


Arq 


Gly 


Hie 


Leu 


Tyr 


Ser 


Gly 


Leu 


Asp 


Ala 


Val 


Val 


Ala 


Leu 


Thr 


145 










150 










155 










160 


Lye 


Arg 


Asp 


Gin 


Gin 


Ser 


Phe 


Glu 


Ala 


He 


L*u 


Arg 


Gly Axg 


Ser 


Lys 










165 










170 










175 




Ala 


Tyr 


Val 


He 
180 


Pro 


Asn 


Gin 


Val 


Ser 
185 


Phe 


Thr 


Thr 


Val 


Gin 
190 


Arg 


Asp 


Ala 


Thr 


Thr 


ItlE 


Lys 


Gin 


Het 


Lou 


Ala 


Xle 


Gly 


Arg 


Leu 


Thr 


Tyr Gin 






195 










200 










205 








Lys 


Gly 


Phe 


Glu 


Phc 


Met 


He 


Glu 


Asp Ala 


Ser 


Arg 


Val 


Leu 


Axg 


Glu 




210 










215 










220 










Arg 


Pro 


Asp 


Trp 


Lys 


Leu 


He 


Tie 


Val 


Gly 


Asp 


Gly Glu Asn 


Glu 


Ser 


225 










230 










235 










240 


Met 


Leu 


Arg 


Lys 


GlU 
245 


lie 


Ala 


Ser 


Arg 


Asn 

2 SO 


Met 


Glu 


Ser 


Gin 


He 

25 5 


Glu 


lie 


HI F 


Pro 


Ssr 

260 


Thr 


Pro 


Glu 


Tie 


Arg 
265 


Lys 


Tyr 


Tyr 


Glu 


Ser 
27 0 


Ser 


Ala 


He 


Tyr 


Leu 


Het 


Thr 


Ser 


Arg 


Phe 
2B0 


Glu 


Gly 


Lsu 


Pro 


Met 
285 


Val 


Leu 


Leu 


Glu 


Al s 


Glu 


Ala 


Tyr 


Ala 


Leu 


Pro 


He 


He 


Ser 


Tyr 


Asp 


Cys 


Pro 


Thr 




290 










2S5 










300 








Gly 


Pro 


Arg 


Glu 


Leu 


He 


Glu 


Asn 


Gly 


Arg 


Asn 


Gly 


Phe 


Leu 


Val 


Pro 


305 










310 










315 










320 


Met 


Glu 


Ala 


His 


Glu 


Asp 


Phe 


Ala 


Asp 


Lys 


Leu 


Arg 


Leu 


Leu 


Met Asp 










32S 










330 










335 




Asp 


Glu 


Thr 


Leu 


Arg 


Lys 


Lys 


Het 


Gly Gin 


Glu 


Ser 


Glu 


Leu 


Het 


Val 








340 










345 










350 






Lys 


Ser 


Tyr 
335 


Ssr 


Pro 


Ala 


Asn 


He 
360 


Tyr 


Glu 


Cys 


Trp 


Lys 
365 


Lys 


Leu 


Phe 


Val 


Glu 
370 


He 


Gly 


Tyr 


I let 


Asn 
375 





















12) IUF0Ri a lATIO11 FOR SEO. ID HO: 4 03 

(i) SEOUEMCE CHARACTERISTICS: 

(A) LENGTH : 362 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY : linear 

<ii> MOLECULE TYPE: protein 

(HI) HYPOTHETICAL: YES 

ivi) ORIGINAL SOURCE: 

(A) ORGAMISI1: Porphyromonas gingivalis 

{ lx ) FEATURE : 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



338/490 



(Al NAME/ KEY; misc_featurs 
(Qt LOCATICHt 1...362 

txa.) SEQUENCE DESCRIPTION: SEO ID 110:403 



1 te t 


He 


Tyr 


Leu 




Leu 


sp 




11 




Asn 




Ala 


Gly 


Thr 


Glu 










5 










3 0 










15 




Arg 


Ala 


Va 1 


1 1 e 


A^n 


L**u 






Asn 


Leu 


Hie 


Ala 


A?n 


Gly 


His 


A 

Arg 








20 










25 










30 






Val 


Ser 


Leu 


Val 


Ser 
er 


Val 


V s 


Thr 


Lys 


1U 


L. > 




Pro 


Ser 


Phe 


Gin 






35 










4 0 










45 








Val 


Glu 


t x"<r 


Gl y 


He 


Gl u 


Val 


His 


is 


eu 


Gl'" 


11** 


Arg 


Leu 


Tyr 


Gly 




50 










55 
















A~n 


Al a 


I ~IJ 

,en 


Al a 


rq 


Lys 


Thr 


Val 


Tyr 




L*/F 


Ala 


Tyr Arg Arg 


lie 


65 










7 o 










75 










00 


Lys 


Ala 


eu 


T""r 


Lye 


Ly** 


Arg 


l»1u 


Pro 


> al 


Leu 


Leu 


He 


Gly 


Thr 


Asn 










8 5 




















95 




lie 


Phe 


He 




Thr 


He 


Leu 


er 






" r 


A-n 


Arg 


Gly Arg 


e 








100 










in" 










110 








r 


He 


Gly 


ys 


G 1 




i in 


^er 


*lyr 


Asp 


He 


Ala 


Arg 


Pro 


e 






115 










170 










125 








Thr 


l.ys* 


Arg 


ll- 


Arg 


Gly 


Phts 


Leu 


Tyr 


Ser 


Gly 


Leu 


Asp 


Ala 


Val 


v«,i 




130 




















140 










Ala 


Leu 


Thr 


Lys 


Arg 


Asp 


Gin 


Gin 




Ph* 


Glu 


Ala 


He 


Leu 


Arg 




145 










150 










155 










160 


Arg 


Ser 




Ala 


Tyr 


Val 


He 


Pro 


Asn 


Gin 


Val 


Ser 


Phe 


Thr 


Thr 


Val 










165 










170 










175 




Gin 


Arg 


Asp 


Ala 


Thr 


Thr 


Hi? 


Lys 


Gin 


Met 


Leu 


Ala 


He 


Gly Arg 


Leu 








ieo 










185 










190 






Thr 


Tyr 


Cln 


Lys 


Gly 


Phe 


Glu 


Phc 


Met 


He 


Glu 


Asp 


Ala 


Ser 


Ary 


Val 






1 95 










200 










205 








Leu 


Arg 


Glu 


Arg 


Pro 


Asp 


Trp 


Lys 


Leu 


lie 


lie 


Val 


Gly 


Asp 


Gly 


Glu 




210 










215 










220 










A/^n 


Glu 


Ser 


Met 


Leu 


Arg 


Lys 


Glu 


lie 


Ala 


Ser 


Arg 


Asn 


Met 


Glu 


Ser 


225 










230 










235 










240 


Gin 


He 


Glu 


He 


His 


Pro 


Ser 


Thr 


Pro 


Glu 


Ho 


Arg 


T.ys 


Tyr 


Tyr 


Glu 










245 










2b0 










255 




Ser 


Ser 


Ala 


He 


Tyr 


Leu 


Met 


Thr 


Ser 


Arg 


Phe 


Glu 


Gly Leu 


Pro 


Met 








260 










265 










270 






Val 


Leu 


Leu 


Glu 


Ala 


GlU 


Ala 


Tyr 


Ala 


Leu 


Pro 


He 


He 


S-r 


Tyr 


Asp 






275 










260 










285 








Cys 


Pro 


Thr 


Gly 


Pro 


Arg 


Glu 


Leu 


He 


Glu 


Asn 


Gly Arg A&n 


Gly 


Phe 




290 










295 










300 










Leu 


Val 


Pro 


Hot 


Glu 


Ala 


His 


Glu 


Asp 


Phe 


Ala 


Asp 


Lys 


Leu 


Arg 


Leu 


305 










310 










315 










32 0 


J.CII 


Met 


Asp 


Asp 


Glu 


Thr 


Leu 


Arg 


Lys 


Lys 


Met 


Gly Gin 


Glu 


Ser 


Glu 










325 










330 










335 




Leu 


Met 


Val 


Lys 


Ser 


Tyr 


Ser 


Pro 


Ala 


Asn 


He 


Tyr 


Glu 


Cys 


Trp 


Lys 








340 










345 










350 






Lys 


Leu 


Phe 


Val 


Glu 


He 


Gly 


Tyr 


Met 


Asn 


















355 










360 



















(2) INFORMATION FOR SEQ ID NO: 404 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 amino acids 

(B) TYPE: amino acid 
f D> TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

(Yi) ORIGINAL SOURCE: 

(A) ORGANISM: Perphyromonae gingival is 

(ix) FEATURE: 

(A) MAI IE/ KEY : mi sc_featu re 

(B) LOCATION' 1...64 0 

[y,i) SE0UEUCE DESCRIPTION: SEQ ID NO: 4 0 1 

Met Gly Lys lie He Gly He Asp Leu Gly Thr Thr Asn Ser Cys Val 

IS 10 15 

Ser Val Leu Glu Gly Asn Glu Pro He Val He Thr Asn Ser Ulu Gly 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



339/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 









20 










2b 










30 






Lys 


Arg 


Thr 


Thr 


Pro 


Ser 


Val 


Val 


Ala 


Phe 


Val 


Asp 


Gly 


Gly Glu 


Arg 






35 










40 










45 








Lys 


Val 


Gly 


Acp 


Pro 


Ala 


Lys 


Arg 


Gin 


Ala 


lie 


Thr 


Asn 


Pro 


Thr 


Lys 




50 










55 










60 










Thr 


lie 


Tyr 


Sei 


He 


Lys 


Arg 


Phe 


Met 


Gly 


Glu 


The 


Tyr 


Asp 


Gin 


Val 


65 










70 










75 










80 


Sec 


Arg 


Glu 


Val 


Glu 


Arg 


Val 


Pro 


Phe 


Lys 


Val 


Val 


Arg 


Gly Asp 


Asn 










65 










90 










95 




Acn 


Thr 


Pro 


Arg 


Val 


Asp 


Tie 


Asp 


Gly 


Arg 


Leu 


Tyr 


Thr 


Pro 


Gin 


Glu 








100 










105 










110 






Ila 


Ser 


Ala 


Met 


He 


Leu 


Gin 


Lys 


Met 


Lys 


Lys 


Thr 


Ala 


Glu 


Aso 


Tyr 






115 










120 










125 








Leu 


Gly 


Gin 


Glu 


Val 


Thr 


Gl u 


Ala 


Val 


He 


Thr 


Val 


Pro 


Ala 


Tyr 


Phe 




130 










135 










140 










Asn 


Asp 


Ala 


Gl n 


Arg 


Gin 


Ala 


Thr 


Lys 


Glu 


Ala 


Gly 


G1 u 


He 


Ala 


Gly 


Mb 










150 










155 










160 


Leu 


Lys 


Val 


Arg 


Arg 


He 


Val 


Asn 


Gin 


Pro 


Thr 


Ala 


Ala 


Ser 


Leu 


Ala 










165 










170 










175 




Tyc 


Gly 


Leu 


Acp 


Lys 


Ser 


Asn 


Lys 


Asp HsL 


Lys 


Tic 


Ala 


Val 


Phe 


Asp 








ieo 










105 










190 






Leu 


Gly 


Gly 


Gly 


The 


Phe 


Asp 


Tie 


Ser 


He 


Leu 


Glu 


Leu 


Gly 


Asp 


Gly 






195 










200 










205 








Val 


phe 


Glu 


Val 


Ly3 


Ser 


Thr 


Asn 


Gly Asp 


Thr 


His 


Leu 


Gly 


Gly 


Asp 




210 










215 










220 










Asp 


Phe 


Asp 


His 


Val 


Hit 


Ho 


Asp 


Trp 


Leu 


Ala 


Glu 


Glu 


Ptie 


L 1 s 


Ser 


2? 5 










230 










235 










240 


Gin 


Glu 


Gly 


Val 


Asp 


Leu 


Arg 


Gin 


Asp 


Pro 


I let 


Ala 


Het 


Gin 


Arg 


Leu 










24 5 










250 










2S5 




Lys 


Glu 


Ala 


Ala 


Glu 


Lys 


Ala 


Lys 


He 


Glu 


Leu 


Ser 


Ser 


Thr 


S?r 


Ser 








2G0 










255 










270 






Thr 


Glu 


lie 


Asn 


Leu 


Pro 


Tyr 


lis 


Met 


Pro 


Val 


Asn 


Gly 


He 


Pro 


Lys 






275 










280 










285 






)\xs 


Leu 


Val 


Met 


Thr 


Leu 


Thr 


Arg 


Ala- 


Lys 


Fhe 


Glu 


Gin 


Leu 


Ala 


Asp 




290 










295 










300 










Arg 


L«m 


lie 


Gin 


Ala 


Cya 


Val 


Ala 


Pro 


Cys 


Glu 


Thr 


Ala 


Leu 


Lv 


Asp 


305 










310 










315 










320 


Ala 


Gly 


Met 


Ser 


Arg 


Gly 


Asp 


He 


Asp 


Glu 


Val 


He 


Leu 


Val 


Gly 


Gly 










325 










330 










335 




Sor 


Thr 


Arg 


He 


Pro 


Ala 


He 


Gin 


Glu 


He 


Val 


Glu 


Lys 


He 


Phe 


Gly 








340 










345 










350 






Lys 


Ala 


Pro 


Ser 


Lye 


Gly Val 


Asn 


Pro 


Asp 


Glu 


Val 


Val 


Ala 


Val 


Gly 






355 










360 










365 








Ala 


Ala 


Tie 


Gin 


Gly Gly Val 


Leu 


Thr 


Gly Glu 


Val 


Lys 


Asp 


Val 


Leu 




370 










37 5 










380 










Leu 


Leu 


Asp 


Val 


Thr 


Fro 


Lett 


Ser 


Leu 


Gly 


He 


Glu 


Thr 


Met 


Gly 


Gly 


385 










390 










395 










400 


Val 


Met 


Thr 


Arq 


Leu 


He 


Asp 


Ala 


Asn 


Thr 


Thr 


He 


Pro 


Thr 


Lys 


Lys 










4 05 










410 










415 


Ser 


Glu 


He 


Phe 


Thr 


Thr 


Ala 


Val 


Asp Asn 


Gin 


Pre 


Ser 


Val 


Glu 


He 








420 










42S 










430 






His 


Val 


Leu 


Gin 


Gly Glu 


Arg 


Ser 


Leu 


Ala 


Lys 


Asp 


Asn 


Lys 


Ser 


He 






435 










440 










445 








Gly 


Arg 


Phe 


Asn 


Leu 


Asp 


Gly 


He 


Ala 


Pro 


Ala 


Pro 


Arg 


Gin 


Thr 


Pro 




4 50 










455 










460 










Gin 


He 


Glu 


Val 


Thr 


Phe 


Asp 


He 


Asp 


Ala 


Asn 


Gly 


He 


Leu 


Asn 


Val 


465 










470 










475 










480 


Thr 


Ala 


His 


Pi£p 


Lys 


Ala 


Thr 


Gly Lys 


Lys 


Gin 


Asn 


lie 


Arg 


He 


Glu 










485 










490 










4 95 




Ala 


Ser 


Ser 


Gly 


Leu 


Ser 


Asp 


Asp 


Glu 


He 


Lys 


Arg 


Met 


Lys 


Glu 


Glu 








500 










505 










510 






Ala 


Gin 


Ala 


Ar>n 


Ala 


Glu 


Ala 


Asp 


Lys 


Lys 


Glu 


Lys 


Glu 


Axg 


He 


Asp 






515 










520 










525 








Lys 


lie 


Asn 


Gin 


Ala 


Asp 


Ser 


Met 


lie 


the 


Gin 


Thr 


Glu 


Lys 


Gin 


Leu 




530 










535 










54 0 








Lys 


GlU 


L~u 


Cly 


Asp 


Lys 


Phe 


Pro 


Ala 


Asp 


Lys 


Lys 


Ala 


Pro 


He 


Asp 


54 5 










5 5n 










555 










560 


Thr 


Ala 


Leu 


Asp 


Lys 


Leu 


Lys 


Glu 


Ala 


His 


Lys 


Ala 


Gin 


Asp 


Val 


Ala 










565 










57 0 










575 




Ala 


Xle 


A.~p 


Thr 


Ala 


Met 


Ala 


Glu 


Leu 


Gin 


Thr 


Ala 


Leu 


Ser 


Ala 


Ala 








580 










585 










5 90 






Gly 


Glu 


Glu 


Leu 


Tyr 


Lys 


Asn 


Ala 


Gly Ala 


Ala 


Gin 


Gly 


Gly Ala Gin 






595 










600 










605 








Pro 


Gil 


Pro 


Asp 


Phe 


Gly 


Gly 


Ala 


Gin 


Gly 


Pro 


Ser 


Ala 


Gly Asp 


Gin 




610 










615 










620 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



10 



340/490 

Plo Ser Asp Asp Ly* Asn Val Thr Asp Val Arp Phe Glu Glu Val Ly t 
630 635 6 4 0 

(2) INFORMATION FOR SEQ ID HO: A 05 

(I> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4<o amino acicis 

(B) TYPE: aitij.no acid 
(D) TOPOLOGY: linear 



Ui) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Poxphyromonas gingivals 

lix) FEATURE: 
« n (A) NAME/ KEY: misc featur- 

ZU tBJ LOCATION 1...44 9 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:405 

25 net Arg Tyr Asp Leu Ala He 13 e Gly Gly Gly Pro A la sly Tyr Thr 

Ala Ala Glu Arg Ala Ala Ly 5 Gly Gly Leu Ly* Thr Leu Leu lie Glu 

Lys A*n Ala Leu Gly Gly Val Cyn Itj Asn Glu Gly Cys iL Pro Thr 
On , 40 45 

Lys Thr Leu Leu Tyr Ser Ala ty» Val Leu His Gin He Ala Thr Ala 



55 60 
Ser Lv* Tyr Ala Val Ser Gly Thr Ala Asp Sly Leu Asp Leu Gly Ly» 



40 Glu Ala Gin Tyr Ly S Ala Ala Asn Leu Leu Leu Cy» Thr Gly Ser Giu 



35 Val He Ala Arg Ly- S Gly L y s He lie Arg Lys Leu Thr Ala Gly ??. 

90 os 
Arg Ser Arg Leu Thr Glu Ala Gly Val Glu lleL Val Thr Ala Glu Ala 

1 Q*~ i i n 

Thr Val Thr Gly Cys Asp Ala Asp Gly He Tie Gly lie Thr Ala Gly 

Gin 

Thr Phe 11= Pro Pro „« Pro Gly Val Glu Tnr Qlu Tyr ^ ^ 

45 A=„ Arg Glu Ala Leu Gin A*n Ly S Glu lie Pro Th£ Ser Leu Val He 

He Gly Gly Gly Val He Gly Met Glu Phe Ala Ser Phe Phe lit Gly 

t- 185 190 

He Gly Thr Gin Val His Val Val Glu Met Leu Pro Glu He Leu A*n 

50 Gly He A,p Pro Glu His Ala Ila Het Leu Ar, Ala ml Tyr Glu Ly» 
2« G1V 115 Ly£ Phs ^ u Gly His L " s Va * Thr Ser Val Arg A 3 n 

55 2* 01u TYr G ' u «1» Ser Ly s Glu He Glu My 

250 5>e;e: 

Glu Arg He Leu Met Ser Val Gly Arg Arg Pro Val Leu Gin Gly Phe 

Glu Ser Leu Gly Leu Val Leu Ala Gly Lys Gly Val Lys Thr Asn Gl u 

cn Z 5 280 ooc 

60 Arg Met ai„ Thr Ser Leu Pro Asn Val Tyr Ala Gly Asp He Thr 

Oly Phe Ser Leu Leu Ala Hi's Thr Ala Val Arg Glu Ala Glu Val Al* 

31 c Ton 

65 Val Asp Gin He Leu Gly Lys Thr Ar P Glu Thr^ Met Ser Tyr Ar, III 
Val Pro Gly Val Val Tyr Thr A S n Pro Glu Val Ala Gly Val ely Glu 
Thr Glu Glu Ssr Leu Arg Lys Ala «y Arg Ala Tyr Thr vfl Arg Arg 

70 Leu Pro Ala Phe Ser Gly Arg Phe Val Ala Glu lit Glu Gin Gly 

Asn Gly Glu Cyr Lys Leu Leu Lex, A^p G lu Glu A^n Arg Lsu II- Gl" 
390 395 J 

75 ^ U ?ij Fr ° Ala He Val Thr Ala ill 

405 410 41£ 
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list Ala lie Glu Thr Gly Met Thr Asp Arg Gin lis Glu Arg lie He 

420 425 430 

Phe Pro Kis Pro Thr Val Gly Glu He Leu Lys Glu Thr Leu Ala Gly 
43S 440 445 

Gly 



(2 1 IUFORllATIOtl FOR SEQ ID (10:406 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 amino acid.- 
IB) TYPE: amino acid 
(Dl TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( " i ) ORI GIUAL SOURCE : 

tA) ORGANISM: Porphyromcnas- gxngxvalis 

(ix) FEATURE: 

(A) II Ai IE/KEY : miscfeature 
t B ) LOCATION 1 ... 941 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4Q6 



lleL 


Glu 


LCU 


Lye 


Arg 


Phe 


Leu 


Ser 


Leu 


G^- 


Leu 


Leu 


Leu 


Val 


Gly 


Phe 


1 








5 










iO 










15 




11? 


Pro 


list 


Lys 


Leu 


Ser 


Ala 


Gin 


Gin 


Ala 


Gill 


Pro 


Leu 


Pro 


Thr Asp 








20 










25 










30 






Pro 


Ala 


Val 


Arg 


Val 


Gly 


Lys 


Leu 


Asp 


Asn 


Gly 


Leu 


Thr 


Tyr 


Phe 


He 






35 










40 










45 








Arg 


His 


Asn 


Glu 


Asn 


Pro 


Lys 


Asp 


Arg 


Ala 


Asp 


Phe 


Phe 


He 


Ala 


Gin 




50 










55 










60 










Lys 


Val 


Gly 


Ser 


He 


Leu 


Glu 


Glu 


Asp 


Ser 


Gin 


Ser 


Gly 


Leu 


Ala. 


His 


65 










70 








75 








80 


Phe 


Leu 


Glu 


His 


Met 


Ala 


Phe 


Asn 


Gly 


Thr. 


Lys 


Arn 


Phe 


Pro 


Gly Lys 










85 










90 








95 




Asi\ 


L-u 


He 


Asn 


Tyr 


Leu 


Glu 


Thr 


He 


Gly 


Val 


Arg 


Phe 


Gly Gin Aen 








100 










105 










110 






Leu 


Asn 


Ala 


Ser 


Thr 


Gly 


Phe 


Asp 


Lys 


Thr 


Glu 




Thr 


He 


Met 


Asp 






lis 










120 








125 








Val 


Fro 


Thr 


Thr 


Arg 


Gin 


Gly 


He 


He 


Asp 


Ser 


Cys 


Leu 


Leu 


He 


Leu 




130 










135 








140 










Hi.- 


A~p 


Trp 


Ser 


Asn 


Am 


He 


Thr 


Leu 


Asp 


Gly 


His 


G]u 


He 


Asp 


Glu 


145 










150 










155 










160 


Glu 


Arg 


Gly 


Vr,l 


He 


Gin 


Glu 


Glu 


Trp 


Arg 


Ala 


Arg 


Arg 


Asp 


Ala 


Asn 










165 










170 










175 




Leu 


Arg 


Met 


Phe 


Glu 


Ala 


He 


Leu 


ALa 


Lys 


Ala 


Met 


Pro 


Gly Asn 


Lye 








ieo 










185 








190 






Tyr 


Ala 


Glxi 


Arg 


Het 


Pro 


He 


Gly 


Leu 


Ilet 


Asp 


Val 


Val 


Leu 


Asn 


Phe 






195 










200 










205 








Lys 


hijt 


Asp 


Glu 


Leu 


Arg 


Asn 


Tyr 


Tyr 


Lys 


Lys 


Trp 


Tyr 


Arg 


Pro Asp 




210 










215 










220 










Lsu 


Gin 


Gly 


Leu 


Val 


He 


Val 


Gly 


Asp 


He 


Asp 


Val 


Asp Tyr 


Val 


Glu 


225 










230 










235 










240 


Asn 


Lys 


Ila 


Lys 


Glu 


Lsu 


Phe 


Lys 


Asp 


Val 


Pro 


Ala 


Pro 


Vai 


Asn 


Pro 










245 










250 










255 




Ala 


Glu 


Arg 


He 


Tyr 


Thr 


Pro 


Val 


Glu 


Asp 


Asn 


Asp 


Glu 


Pro 


He 


Val 








260 










265 










270 






Ala 


He 


Ala 


Thr 


A*»p 


Ala 


Glu 


Ala 


Thr 


Thr 


Thr 


Gin 


Leu 


Ser 


He 


Ser 






275 










280 










2S5 








Ph» 


Ly~ 


Ser 


Asp 


Fro 


Thr 


Pro 


Gin 


Glu 


Vai 


Arg 


Gly 


Ser 


He 


Phe 


Gly 




290 










295 








300 










Leu 


Val 


Glu 


Asp 


Tyr 


Het 


Lys 


Gin 


Val 


He 


Thr 


Thr 


Ala 


Val 


Asn 


Gi j 


305 










31 0 










315 










320 


Arg 


I.sn 


Ser 


Glu 


He 


Thr 


His 


Lys 


Pro 


Asn 


Ala 


Pro 


Phe 


Leu 


Ser 


Ala 










325 










330 










335 




Gly 


Ala 


Phe 


Phe 


Ser 


Asn 


Phe 


Het 


Tyr 


He 


Thr 


Gin 


Thr Lys 


Asp 


Ala 








340 










345 










350 






Phe 


Asr. 


Phe 


Val 


Ala 


Thr 


Val 


Arg 


Glu 


Gly 


Glu 


Ala 


Glu 


Lys 


Ala 


Het 






355 










360 








365 








Asn 


Ala 


Leu 


Val 


Ala 


Glu 


He 


Glu 


Ser 


Ltu 


Arg 


Gin 


Phe 


Gly lie 


Thr 




37 0 










37 5 










3fi0 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



Lys 


Gly 


Glu 


Tyr 


Asp 


Arg 


Ala 


Arg 


Thr 


A^n 


Val 


Leu 


Lys 


Arg 


Tyr 


Glu 


365 










390 










3 95 










4 00 


Asn 


Gin 


Tyr 


Asn 


Glu 


Arg 


Asp 


Lys 


Arg 


Lys 


Asn 


Asn 


Ala 


Tyr 


Ala 


Asn 










405 










410 










415 




Glu 


Tyr 


Ser 


Thr Tyr 


Phe 


Thr 


Asp Gly Gly Tyr 


He 


Pro 


Gly 


He 


Glu 








420 










425 










430 






Val 


Glu 


Tyr 


Gin 


Thr 


Val 


Asn 


Ala 


Phe 


Ala 


Pro 


Gin 


Val 


Pro 


Leu 


Glu 






435 










440 










44b 








Ma 


Phe 


A?n 


Gin 


Ala 


He 


Ala 


Gin 


Met 


Ilr 


Asp 


Pro 


Val 


Lys 


Asn 


Ala 




450 










455 










460 










Val 


Val 


Thr 


Leu 


Thr 


Gly 


Pro 


Ser 


Lys 


Ala 


Glu 


Ala 


Lys 


He 


Pro 


Ser 


465 










470 










475 










480 


Glu 


Ala 


Asp 


Phe 


Leu 


Ala 


Aid 


Phe 


Lys 


Ala 


AT a 


Arg 


Gin 


Gin 


Lys 


Val 










485 










4 90 










4 95 




Glu 


Ala 


Lys 


Lys 


Asp 


Glu 


Val 


Ser 


Asp 


Gin 


Lys 


Leu 


Met 


Glu 


Lys 


Ala 








soo 










505 










510 






Pro 


Lys 


Ala 


Gly Lys 


He 


Val 


Ser 


Glu 


Lys 


Lye 


A^p 


Gin 


Lys 


Phe 


Gly 






515 










520 










525 








Thr 


Thr 


Glu 


Leu 


Thr 


Leu 


Ser 


Asn 


Gly 


He 


Lys 


Val 


Tyr 


Leu 


Lys 


Lys 




530 










535 










540 






Thr 


A£p 


Phe- 


Lys 


Ser 


Asn 


Glu 


He 


Leu 


»et 


S~r 


Ala 


Leu 


Ser 


Pro 


Gly 


5-15 










550 










555 










560 


Gly 


He 


Leu 


Ser 


Gly 


Ly^ 


His 


Ala 


Pro 


Asn 


Gin 


Ser 


Val 


Met 


Asn 


Ser 










565 










570 










575 




Ph- 


J let 


Asn 


Val 


Gly 


Gly 


Leu 


Gly A-?n 


Phe 


Asp 


Ala 


He 


Gin 


Leu 


Arp 








500 










585 










5 90 




Lye 


Val 


Leu 


Thr 


Gly Arg 


Ser 


Ala 


Ser 


Val 


Scr 


Pro 


S#»r 


Leu 


Ser 


Leu 






595 










SOO 










605 








I. en 


Ser 


Glu 


Gly 


Leu 


Set: 


Gly 


Lys 


Thr 


Thr 


Val 


Glu 


Asp 


Met 


Glu 


Thr 




610 










615 










620 










Phe 


Phe 


Gin 


Leu 


He 


Tyr 


Leu 


Gin 


Met 


Thr 


Aid 


Asn Arg 


Lys 


Asp 


Pro 


625 










630 










635 








640 


Glu 


Ala 


Phe 


Lys 


Ala 


The 


Gin 


Glu 


Lys 


Leu 


Tyr 


Asn 


Asn 


Leu 


Lys 


Asn 










64 5 










650 










655 




Gin 


Glu 


Aid 


Asn 


Pro 


Met 


Ala 


Ala 


Leu 


Met 


Asp 


Ser 


lie 


Arg 


His 


Thr 








660 










665 










670 






Met 


Tyr 


Gly 


Asp 


Asn 


Fro 


Met 


Met. 


Lys 


Pro 


H-t 


Lys 


Ala 


Ala 


Asp 


Val 






675 










680 










6BS 






Glu 


Lys 


Val 


Asn 


Tyr As-p 


Gin 


Val 


Met 


Ala 


Phe 


Tyr 


Asn 


Glu 


Arg 


Phe 




690 










695 










700 










Ala 


Asp 


Ala 


Gly Asp 


Phe 


Met 


Phe 


Phe 


Phe 


He 


Gly Asn 


Leu Asp 


Glu 


70S 










710 










715 










720 


Ala 


Lys 


net 


Lys 


Pro 


Leu 


He 


Glu 


Thr 


Tyr 


Leu 


Ala 


Ser 


leu 


Pro 


Asn 










725 










730 










735 




Leu 


Ly3 


Arg 


Gly Asp 


Lys Met 


Asn 


Lys 


Ala 


Gin 


Val 


Pro 


Ala 


Ala 


Arg 








740 










745 










7 50 






Ser 


Gly 


Lys 


He 


Asp 


Cys 


Lys 


Phe 


Glu 


Lys 


Glu 


Met 


Asp 


Thr 


Pro 


Ser 






755 










7 60 










765 








Thr 


Thr 


He 


Phe 


Asp 


Val 


Val 


Ser 


Gly Asn Val 


Glu 


Tyr 


Thr 


Leu 


Lys 














775 










780 






Asn 


Sei 


Leu 


Leu 


Leu 


Glu 


Val 


Phe 


Ser 


Ala 


Val 


M«t 


Asp 


Gin 


Val 


Tyr 


785 










790 










795 










800 


Thr 


Ala 


Thr 


Val 


Arg 


Glu 


Lys 


Glu 


Gly 


Gly Ala 


Tyr 


Ser 


Val 


Ala 


Ala 










805 










810 










815 




Phe 


Gly 


Gly 


Leu 


Glu 


Gin 


Tyr 


Pro 


Gin 


Pro 


Lys 


Ala 


Leu 


Met 


Gin 


He 








820 










825 










830 






Tyc 


Phe 


Pro 


Thr Asp 


Pro 


Ala 


Arg 


Ala 


Glu 


Glu 


Met 


Am 


Ala 


11* 


Val 






835 










840 










845 








Phe 


Ala 


GJu 


Leu 


Glu 


Lys 


Leu 


Ala 


Lys 


Glu 


Gly 


Pro 


Asn 


Val 


Glu 


Tyr 




850 










055 










860 








the 


Lys 


Lys 


Thr 


He 


Glu 


A«n 


Leu 


Asn 


Lys 


Gin 


His 


Lys Glu 


Sec 


Leu 


865 










870 










□ 75 










880 


Arg 


GLu 


Asn 


Arg, 


Phe 


Trp 


Leu 


Glu 


Ala 


n*t 


Lys 


Ala 


Ser 


Phe 


Phe 


Glu 










BBS 










890 










895 




Gly 


Asn 


Asp 


Phe 


He 


Thr 


A.^p 


Tyr 


Glu 


Ser 


Val 


Leu 


Asn Gly 


Leu 


Thr 








900 










905 










910 






Pro 


Aid 


Glu 


Leu 


Gin 


Lys 


Phe 


Ala 


Ala 


Asp 


T,~u 


Leu 


Lys 


Gin 


Gin 


Asn 






915 










920 








925 








Arg 


Val 


Val 


Val 




Met 


Ala 


Pro 


Val 


Ala 


Lys 


Ala" 


Gin 










930 










935 










94 0 











(2) IHFORltATlOW FOR SZQ ID HO: 4 07 



ti) SEQUENCE CHARACTERISTICS: 
75 (A) LEttGTH: 68 4 amino acids 
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{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) IIOLECULE TYPC: protein 

fill) HYPOTHETICAL: YES 

(vlj ORIGINAL SOURCE: 

(A> ORGANISM: Porphyromonas ginyivdlis 

(ix) fEWURE: 

(a) NAME/ KEY: miscjeatuce 
(B) LOCATION 1...684 

|xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

Met Ser l,yr Lys Cly Thr lie Gly Val Thr Ser Arp flsn Tie Phe Pro 

15 10 IS 

Val lie Lys Lys Phe Lsu Tyr Ser Asp His Glu He Phe Leu Aty Glu 

20 25 30 

He Val Ser Asn Ala Val Asp Ala Thr Gin Ly*- Leu Lys Thr Leu Thr 

35 40 45 

Ser Val Gly Glu Phe Lys Gly Glu Thr Gly Asp Leu Arg Val Thr Vdl 

50 55 fin 

Ser Val Asp Glu Val Ala Arg Thr He Thr Val sir Asp Arg Gly Val 
65 70 75 eo 

Gly Wet Thr Glu Glu Glu Val Glu Lys Tyr lie Asn Gin II- Ala Ph- 
85 90 q S 

Scr Scr A1 * Glu Ph - Leu Glu Lys Tyr Lys Asp As P Lys Ala Ala 

100 io5 110 

He He Gly His Phe Gly Lou Gly Phe Tyr Ser Ala Phe Met Val Ser 

115 120 
Glu Arg Val Asp Val He Thr Arg Ser Phe Arg Glu Asp Ala Thr Ala 

13° 135 li0 

Val Lys Trp Ser Cys Asp Gly Ser Pro Glu Tyr Thr Leu Glu Pro Ala 



145 15 0 



155 i6o 



Asp Lys Ala Asp Arg Gly Thr Asp He Val Met His He Asp Glu Glu 

165 170 175 

Asn Ser Glu Phe Leu Lys Lys Glu Lys He Glu Gly Leu Leu Gly Lys 

100 185 190 

Tyr Cys Lys Phe Leu Thr Val Pro He H fi Phe Gly Lye Lys Gin Glu 

195 200 205 

Trp Lys Asp Gly Lys Met Gin Asp Thr Asp tilu Asp Asn Gin He Asn 

2'° 215 2^0 

Asp Thr His Pro Ala Trp Thr Lys Lys Pro Ala Asp Lou Lys Asp Glu 
r Z 230 235 240 

Asp Tyr Lys Glu Phe Tyr Arg Ser Leu Tyr Pro Ilet Ser Glu Glu Pro 

245 250 25S 

Leu Phe Trp He His Leu Asn Val Asp Tyr Pro Fhe Asn Leu Thr Gly 

260 265 270 

He Leu Tyr Phe Pro Lys He Lys Asn Asn Leu Asp Leu Gin Arg Asn 

27j 280 285 

Lya. He Gin Leu Tyr Cys Asn Gin Val Tyr Val Thr Asp Glu Val Gin 

290 2S5 300 
Gly He Val Pro Asp Phe Leu Thr Leu Leu His G] y Val He Asp Ser 
J 05 310 



315 320 



Pre Asp He Pro Leu Asn Val Ser Arq Ser Tyr Leu Gin S«r Asp Ala 

325 330 335 

Asn Val Lys Lys He Ser Ser His He Thr Lys Lys Val Ala Asp Arg 

340 345 
Leu Glu Glu He Phe Lys Asn Asp Arg Pro Thr Phe Glu Glu Lys Trp 

355 36C 365 

Asp ser Leu Lys Leu Pho Val Glu Tyr Gly Met Leu Thr Asp Glu Lyr. 

370 375 3oo * 

Phe Tyr Glu Arq Ala Ala Lys Phe Phe Leu Phe Thr Asp Met Asp Gly 
t 390 395 400 

His Lys Tyi Thr Phe Asp Glu Tyr Arg Thr Leu Val Glu Gly Val ^ln 

4( >5 410 415 

Thr Asp Lys As P Gly Gin Val Val Tyr Leu Tyr Ala"Thr Asp Lys His 

420 425 430 

Gly Gin Tyr Scr His Val Lys Arg Ala Ser Asp Lye Gly Tyr S~r Val 

435 440 445 

Ilet Leu Leu Asp Gly Gin Leu Asp Pro His He Val Ser Leu Leu Glu 

450 455 4 60 

Gin Lys Leu Glu Lys Thr Hit- Phe Val Arg Val Asp Ser Asp Thr lis 
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4 60 


A 


n 




lie 




Lys? 


u 


Glu 


Arg 




Glu 


"n 1 


Lys 


Leu 


Ser 


Arp 




















4 90 














Thr 


Glu 


Arci 


Al a 


Thr 


Leu 


Val 


Lys 


r »u 




Glu 


Al a 


Arg 


Leu 


Pro 


Arg 








SOO 










505 










510 






Asp 


Glu 


L j .= 


Lys 


Hi 3 


The 


Asn 


Val 


Aid 


Phe 


Glu 


Ser 


Leu 


Gly 


Ala 


Glu 






51S 










520 










525 








Gly 


Glu 


Ala 


He 


Leu 


He 


Thr 


Gin 


Ala 


Glu 


Phe 


Het 


Arg 


Arg 


Met 


Arg 




530 










535 










540 










Asp 


Ilet 


Al « 


ain 


Leu 


Gin 


Pro 


Gly 


Met 


Sor 


Phe 


Tyr 


Gly 


Glu 


Leu 


Pro 


5<5 










550 










555 










560 


Asp 


i"sr 


i'yr 


Aim 


Leu 


Val 


Leu 


Asn 


Thr 


Asp 


His 


Pro 


Leu 


He 


Asp 


Arg 










56b 










570 










575 




Val 


Leu 


Ser 


GlV 


Glu 


Lys 


Glu 


Ser 


Vsl 


Glu 


Pro 


Ser 


Leu 


Thr 


Glu 


Leu 








seo 










585 










590 






Arg 


Ala 


Lys 


lie 


Ala 


Glu 


Leu 


Lys 


Al3 


Glu 


Glu 


Ala 


Lys 


Leu 


Leu 


Asp 






5?S 










6G0 










605 








Glu 


Glu 


Lys 


Gly 


Lys 


Lys 


Pro 


Glu 


Glu 


Tie 


Pro 


Val 


Ala 


Thr 


Lys 


Glu 




610 










615 










620 










Ala 


Lys 


Glu 


Asn 


Asn 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


Thr 


Glu Gly 


Ser 


He 


625 










630 










635 










640 


Asn 


Arp 


Clti 


Leu 


Thr 


Lys 


Tyr 


Ala 


Gin 


Asp 


Asn 


Glu 


Leu 


He 


Gly 


Gin 










645 










650 










655 




Lsu 


lie 


Asp 


Leu 


At a 


Leu 


Leu 


Gly 


Set 


Gly 


Lou 


Leu 


Thr 


Gly 


Glu 


Ala 
















6fc5 








670 






Leu 


Ala 


Clu 


t'he 


He 


Aro 


Arg 


Sor 


Gin 


Arg 


Leu 


Leu 














67 5 










630 

















\2) INFORMATION FOR SEQ. ID NO: 4 OS 

Hi SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 

(B) TYPE: amino acid 
tn) TOPOLOGi : linear 

(ii) MOLECULE TYPE : protein 

(Hi) HYPOTHETICAL: YES 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

Us) FEATURE: 

(A) NAME /KEY: misc_f ea ture 
IB) LOCATION 1 ... 464 



Ui) SEQUENCE DESCRIPTION: SEQ ID HO: 4 08 



Met 


Glu 


Lys 


Leu 


He 


Asp 


He 


Leu 


Val 


Val 


Asp 


Asp 


Asp 


Val 


Ala 


Val 


1 








S 










10 










15 




Cys 


Aid 


Ala 


Leu 


Arg 


Leu 


Val 


Leu 


Lys 


Arg 


Ala 


Gly Tyr Asn 


Pro 


Val 








20 










25 










30 






He 


Ala 


Asn 


Ser 


Pro 


Asp 


Glu 


Ala 


Leu 


Ser 


lie 


Met 


Arg 


Asn 


Pro 


Asp 






35 










40 










45 








Gly 


Gly 


C*s 


Lys 


Pro 


Ala 


Val 


Tie 


Leu 


Met 


Asp 


Met 


Asn 


Phe 


Ser 


Leu 




50 










55 










60 










Ser 


Thr 


Set 


Gly 


Arg 


Glu 


Gly 


Leu 


Glu 


Leu 


Leu 


Glu 


Lys Met 


Gin 


He 


65 










70 










75 










90 


Phe 


Thr 


Ser 


Cys 


Pro 


Val 


He 


Leu 


Met 


Thr 


Ala 


Trp 


Ala 


Ser 


He 


Pro 










85 










90 










95 




Leu 


Ala 


Val 


Glu 


Gly 


Met 


Arg 


Leu 


Gly 


Ala 


Phe 


Asp 


Phe 


He 


Gly 


Lys 








100 










105 










110 






Pro 


Tip 


A^p 


Asn 


Asp 


Arg 


Leu 


Leu 


Arg 


Thr 


He 


Asp 


Thr 


Ala 


Leu 


His 






115 










120 








12S 








Leu 


Ala 


Ala 


Pro 


Ser 


Ala 


Val 


Ala 


Asn 


Pro 


Ser 


Glu 


Gin 


Ser 


Asp Arg 




130 










135 










140 










Arp 


Thr 


Ala 


Arg 


Gin 


Pro 


Lys 


Ala 


Thr 


Vai 


Gin 


Glu 


Asn 


Asp 


Pro 


Cyr 


145 










150 










155 










160 


Ala 




He 


He 


Gly 


Arg 


Ser 


Asp 


Ala 


He 


Cys 


hy^r 


'He 


Lys 


Glu 


Arg 










16b 










170 










] 75 




He 


Arg 


Arg 


lie 


Ala 


Pro 


Thr 


Hin 


Ala 


Ser 


Val 


Leu 


He 


Tiir 


Gly Glu 








180 










185 










150 






Ser 


Gly 


Thr 


Gly 


Lys 


Glu 


Leu 


He 


Ala 


Glu 


Ala 


Leu 


His 


Arg Gly 


Ser 






195 










200 










205 








Lys 


Arg 


Ala 


Ser 


Ala 


Pro 


Phe 


Val 




Val 


Asn 


L-n 


Gly 


Gly 


He 


Pr? 



SUBSTIT U T E SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



345/490 





210 










215 










220 










Gl U 


Ser 


Leu 


Phe 


Glu 


Sec 


Glu 


Leu 


Phe 


Gly 


Hii 


Lys 


Lyc 


Gly 


Ala 


Phe 


225 










230 










235 










240 


Thr 


Asn 


Ala 


Phe 


Ser 
245 


Asp 


Arg 


Lys 


Gly 


Arg 
250 


Pho 


Glu 


Leu 


Ala 


Asp 
255 


Gly 


Gly 


Thr 


He 


Phe 


Leu 


Asp 


Glu 


He 


Gly 


Glu 


Leu 


Pro 


Val 


Gly Asn 


Gln 
















265 










270 






Val 


Lys 


Leu 


Leu Arg 


Val 


Leu 


Gin 


Glu 


Gin 


Thr 


Phe 


Glu 


Pro 


Leu 


Gly 






27S 










200 










285 








Gl u 


Ser 
290 


Val 


Ser 


His 


Arg 


Val 

295 


APp 


He 


Arg 


Val 


Val 

300 


Ser 


Ala 


Thr 


Asn 


Ala 


Ser 


L*=u 


Glu 


Arg 


Met 


Val 


Ala 


Glu 


Gly 


Arg 


Phe 


Arg 


Glu 


At*p 


Leu 


305 










310 










315 










320 


Tyr 


Tyr 


Arg 


He 


Asn 
325 


Leu 


He 


His 


Leu 


His 
330 


Leu 


Pro 


Pro 


Leu 


Arg 
335 


Glu 


Arg 


Gin 


Glu 


A-p 


Tie 


Gin 


Leu 


Leu 


Val 


Glu 


Ala 


Phe 


Ser 


Glu 


Ala 


Phe 






34G 










345 










350 






Ala 


Gin 


i'ar 
355 


Asn 


Gly 


Leu 


Pro 


His 

360 


Ala 


Val 


Trp 


Ser 


Ala 
365 


Glu 


Ala 


Met 


Arg 


Arg 
37 0 


He 


Cys 


Ala 


Met 


Pro 

37S 


Leu 


Pro 


Gly 


Asn 


Val 
380 


Arg 


Glu 


Lsu 


Lys 


Asn 


Val 


Val 


Glu 


Arg 


Thr 


Leu 


Leu 


Leu 


Ser 


Gly 


Ser 


Arg 


Glu 


He 


Ser 


385 










390 










395 










4 00 


Ala 


Arg 


Asp 


Val 


Ala 
4X)5 


Asp 


Phe 


Gly 


S«r 


Gin 
410 


Val 


Thr 


Ale 


Ala 


Asp 
415 


tUs 


Ser 


Asp 


Glu 


Arg 
420 


Ala 


Lsu 


Thr 


Asp 


Met 
425 


Glu 


Glu 


Ala 


Ala 


He 
430 


Arg 


Glu 


Thr 


Leu 


Thr 
435 




Tyr 


Asn 


Gly 


Asn 
440 


Val 


Ser 


Arq 


Ala 


Ala 
445 


Arg 


Ala 


Lsu 




Leu 
450 


Ser 


Arg 


Ala 


Ala 


Leu 
455 


Tyr 


Arg 


Arg 


Met 


Glu 
460 


Lys 


Tyr 


Gly 


Leu 



(2) INFORMATION FOR SEQ ID HO: 4 09 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 250 amino acids 
[Q) TYPE: amino acid 
(0| TOPOLOGY: linear 

(li) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

{A) ORGAN I SIi: Forphyromonas gingivalis 

(ix) FEATURE: 

(A) IIAIIE/KEY: mirc_f eat ure 
(BJ LOCATION 1. . .250 



(xH SEQUENCE DESCRIPTION: SEQ ID NO: 409 



Het 


Leu 


Lys 


He 


I. ye 


Asn 


Leu 


His 


Ala 


Thr 


Val 


Gin 


Gly 


Lys 


Glu 


Tie 


1 








5 










10 








15 




Leu 


Lys 


Gly 


He 


Asn 


Leu 


Glu 


He 


Asn 


Ala 


Gly 


Glu 


He 


His 


Ala 


He 








20 










25 










30 






Het 


Gly 


Pro 


Asn 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 


Leu 


Ser 


Ser 


Val 


Leu 


Val 






35 










40 










45 








Gly 


His 


Pro 


Ser 


Phe 


Glu 


Val 


Thr 


Glu 


Gly 


Glu 


Val 


Thr 


Phe 


Asn 


Gly 




50 










55 










60 








He 


Asp 


Leu 


Leu 


Glu 


Leu 


Glu 


Pro 


Glu 


Glu 


Arg 


Ala 


His 


Leu 


Gly 


Leu 


65 










70 










75 








SO 


Ph* 


Leu 


Ser 


Phe 


Gin 


Tyr 


Pro 


Val 


Glu 


He 


Pro 


Gly Val 


Ser 


Met 


Val 










85 










90 










95 




Asn 


Phe 


list 


Arg 


Ala 


Ala 


Val 


Asn 


Glu 


His 


Axg 


Lys 


Ala 


He 


Gly Ala 








100 










105 










110 






Gl u 


Pro 


Val 


Ser 


Ala 


Ser 


Asp 


Phe 


Leu 


Lys 


Met 


Het 


Arg 


Glu 


Lys 


Arg 






315 










120 










125 






Ala 


He 


Val 


Glu 


Leu 


Asp 


Asn 


Lys 


Leu 


Ala 


Ser 


Arg' 


'Ser 


Val 


Asn 


Glu 




130 










135 










140 










Gly 


Phe 


Ser 


Gly 


Gly 


Glu 


Lys 


Lys 


Arg 


Asn 


Glu 


He 


Phe 


Gin 


Het 


Ale* 


145 










150 










155 










160 


Het 


Leu 


Glu 


Pro 


Lyr 


Leu 


Al« 


He 


Leu 


Asp 


Glu 


Thr 


Asp 


Ser 


Gly 


Leu 










3 65 










170 










175 




Asp 


He 


Asp 


Ala 


Lou 


Arg 


lie 


Val 


Ala 


Gly 


Gly 


Val 


Asn 


Arg 


Leu 


Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



346/ 490 







180 








165 










190 






Ser 


Pro Glu 


Asn 


Ala 


Ala 


lie Val 


Ile 


Thr 


HI* 


Tyr 


Gin 


Arg 


Leu 


Leu 




195 








200 










205 








Glu 


Tyr He 


Lys 


Pro 


Asp 


Phe Val 


His 


Val 


Leu 


Tyr 


Lys 


Gly 


Arg 


r j c 




210 








215 








220 










Val 


Lys Ser 


Gly 


Gly 


Ala 


Glu Leu 


Ala 


Lpu 


Thr 


Leu 


Glu 


Glu 


Lys 


Gly 


225 








230 








235 










240 


Tyr 


Asp Trp 


Ile 


Lys 


Glu 


Glu Ile 


Gly 


Glu 




















24b 








250 














(2) 


III FORI (ATIOH 


FOR 


SEQ 


ID HO:410 

















ti> SEUUfcJJCE CHARACTERISTICS: 

(A) LEIJGTH: 4 61 amino acids 
(B\ TYPE: amino acid 
(D) TOPOLOGY: linear 



tli) nOLDCULE TYPE: protein 

liix) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{ A) ORGANISM: Porphyromonae ginqivalis 

Uxl FEATURE; 

(A) HA! IE /KEY : raise feature 
tB) LOCATION' 1- . .461 



(xi) SEQUEUCE DESCRIPTION: SEQ ID NO: 410 



Met 


Al* 


Lys 


Glu 


J.ys 


Thr 


lie 


Tyr 


Val 


Cys 


Arg 


Ser 


cys 


Gly Thr 


Lys 


1 








5 










10 










15 




Tyr 


Ala 


Ly3 


Trp 


Gin 


Gly 


Asn 


Cys 


A?n 


Ala 


Cys 


Gly 


Glu 


Trp 


Ann 


Cys 








20 










25 










30 






lie 


Asp 


Glu 


Glu 


Lys 


Val 


Pro 


Ala 


Pro 


Ala 


Ser 


Gly 


Lys 


Hie 


Ala 


Ala 






35 










40 










45 








Lys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro 


Arg 


Leu 


Leu 


Gin 




50 










55 










60 










Asp 


Val 


Glu 


Ser 


Gly 


Asp 


Glu 


Glu 


Arg 


Ile 


Arg 


Leu 


Gly 


Asp 


Glu 


Glu 


65 










70 










75 










ao 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


lie 


Val 


Lys 


Gly Ala 


Phe 


Val 


Leu 










05 










90 










95 




Leu 


Gly 


Gly Glu 


Pro 


Gly 


Ile 


Gly 


Lys 


Ser 


Thr 


Leu 


Ile 


Leu 


Gin 


Thr 








100 










105 










110 






Val 


Leu 


Arg 


Leu 


Pro 


Gln 


Leu 


Arg 


Thr 


Leu 


Tyr 


Val 


Ser 


Gly 


Glu 


Glu 






115 










120 










125 








Ser 


All 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu 


Gly 


Gin 


Ala 


Met 




130 










135 










140 










Asn 


Gly 


Cys 


Tyr 


Vol 


Tyr 


Cy- 


Glu 


Thi 


Asn 


Ile 


Glu 


Arg 


lie 


Leu 


Ser 


145 










150 










155 










160 


Arg 


Ala 


Glu 


Glu 


LeU 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


lie 


Asp 


Ser 


Ile 


Gin 










165 








170 










175 




Thr 


Val 


Tyr 


Thr 


Glu 


Glu 


Met 


Glu 


Ser 


Ser 


Ala 


Gly 


Ser 


Val 


Gly 


Gin 








180 










105 










190 






lie 


Arg 


Glu 


C^'s 


Ala 


Ala 


Leu 


Leu 


Leu 


Lys 


Tyr 


Cys 


Lys 


Thr 


Thr 


Gly 






195 










200 










205 








Ile 


Pro 


Val 


lie 


Val 


Ile 


Gly 


His 


Ile 


Thr 


Lys 


Glu 


Gly 


Ser 


lie 


Ala 




210 










215 










220 










Gly 


Pro 


Lys 


Val 


Leu 


Glu 


His 


Tie 


Val 


Asp 


Thr 


Val 


Leu 


Leu 


Phe 


Asp 


225 










230 










235 










240 


Gly Asp 


Lye 


His 


His 


Leu 


Tyr 


Arg 


Ile 


Leu 


Arg 


Gly Gin 


Lys 


Asn 


Arg 










245 










250 










255 




Tyi 


Gly 


Ser 


Thr 


Ser 


Glu 


Lsu 


Gly 


Tie 


Tyr 


Glu 


Met 


Arg 


Gin 


Asp Gly 








260 










265 










270 






Leu 


Arg 


Gly Val 


Glu 


Asn 


Pro 


Ser 


Glu 


His 


Leu 


lie 


Thr 


Arg 


Asn 


Arg 






275 










280 










285 








Glu 


Asp 


Leu 


Ser 


Gly 


Ile 


Ala 


Tl = 


Ala 


Val 


Ala 


Het 


Glu 


Gly 


Ile 


Arg 




290 










295 










300 










Pro 


lie 


Leu 


11- 


Glu 


Ala 


Gin 


Ala 


Leu 


Val 


Ser 


Ser 


Ala 


lie 


Tyr 


Ala 


305 










310 










315 








320 


Asn 


Pro 


Gin -Arg 


Ser 


Ala 


Thr 


Gly 


Phe 


Asp 


Ile 


Arg 


Arg 


(let- 


Asn 


Met 










325 










330 










335 




Leu 


Leu 


Ala 


Val 


Leu 


Glu 


Lys 


Arg 


Ala 


Gly 


Phe 


Lys 


Leu 


He 


Gin 


Lys 








340 










345 










350 






Asp 


Val 


Phe 


Leu 


Am 


lie 


Ala 


Gly 


Gly 


Ile 


Lys 


lis 


Ala 


Asp 


Pro 


Ala 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



347/ 490 







355 










360 










365 








Thr 


Asp 
370 


Leu 


Ala 


Val 


no 


Ser 
375 


Ala 


Val 


Leu 


Ala 


Ser 
300 


Ser 


Leu 


Asp 


Tie 


Val 


lie 


Pro 


Pro 


Ala 


Val 


Cys 


Met 


Thr 


Gly 


Glu 


Val 


Gly Leu 


Ser 


Gly 


385 










390 










395 










400 


Glu 


11* 


Arg 


Pre 


Val 
405 


Ser 


Arg 


lie 


Glu 


Gln 
410 


Arg 


lie 


Thr 


Glu 


Ala 
415 


Arq 


Arg 


11* 


Gly 


Phe 
4?0 


Lys 


Glu 


lie 


Leu 


Val 
425 


Pro 


Ala 


Asp 


Asn 


Phe 
430 


Arg 


Gin 


Glu 


Asp 


Al~ 


Gly Arg 


Phe 


Gly 


lie 


Arg 


Leu 


Val 


Pro 


Val 


Arg 


Lys 


Val 






4J5 










440 










44b 








Glu 


Glu 
450 


Ala 


Phe 


Arg 


His 


Leu 
455 


Phe 


Ser 


Lys 


Gly 


Arq 
460 


Glu 









\2) INFORMATION FOR SEQ TD NO: 4 11 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY : linear 

til J MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

Ivi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) IJAHF./KEY: misc feature 
tB) LOCATION 1...271 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 411 



Met 


Asn 


Ser 


Arg 


His 


Leu 


Thr 


He 


Thr 


He 


He 


Ala 


Gly Leu 


Scr 


Leu 


1 








5 










10 










15 




Phe 


Val 


Leu 


Thr 


Lsu 


Gly 


Gly 


Cys 


Ser 


Val 


Ala 


Gin 


Gin 


Asp 


Thr 


Gin 








20 










25 










30 






Trp 


Thr 


Leu 


Gly 


Gly 


Lys: 


Lou 


Phe 


Thr 


Ser 


Ala 


Trp 


lib 


Gin 


Arg 


Ser 






35 










40 










45 








Ala 


Glu 


Tyr 


Gin 


Ala 


Leu 


Cys 


He 


Gin 


Ala 


Tyr 


Asn 


He 


Ala 


Thr 


Glu 




50 










55 










60 










Arg 


Val 


Asp 


Ala 


Leu 


Pro 


Ala 


Glu 


Arg 


Lys 


Gin 


Gly 


Asp Arg 


Pro 


Tyr 


65 










70 










75 










eo 


Ala 


He 


Val 


Thr 


Asp 


He 


Arp 


Glu 


Thr 


He 


Leu 


Asp 


Asn 


Thr 


Pro 


Asn 










05 










90 










9b 




Ser 


Val 


Tyr 


Gin 


Ala 


Leu 


Arg 


Gly 


LVs 


Asp 


Tyr 


Asp 


Glu 


Glu 


Thr 


Trp 








100 










105 










110 






Gly 


Lys 


Trp 


Cys 


Ala 


Gin 


Ala 


Asp 


Ala 


Asp 


Thr 


Leu 


Ala 


Gly 


Ale 


Leu 






115 










120 










125 








Ser 


Phe 


Phe 


Leu 


His 


Ala 


Ala 


Asn 


Lye 


Gly 


He 


Glu 


Val 


Phe 


Tyr 


Val 




130 










135 










140 










Thr 


Asn 


Arg 


Arg 


Asp 


Asn 


Leu 


Arg 


Glu 


Ala 


Thr 


Leu 


Gin 


Asn 


Leu 


Gin 


145 










150 










155 










160 


Arg 


Tyr 


Gly 


Phe 


Pro 


Phe 


Ala 


Asp 


Glu 


Glu 


His 


Leu 


Leu 


Thr 


Thr 


His 










165 










17 0 










175 




Gly 


Pro 


Ser 


Asp 


Lys 


Glu 


Pro 


Arg 


Arg 


Leu 


Lys 


He 


Gin 


Glu 


Gin 


Tyr 








180 










185 










190 






Glu 


lie 


Val 


Leu 


Leu 


He 


Gly 


Asp 


Asn 


Leu 


Gly Asp 


Phe 


Mi** 


His 


Phe 






19b 










200 










205 








Phe 


Asn 


Thr 


Lys 


Glu 


Glu 


Ser 


Gly 


Arg 


Lys 


Gin 


Ala 


Leu 


Gly 


Leu 


Thr 




210 










215 










220 










Ala 


Gly 


Glu 


Phe 


Gly 


Arg 


His 


Phe 


He 


Met 


Leu 


Pro 


Asn 


Pro 


Asn 


Tyr 


225 










230 










235 










240 


Gly 


Ser 


Trp 


Glu 


Pro 


Ala 


Trp 


Tyr 


Gly 


Gly 


Lys 


Tyr 


Pro 


Pro 


Leu 


Pro 










245 










250 










255 




Glu 


Arg 


Asp 


Lys 


Ala 


Leu 


Lys 


Gin 


Leu 


Hie 


Ser 


01r\ 


Asn 


Ser 


Arg 










260 










26b 










270 




(2> 


INFORMATION 


FOR 


SEO 


ID NO: 412 

















ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



348/ 490 



PCT/AU98/01023 



MOLECULE TY PS: protein 

(iii) HYPOTHETICAL: YES 

(vil ORIGINAL SOURCE: 

(A> ORGANISM: Porphyromonas gingival is 

tixl FEATURE: 

{A) HA11E/KEY: misc_feature 
tBl LOCATIOM 1. . .417 



(xi) SF.QUE1ICE DESCRIPTION: SEQ ID NO: 412 



llet 


Ser 


Thr 


Asn 


lie 


Asp 


Val 


Gin 


Gin 


He 


Lys 


Gin 


Arg 


Phe 


Gly 


He 


1 








S 










io 








15 




lie 


Gly 


Soc 


Ser 


Pro 


Leu 


Het 


olU 


His 


Ala 


Tl« 


Arg 


Val 


Ala 


Ala 


Gin 








20 










2b 










30 






Val 


Ala 


Pro 


Thr 


Asp 


Met 


Ser 


Val 


Leu 


Val 


Thr 


Gly 


Glu 


Sor 


Gly 


Ser 






35 










40 










45 








Gly 


Lys 


Glu 


Phe 


Ph- 


Pro 


Gin 


lie 


11^ 


His 


Tyr 


Tyr 


Ser 


Ala 


Arg 


Lye 




50 










55 










60 










His 


His 


Ser 


Tyr 


lie 


Ala 


Val 


Asn 


Cys 


Gly 


Ala 


He 


Pro 


Glu 


Gly 


Thr 


65 










7Q 










75 










80 


lie 


Asp 


Ser 


Glu 


Leu 


Phe 


Gly 


His 


Arg 


Lys 


Gly 


Ser 


Phe 


Thr 


Gly 


Ala 










85 










90 










95 




Val 


Ssr 


Asp 


Ar:j 


Lys 


Gly 


Tyr 


Phe 


Glu 


Clu 


Ala 


Ser 


Gly Gly 


Thr 


Hp 








100 










105 










110 






Phe 


Leu 


Asp 


Glu 


Val 


Gly 


GlU 


Leu 


Pro 


Leu 


Pro 


Thr 


Gin 


Ala 


Arg 


Leu 






115 








120 










125 








Lou 


Arg 


Val 


Leu 


Glu 


Thr 


Gly 


Glu 


Phe 


He 


Pro 


Val 


Gly Ala 


Ser 


Gin 




130 










135 










140 










Ser 


Gin 


Lys 


Thr 


Asp 


Val 


Arg 


He 


Val 


Ala 


Ala 


Thr 


Asn 


Val 


Asn 


Leu 


145 










150 










155 










160 


Ly.«? 


Glu 


Ala 


Val 


Aid 


Asn 


Gly 


Lys 


Phe 


Arg 


Glu 


Asp 


Leu 


Phe 


The 


Arg 










16b 










no 










175 




Leu 


Asn 


Thr 


Val 


Pro 


lie 


Glu 


Val 


Pro 


Ala 


T.fU 


Arg 


Met 


Arg 


Pro 


Asp 








180 










105 










190 






Asp 


Val 


Pro 


Leu 


Leu 


Phe 


Arg 


Arg 


Phe 


Ala 


Ala 


Asp 


Ser 


Ala 


Glu 


Lys 






195 










200 










205 








Tyr 


Arg 


Met 


Pro 


Pro 


Leu 


Arg 


Leu 


Ser 


Asp 


Glu 


Ala 


Arg 


Thr 


He 


Leu 




210 










215 










220 










Met 


Arcf 


Tyr 


Arg 


Trp 


Pro 


Gly 


Asn 


Val 


Arg 


Glu 


Leu 


Arg 


Asn 


He 


Thr 


225 










230 










235 










240 


Asp 


Arg 


Leu 


Ser 


lie 


Leu 


Glu 


Glu 


Glu 


Arg 


Thr 


Val 


Ser 


Ala 


Glu 


Thr 










245 










250 










255 




lit* 


Thr 


Arg 


Tyr 


Leu 


Asp 


Ala 


Glu 


Gly 


Met 


Gin 


Asp 


Leu 


His 


Pro 


Val 








260 










265 










270 






Vol 


He 


Arg 


Arg 


Asn 


Glu 


Thr 


Thr 


Glu 


Ala 


Asp 


Lys 


Gin 


lie 


Pro 


His 






275 










280 










285 








Tyr 


Glu 


Arg 


Glu 


He 


Tie 


Tyr 


Gin 


Val 


Leu 


Tyr 


Ar,p 


Met 


Lys 


Lys 


Glu 




290 










295 










300 










He 


Ala 


Asp 


Leu 


Lys 


Gly 


Met 


Met 


Asn 


Arg 


Leu 


Ala 


His 


His 


Glu 


Gin 


305 










310 










315 










320 


Pro 


Ser 


Trp 


Pro 


V<il 


Gly 


Sec 


Asp 


Val 


Trp 


Gly 


Asn 


Asp Asp 


Lys 


Arg 










325 










330 










335 




The 


Ala 


Asp 


Pro 


Lys 


Trp 


Gly 


Val 


Ser 


Thr 


HXs 


Lys 


Ala 


Pro 


He 


Ala 








340 










345 










350 






Asn 


Ala 


Ala 


Glu 


Pro 


Vnl 


Glu 


Pro 


He 


Gin 


Glu 


Ala 


Ssr 


Clu 


Tyr 


Thr 






355 










360 










365 








Glu 


Asp 


Pro 


Val 


Ser 


Leu 


Glu 


Glu 


Val 


Glu 


Lys 


Lys 


Met 


He 


Ser 


Leu 




370 










37 5 










380 










Ala 


Leu 


Glu 


Arg 


Hie 


Gly 


G1 y 


ftrg 


Arg 


Lys 


Gin 


Thr 


Ala 


Glu 


Glu 


Leu 


385 










390 








395 










4 00 


Lys 


lie 


Ser 


GlU 


Arg 


Thr 


L«u 


Tyr 


Arg 


Lys 


He 


Lys 


Glu 


Tyr 


Gly 


Leu 



405 410 415 



Glu 



(2) IUFORI'tATlON FOR SEQ ID MO: 413 

fi) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 602 amino acids 
(B) TYPE: amino acid 
(DJ TOPOLOGY i linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



349/490 



til) IIOLECliLE TYPE: protein 

(iii) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Potphyromonas gingiva lis 

( ix ) FEATURE : 

(A) NAME/KEY : miscf eature 
{B\ LOCATION 1 . . .602 



iXi) SEQUENCE DESCRIPTION: SF.Q TD HO: 413 



Met 


lie 


Leu 


Leu 


Phe 


Gly 


Gly 


Thr 


Thr 


Glu 


Gly 


Ary 


Ala 


Ala 


Ala 


Ara 


1 








5 










10 










15 




Val 


Leu 


Asp 


GlU 


Ala 


Gly 


Ser 


Pro 


Phe 


Phe 


Tyr 


Ser 


Thr 


Lys 


Gly 


Asn 








20 










25 










30 




Leu 


Gin 


Glu 


He 


Gin 


Ser 


Ser 


His 


Gly 


His 


Ai g 


Leu 


Thr 


Gly 


Ala 


Met 






35 










40 










45 






Thr 


Val 


Ala 


Asp 


Met 


Vol 


Ser 


Ph~ 


Cya 


Ara 


Lys 


Glu 


Glu 


He 


Arg 


Leu 




50 










55 










60 








lie 


Val 


A^p 


Ala 


Ala 


Hir 


Pro 


Phe 


Ala 


Glu 


Glu 


Leu 


Hif 


Ala 


Ser 


Val 


65 










70 










75 










80 


Ala 


Glu 


Ala 


Ser 


Glu 


Gin 


Thr 


Gly 


Il~ 


Pro 


Val 


Val 


Arg 


Tyr 


Glu 


Aty 










85 










90 










95 


Gin 


Tyc 


Pro 


Pro 


Arg 


Glu 


G1X1 


Gly 


lie 


Val 


Trp 


Cys 


Ala 


Asn 


Tyr 


Asp 








100 










105 










110 




Thr 


Ala 


Ala 


Glu 


Arq 


Met 


Leu 


Gly 


Acp 


Gly 


Val 


Gin 


Arq 


Leu 


Leu 


HeL 






115 










120 








125 








Leu 


Thr 


Gly 


Val 


Asn 


Thr 


He 


Pro 


Lys 


Leu 


Ala 


Ala 


Phe 


Trp 


Lys 


Glu 




3 30 










135 










140 






Azg 


Thr 


Thr 


Phe 


Cys 


Arg 


He 


Leu 


Lys 


Arg 


Asp 


Glu 


Ser 


Val 


Ala 


Leu 


145 










150 










155 










160 


Ala 


Glu 


Lye 


Asn 


Gl y 


Phe 


Pro 


Ala 


Glu 


Arg 


He 


Val 


Phe 


Phe 


Glu 


Pro 










165 










170 










175 




His 


Ala 


A5-p 


Glu 


Glu 


Leu 


Met 


Gin 


Ala 


Val 


Arg 


Pro 


Asp 


Ala 


He 


He 








180 










185 








190 






Uhr 


Lys 


Glu 


Ser 


Gly 


Glu 


ser 


Gly 


Tyr 


Phe 


Arg 


Glu 


Lys 


He 


Glu 


Ala 






195 










200 










205 








Ala 


Arg 


Arg 


Met 


Gly 


He 


Arg 


Tie 


Tyr 


Ala 


Val 


Val 


Arg 


Pro 


Pro 


Leu 




210 










215 










220 










Pro 


Pro 


ser 


Phe 


He 


Pro 


Val 


Gly 


Gly 


Pro 


Val 


Gly 


Leu 


Arg 


Arg 


Ala 


225 










230 










235 








240 


Val 


Glu 


Arg 


Leu 


Val 


Pro 


Gly 


Phe 


Phe 


Ser 


Leu 


Arg 


Ser 


Gly 


Phe 


Thr 










245 










250 








255 




Thr 


Gly 


Thr 


Thr 


Ala 


Thr 


Ala 


Ala 


Val 


Vai 


Ala 


Ala 


Met 


Tyr 


Arg 


Leu 








260 










265 










270 




Ilet 


Gri- 


Leu 


Gly 


Ser 


Leu 


Ala 


Gl ti 


Ala 


Pro 


Val 


Glu 


Leu 


Pro 


Ser 


Gly 






275 










260 










285 








lilu 


lle 


Val 


Ser 


Leu 


Pro 


He 


Ala 


Glu 


He 


Arg 


Gl u 


Glu 


Glu 


Asp 


Ala 




2 90 










295 








300 








Val 


Val 


Ser 


Ala 


Val 


Leu 


Lys 


Asp 


Ala 


Gly 


Asp 


Asp 


Pro 


Asp 


Val 


Thr 


305 










310 










315 








320 


Asn 


Cli- 


HeL 


Ala 


Val 


Cys 


Ala 


Thr 


He 


Arg 


Leu 


Asn 


Pro 


Glu 


His 


Glu 










325 










330 










335 




Glu 


val 


Arg 


Phe 


Leu 


Gin 


Gly 


Glu 


Gly 


Val 


Gly 


Val 


Val 


Thr 


Leu 


Pro 








340 










345 










350 






Gly 


Leu 


Gly 


T.«-u 


Glu 


Val 


Gly 


Gly 


Pro 


Ala 


He 


Asn 


Leu 


Val 


Pro 


Arg 






355 










360 










365 






Arg 


Het 


Met 


Thr 


Ala 


Glu 


Val 


Arg 


Arg 


Leu 


Tyr 


Ala 


Gin 


Gly 


Gly 


Val 




370 










375 










380 






Asp 


lie 


Thr 


I] ft 


Ser 


Val 


Pro 


Glu 


Gly 


Arg 


Glu 


Ala 


Ala 


Thr 


Gin 


Thr 


305 










390 








395 










4 00 


Phe 


Asn 


Pro 


Arg 


L~u 


Gly 


He 


Arg 


Asp 


Gly 


He 


Ser 


He 


He 


Gly 


Thr 










4 05 










410 










415 




Ser 


Gly 


Val 


Val 


Lys 


Pro 


Phe 


Ser 


Ala 


Glu 


Ala 


Phe 


Val 


Gly 


Ala 


He 








420 










425 










4 30 






Arg 


Lys 


Gin 


Val 


Gly 


He 


Ala 


Thr 


Ala 


Leu 


Gly 


Ala., 


Ann 


His 


He 


Val 






435 










440 










445 








Leu 


Arn 


Ser 


Gly 


Ala 


Lys 


Ser 


Glu 


Arg 


Tyr 


Val 


Lys 


Gly 


Ala 


Tyr 


Pro 




450 










455 










460 








Ala 


Leu 


He 


Pro 


Gin 


Ala 


Phe 


Val 


Gin 


Tyr 


Gly 


Asn 


Phe 


Val 


Gly 


Glu 


4 65 










470 








475 








480 


Ser 


Leu 


Ser 


Cys 


Val 


Ala 


Ser 


Phe 


Pro 


Ser 


Val 


Arg 


Ser 


Val 


Thr 


Val 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



350/490 



10 



15 



25 



30 



45 



50 



55 



60 



65 



70 



75 











485 










490 










4 95 




Gly 


lie 


Mat 


Leu 
500 


Gly 


Lys 


Ala 


Val 


Lys 
505 


Leu 


Ala 


Glu 


Gly 


Tyr 
510 


Leu 


Asp 


Thr 


His 


Ser 


Lys 


Lys 


Val 


Val 


Met_ 


Asn 


Arg 


Arp 


Phe 


Leu 


Kis 


Glu 


Leu 






515 










520 








525 








Al a 


Arg 
530 


Gin 


Ala 


Gly 


Cys 


Ser 
535 


Glu 


Asp 


He 


His 


Ala 
540 


Tie 


He 


Asp 


Ser 


Leu 


Asn 


L^u 


Ala 


Arg 


Glu 


Leu 


Trp 


Thr 




Pro 


Scr 


Ala 


Glu 


Asp 


Seu 


545 










550 




















5P0 


Arp 


Arg 


Leu 


Leu 


Arg 
565 


Lys 


He 


Ala 


Glu 


Arg 
570 


Ser 


Trp 


Glu 


Thr 


Cys 
57 5 


Arg 


Tro 


Ser 


Val 


Pro 
SBO 


Ser 


Ala 


Glu 


Leu 


Glu 
SOS 


Leu 


L-u 


Leu 


He 


Asp 
590 


GlU 




Gly 


Ala 


Tie 
595 


Acg 


Phe 


Arg 


Ila 


Gly 
600 


Gly 


Glu 















(2) INFORMATION FOR SF.Q ID NO: 414 



[1} SEQUENCE CHARACTERISTICS: 
20 {AJ LENGTH: 443 amino acide 

(B) TYPE: amino acid 
ID} TOPOLOGY: linear 



(iij MOLECULE TYPE: prctein 

(111] HYPOTHETICAL: YES 

(vil ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingrvalis 



FF.ATURE: 
tAJ NAME/KCY: nusc feature 
(BJ LOCATION 1. . .443 

35 {XI) SEQUENCE DESCRIPTION SEQ ID tiO:414 



40 



Met 


Leu 


Arg 


Thr 


Phe 


Arg 


He 


Gly 


Gly 


He 


Hie 


Pro 


Pro 


GlU 


Asn 


Lys 


1 








5 










10 










IS 




Leu 


Ser 


Ala 


Gly 


Lys 


Pro 


Val 


Glu 


Val 


Leu 


Pro 


He 


Pro 


S-r 


Gin 


Val 








20 










25 










30 






Val 


He 


Pro 


Leu 


Gly 


Gin 


His 


He 


Gly 


Ala 


Pro 


Ala 


Thr 


Ala 


Thr 


Val 






35 










40 










45 








Lys 


Lys 


GLy 


Asp 


Glu 


Val 


Lys 


Val 


Gly 


Thr 


He 


He 


Ala 


Gin 


Ala 


Gly 




50 










55 










60 








Gly 


Phe 


Val 


Ser 


Ala 


Asn 


He 


His 


Ser 


Ser 


Val 


Ser 


Gly Lys 


Val 


Leu 


65 










70 










/b 










80 




Tie 


Asp 


Asn 


Val 


Tyr 


Asp 


Ser 


Ser 


Gly 


Tyr 


Pro Lys 


Pro 


Ala 


Val 










65 










90 










95 




Phe 


He 


Ser 


V«I 


Glu 


Gly 


Asp 


Glu 


Trp 


Glu 


Glu 


Gly 


He 


Asp 


Arg 


Ser 








100 










105 










110 






Pro 


Ala 


He 


Val 


Lys 


Glu 


Cys 


Asn 


Leu 


Asp 


Ala 


Lys 


Glu 


He 


Val 


Ala 






115 










120 










12S 








Lys 


He 


Ser 


Ala 


Ala 


Gly 


He 


Val 


Gly 


Leu 


Gly 


Gly 


Ala 


Thr 


Phe 


Pro 




130 










135 








140 










Thr 


Hi? 


Val 


Lys 


Leu 


Ser 


Pro 


Pro 


Pro 


Gly 


Asn 


Lys 


Ala 


Glu 


He 


Lsu 


145 










150 








155 








160 


He 


He 


Asn 


Ala 


Val 


Glu 


Cys 


Glu 


Pro 


Tyr 


Lsu 


Thr 


Ser 


Asp 


His 


Val 










165 










170 










175 




Leu 


net 


I.-ui 


GlU 


His 


Gly 


Glu 


Glu 


He 


Ilet 


He 


Gly Val 


Ser 


He 


Leu 








160 










185 










190 






Met 


Lyr 


Ala 


He 


Gl n 


Val 


Asn 


Lys 


Ala 


Val 


He 


Gly Val 


Glu 


Asn 


Asn 






195 










200 










205 








Lys 


Lys 


Asp 


Ala 


He 


Ala 


His 


Leu 


Thr 


Lys 


Leu 


Ala 


Thr 


Ala 


Tyr 


Pro 




210 










215 










220 








Gly 


He 


Glu 


Val 


Met 


Pro 


Leu 


Lys 


Val 


Gin 


Tyr 


Pro 


Gin 


Clo- 


Gly 


Glu 












230 










235 








240 


L/s 


Gin 


Leu 


He 


Arp 


Ala 


Val 


He 


Arg 


Lys 


Gin 


Val 


Lys 


ser 


Gly 


Ala 










245 










250 










255 




Leu 


Pro 


Tl- 


Ser 


Thr 


Gly 


Ala 


Val 


Val 


Gin 


Asn 


Val, 


..Gly 


Thr 


Val 


Phe 








260 










265 










270 






Ala 


Val 


Tyr 


Glu 


Ala 


Val 


Gin 


Lys 


Asn 


Lys 


Pro 


Leu 


Val 


Glu 


Arg 


He 






275 










280 










285 






Val 


Thr 


Val 


Thr 


Gly 


Lys 


Lys 


Leu 


Ser 


Arg 


Pro 


Ser 


Asn 


Leu 


Leu 


Val 




290 










295 










300 










Arg 


He 


Gly 


Thr 


Pro 


lie 


Ala 


Ala 


Lou 


He 


Glu 


Ala 


Ala 


Gly 


Gly 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/O10Z3 



351/490 





305 










310 










315 








320 




Pro 


Glu 


Asn 


Thr 


Gly 
32b 


Lys 


lie 


He 


Gly 


Gly 
330 


Gly 


fro 


Met 


Met 


Gly Arg 
335 




Ala 


Leu 


Leu 


Ser 


Pro 


Asp 


Val 


Pro 


Val 


Thr 


Lys 


Gly 


Ser 


Ser 


Gly Val 


5 








340 










345 










350 






Leu 


lie 


Leu 


Asp Arg 


Glu 


Glu 


Ala 


Val 


Arg 


Lys 


Pro 


Met 


Arg 


Asp cys 








355 










360 










365 








1]^ 


Arg 


Cys 


Ala 


Lys 


cys 


Val 


Gly 


Val 


Cys 


Pro 


Met 


Gly 


Leu 


Asn Pro 


10 




370 










375 










380 








Ala 

385 


Phe 


Leu 


Met 


Arg 


Asp 
390 


Thr 


Leu 


Tyr 


Lys 


Ser 
395 


Trp 


Glu 


Thr 


Ala Glu 
400 




Lys 


Gly Asn 


Val 


Vai 


Asp 


Cys 


He 


Glu 


Cys 


Gly 


Ser 


Cys 


Ser 


Phe Thr 












405 










410 










415 


15 


Cys 


Pre 


Ala 


Asn 


Arg 


Pro 


Leu 


Leu 


Asp 


Tyr 


lie 


Arq 


Gin 


Ala 


Lys Lys 








420 










425 










430 






Thr 


Val 


Met 
435 


Gly 


He 


Gin 


Arg 


Ala 
440 


Arg 


Lys 


Gin 











20 



25 



50 



55 



60 



65 



70 



75 



12) INFORMATION FOR S£Q ID HO; 415 

[i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 9 amino acids 
(D) TYPE: amino acid 
{D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



30 <vi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas gingival is 

Ux> FEATURE: 

IA) NAME/KEY: mlscjeatute 
35 (B} LOCATION 1...479 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 4 15 

40 



45 



Met 


Lys 


Arg 


He 


Gin 


Leu 


Thr 


Leu 


He 


Ala 


Leu 


Phe 


Ala 


Ala 


Val 


Ala 


1 








5 










10 










15 




Gly 


Leu 


Val 


Ala 


Gin 


Asn 


Ala 


Tyr 


Glu 


Gly 


Val 


He 


Ser Tyr Lys 


He 








20 










25 










30 






Ser 


Leu 


Asp 


Lys 


Thr 


Gly 


Asn 


Lys 


Val 


Val 


Leu 


Asn 


Gly Ala 


Ala 


Asp 






35 










40 










45 








Met 


Ser 


Asn 


Leu 


Lys 


Leu 


Lys 


Ser 


Thr 


Gin 


Met 


Ho 


He 


Val 


Thr 


Pro 




50 










55 










60 










He 


Leu 


Arg 


Ser 


Glu 


Asp 


Gly 


Thr 


Ser 


Arg 


Val 


Glu 


Phe 


Pro 


Ser 


Vol 


65 










7 0 










75 










80 


Val 


He 


Thr 


Gly 


Arq 


Asn 


Arg 


Thr 




Ala 


Leu 


Lys 


Arg 


Glu 


He 


Ala 










85 










90 










95 




Ph~ 


Ser 


Ser 


Ala 


Leu 


Pro 


Gin 


Ala 


Lys 


His 


Ala 


Ala 


Gin 


Tyr 


He 


Arg 








100 










105 










110 




Axg 


Hit 


Asn 


Gly 


T.yj? 


Ser 


Glu 


Gin 


Phe 


Ala 


Phe 


Thr 


Gly Glu 


His 


Ala 






115 










120 










125 








Tyr 


Ala 


Ser 


Trp 


Met 


Met 


Asp 


Ala 


Lys 


Phe 


Val 


Val 


Arg Glu 


Glu 


Val 




130 










135 










140 










Arg 


Gly 


Cys 


Ala 


Lys 


Cys 


Pro 


Val 


Gly 


Leu 


Ser 


Ser 


Asn 


He 


Val 


Pro 


145 










150 








155 










160 


Phe 


Asp 


Pro 


Leu 


Phe 


Asn 


Pro 


Ala 


Glu 


Ala 


Pro 


Tyr 


Leu 


Leu 


Ala 


His 










165 










170 








17 5 




He 


Thr 


Pro 


Ala 


Glu 


Glu 


Val 


Glu 


Lys 


Gin 


Arg 


Glu 


Ser 


Ser 


Phe 


Asp 








180 










185 










1 90 






Ala 


Tyr 


He 


Asn 


Phe 


Lys 


Val 


Asn 


Lys 


Ala 


Asp 


Val 


Leu 


Pro 


Glu 


Tyr 






195 










200 










205 








Arg 


A*n 


Asn 


Lys 


Ala 


Glu 


Leu 


Glu 


Lys 


He 


Lys 


Glu 


Phe 


Val 


Ser 


Thr 




210 










215 








220 










Val 


Lys 


Ala 


Asn 


Pro 


Asn 


Tyr 


Ser 


Val 


Asn 


Lys 


Met 


He 


He 


Glu 


Gly 


225 










230 










235 










240 


Phe 


Ala 


Ser 


Pro 


Glu 


Ala 


Ser 


He 


Ala 


His 


Asn 


L/s 


Ala 


Leu 


Ser 


Glu 










245 










250 










255 




Axg 


Arg 


Ala 


I.y* 


Arg 


Leu 


Ala 


Glu 


Glu 


Leu 


Val 


Arg 


Lys 


Tyr 


Gly 


Lys 








260 










26b 










270 






Thr 


Leu 


Pro 


Asn 


He 


Thr 


Thr 


Glu 


Phe 


Gly 


Gly 


Glu 


Asp 


Trp 


Lys 


G3y 






275 










280 










285 








Leu 


Lyt 


Leu 


Ala 


lie 


Glu 


Lys 


Ser 


Asp 


He 


Ala 


Asp 


Arg 


Asp 


Arg 


Val 
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352/490 



35 



50 



55 



60 



65 



70 



75 





290 










2 95 










300 










Leu 


Glu 


He 


T ] e 


Asn 


Ser 


Asp 


Lys 


Tyr 


Ala 


A.«?p 


Asp 


Asp 


Ala 


Arg 


Glu 


305 










310 










315 










320 


Gin 


Ala 


Uu 


Lys 


Gin 


Leu 


Ser 


Ser 


Tyr 


Arg 


Tyr 


11* 


Leu Asp CJln 


He 










325 










330 










335 




Tyr 


Pro 


Asn 


Leu 


Arg 


Arg 


Arn 


Thr 


He 


Thr 


Met 


Gly 


Tyr 


He 


Val 


Axg 








34 0 










345 










350 






Asp 


Tyr 


Thr 


Lsu 


Glu 


GlU 


Ala 


Arg 


Glu 


He 


He 


Lys 


Thr 


Ala 


Pro 


Lys 






355 










360 










365 








Glu 


Leu 


Ser 


Gl u 


Ala 


Glu 


Itet 


Tyr 


Arg 


Val 


Ala 


J let 


Ser 


Tyr 


Pro 


Glu 




370 










375 










380 










Gly 


Ills 


Gin 


Glu 


Arg 


Leu 


Phe 


Ala 


Leu 


Asn 


Thr 


Thr 


Leu 


Lys 


Tyr 


Phe 


385 










390 










395 










4 00 


Pro 


Glu 


Ser 


Val 


Thr 


Gly Arg 


lie 


Asn 


Leu 


Ala 


Val 


Ala 


Ala 


Phe 


Asn 










4 05 










410 










415 




Gly 


Gly 


Asp 


Val 


tiln 


Gin 


Ala 


He 


Ala 


Leu 


Leu 


Ser 


Pro 


He 


Gin 


Thr 








420 










4 25 










430 






Glu 


L> s 


Gly 


Val 


Ser 


A^n 


He 


Leu 


Gly 


Ala 


Ala 


Tyr 


Ala 


Arq 


Thr 


Gly 






435 










440 










445 








Asp 


Phe 


Ala 


Arg 


Ala 


Glu 


Thr 


Phe 


Phe 


Arg 


Lys 


Ala 


Val 


Ala 


Glu 


Gly 




450 










455 










460 










Asp Ala 


Asn 


Ala 


G) n 


Arg 


Asn 


Leu 


Asp 


Met 


Leu 


Leu 


Gly Lys 


Lys 




465 










4"?0 










475 













10 



15 



20 



25 (2) I1IFORI 1ATION FOR SEQ ID NO: 41 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 
30 (P) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



[Vil ORIGINAL SOURCE: 

(A) ORGANISM; Porphyromonas gingival is 



FEATURE: 

40 (A) HAI IE/KEY : miscjeatum 
f B J LOCATION 1 383 



45 





lx±) SEQUENCE DESCRIPTION: SEQ ID NO: 416 








Met 


Ala Glu 


Lys 


Arg 


Asp 


Tyr 


Tyr 


Glu 


Val 


Leu 


Gly 


val Ser 


Lys 


Asn 


1 






5 










10 








15 




Ala 


Thr Asp 


Aep 


Glu 


T.eu 


Lys 


Lys 


Ala 


Tyr 


Arg 


Lys 


Lys Ala 


He 


Gin 






20 










25 








30 






Tyr 


His Pro 


Asp 


Lys 


Asn 


Pro 


Gly 


A.<rp 


Lys 


Glu 


Ala 


Giu Glu 


His 


Phe 




35 










40 










45 






Lys 


Glu Val 


Ala 


Glu 


Ala 


Tyr 


Asp 


Val 


Leu 


Ser 


Asp 


Pro Gin 


Lys Arg 




50 








55 










60 








Ser 


Gin Tyr 


Asp 


Gin 


Phe 


Gly 


His 


Ala 


Gly 


Leu 


Gly 


Gly Ala 


Ala Gly 


65 








70 










7b 








80 


Gly Gly Phe 


Ser 


Gly 


Gly 


Gly 


Met 


Ser 


Met 


Glu 


Asp 


He Phe 


Ser 


Arg 








85 










90 








95 


Phe 


Gly Asp 


Leu 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly Phe 


Ser 


Asp 






100 










105 








110 






Net 


Gly Gly 


Gly 


Ser 


Arg 


Arg 


Arg 


Val 


Arq 


Arg 


Gly 


Ser Asp 


Leu 


Arg 




115 










120 










12S 






Val 


Arg Val 


Lys 


Leu 


Ser 


Leu 


Ala 


Asp 


He 


Ser 


Lys 


Gly Val 


Glu 


Lys 




130 








135 










140 








Lys 


Val Lys 


Val 


Lys 


Lys 


Gin 


Vol 


Val 


Cys 


Ser 


Lys 


Cys Arg 


Gly Asp 


14 5 








150 










155 








160 


Gly 


Thr Glu 


Glu 


Ala 


Asn 


Gly 


Lys 


Thr 


Thr 


Cys 


Gin 


Thr Cys 


His 


Gly 








165 










no 








175 




Thr Gly Val 


Val 


Thr 


Arg 


Val 


Ser 


Asn 


Thr 


Phe 


Leu 


Gly Ala 


Met 


Gin 






180 










185 








190 






Thr 


Glr\ Ser 


Thr 


Cyr* 


Pro 


Thr 


Cys 


His 


Gly 


Glu 


Gly 


Glu He 


He 


Thr 




195 










200 










205 






Lys: 


Pro Cy."* 


S^r 


Lys 


Cys 


Lys 


Gly 


Glu 


Gly 


Val 


Glu 


He Gly Glu 


Glu 




210 








215 










220 








Val 


He Ser 


Phe 


His 


He 


Pro 


Ala 


Gly 


Val 


Ala 


Glu 


Gly Met 


Gin 


Met 


225 








230 










235 






240 


Ser 


Val Asn 


Gly 


Lys 


Gly 


Asn 


Air, 


Ala 


t*ro 


Arg 


Cli- 


Gly Val 


Asn 


Gly 
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245 250 



Asp 


Lsu 


He 


Val 


Val 


He 


Ala 


Glu 


Glu 








260 










265 


A^n 


Gly 


Aiii 


Asp 


Leu 


lis 


Tyr 


Asn 


Leu 






275 








280 




He 


T.ys 


Civ- 


Gly 


Ser 


Val 


Glu 


Val 


Pro 




290 










2?b 






He 


Arg 


ile 


Glu 


Ala 


Gly 


Thr 


Gin 


Pro 


305 










310 








Am 


I.ys 


Gly 


Leu 


Pro 


Ser 


Val 


Asn 


Gly 










3?5 








Val 


Asn 


Val 


Asn 


Val 


Tyr 


lie 


Pro 


Glu 


'Jin 
















345 


Ala 


He 


Ala 


Ala 


Met 


Glu 


Asn 


Ser 






355 










360 


Ala 


Ala 


Ary 


Lys 


Asp 


lie 


Asp 


Lys 


Lys 




370 










375 


(2) 


IN FORI I^TIOII 


FOR 


SEQ 


ID NO; 41'/ 



250 255 
Pro Asp Pro Asn Leu He Arg 
270 

Leu He Ser Val Pro Leu Ala 
205 

Thr Hp Asp Gly Arg Ala Lye 
300 

Gly Lys Met Leu Arg Leu Arg 
31S 320 
Tyr Gly Met Gly Asd Gin Leu 
330 335 
Ser He Asp Ala Lys Asp Glu 
350 

Asp Scr Phs Lys Pro Thr Asp 
365 

Tyr Arg Glu Me L Leu Asp 

380 



(1) SEQUENCE CHARACTER T ST I CS: 

(A) LENGTH: 2 93 amino acids 
iP.) TYPE: arrino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: psoLein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas gingiva lis 

(is) FEATURE: 

(A) NAME/ KEY: misc_f -ature 
t B ) LOCATION 1 ... 2 93 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

Met Lys Ly~ Leu He Leu Ala Thr Leu Gly Leu Met Ala He Ala Met. 

7 5 10 is 

Leu Ser Cys Ser Ser Asn Asn Lys Asp Leu Glu Asn Lys Gly Glu Ala 

20 25 jo 

Thr Leu Leu Val Thr Phe Gly Ser Ser Tyr Lys Ala Pro Arc, Glu Thr 

35 4 o 45 

Tyr Ala Lys life Glu Lys Thr Phe Ala Ala Ala Tyr Pro Asp Gin Arg 

50 55 60 

He Ser Trp Thr Tyr Thr Ser Ser II- He Arg Lys Lys Leu Ala Gin 

Gin Gly He Tyr He Aarp Ala Pro Asp Glu Ala x^u Glu Lys L-u Ala 

e5 90 95 

Arg Leu Gly Tyr Lys Lys He Asn Val Gin Ser Leu His Val He Pro 

100 105 no 

Gly Arg Glu Tyr Asp Glu Met He Asp Phe Val Asn Lys Phe Lys Ala 

H J 120 i?s 

Ala His Ser Asp He Thr Val Lys Val Gly Ala Pro Leu Phe Asp Thr 

Asp Glu Asp list Arg Glu Val Ala Glu lie Leu His Lys Arg Phe Gin 

155 160 
Gin Thr He Glu Lys Gly Glu Ala He Val Phe Met Gly His Gly Thr 

165 i7 0 175 

Glu His Ala Ala Asn Asp Arg Tyr Ala Arg He Asn Lys He Met Lys 
180 lfl 5 190 

Tyr foe LyS PhS MeL Ile Val Thr v *l Glu Ser Asp Pro Ser 

195 200 205 

He Asn Asp Val Ile Ala Glu Leu Lys Glu Thr Gly Ala Thr Ala Val 

Thr Met Met Pre Leu Met Ser Val Ala Gly Asp His Ala Thr Asn Asn 
225 230 235 24n 

MeL Ala GJy Asp Glu Asp Asp Ser Trp Lys Thr Wu Leu Thr Asn Ala 

245 250 25S 

Gly Tyr Thr Val Ser Ii« Asp Lys Leu Asp Asn Gly Asn Phe Ser Ala 

2^0 2 65 270 

Leu Gly Asp He Glu Glu He Arg Asn He Trp Leu Lys His Met Lys 

' 3 280 28S 

Ala Thr Ser Ala Arg 
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2 00 

12) INFORMATION FOR SEQ ID 110:418 

5 (U SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 356 amino acids 

[B) TYPE: amino acid 
{D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 
15 (A) ORGAilISM: Porphyromonas gingivalis 

lix) FEATURE: 

(A) HAltE/KEY: mi£C_feature 

(B) LOCATIOH 1. . .356 



(y.i) SEQUENCE DESCRIPTION SEQ TD HO: 419 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 <2] INFORMATION FOR SEQ ID HO: 41? 

til SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 757 amino acids 

(B) TYPE: amino acid 
75 1 D) TOPOLOGY; linear 



Met 


Thr 


Set 


Val 


Ser 


His 


Leu 


Arg 


Thr 


He 


Ser 


Val 


Ala 


Giy 


He 


Leu 


1 


















10 










15 




Ala 


Ala 


Leu 


Gly 


Gly 


Ala 


Val 


Leu 


lie 


T,-U 


Phe 


Gly 


Val 


Asn 


L°u 


Phe 








20 










25 










30 






Leu 


Gly 


S^r 


Val 


Ala 


Tie 


Pro 


Ilet 


Ser 


Glu 


He 


Phe 


Arg 


His 


Leu 


Phe 






35 










40 










45 








Ser 


As-p 


Arg 


Pro 


Glu 


Gly 


Gly 


Glu 


Ala 


Leu 


Val 


His 


Tyr 


Am 


lis 


Leu 




50 










5b 










60 










Trp 


Lys 


Ser 


Arg 


Leu 


Pro 


Glu 


Ala 


Leu 


Thr 


Ala 


Ala 


Phe 


Ala 


Gly 


Ala 


65 










70 










75 










80 


Gly 


T.su 


Sec 


Val 


Ser 


Gly 


Leu 


Gin 


Met 


CJn 


Thr 


Val 


Phe 


Arg 


Asn 


Pro 




















90 










95 




Leu 


Ala 


Gly 


Pro 


Ser 


Val 


Leu 


Gly 


He 


Ser 


Ser 


Gly 


Ala 


Ser 


Leu 


Gly 








100 










105 










110 






Val 


Ala 


Leu 


Val 


Val 


Leu 


Leu 


Ser 


Gly 


Ser 


Leu 


Gly 


Gly 


Val 


Ala 


Leu 






115 










120 








125 








Ser 


Ser 


Leu 


Gly 


Tyr 


Met 


Gly 


GlU 


Val 


Ala 


Met 


Asn 


He 


Ala 


Ala 


Ala 




130 










135 










140 










Val 


Gly 


Ser 


Leu 


Ala 


Val 


Met 


Gly 


Leu 


He 


Val 


Phe 


Val 


Ser 


Thr 


Lys 


145 










150 










155 










160 


V<j1 


Arg 


Ser 


His 


Val 


Thr 


Leu 


Leu 


lie 


He 


Gly Val 


Met 


He 


Gly 


Tyr 










165 










no 










175 




Val 


Ala 


Thr 


Ala 


Val 


lie 


Gly 


Val 


Phe 


Lys 


Phe 


Pne 


Ser 


He 


Glu 


Glu 








] fiO 










165 










190 






Asp 


lie 


Aig 


Ala 


Tyr 


Val 


lie 


Trp 


Gly 


Leu 


Gly 


Ser 


Phe 


Ser 


Arg 


Ala 






195 










2Q0 










205 








Thr 


Acp 


S^r 


Gin 


Leu 


Ser 


Phe 


Phe 


Ala 


He 


Leu 


tiet 


Leu 


He 


Phe 


He 




210 










215 










220 










Pro 


Ala 


Gly 


llet 


Leu 


Leu 


Val 


Lys 


Gin 


Leu 


Am 


Leu 


I.eu 


Leu 


Leu 


Gly 


225 










230 










235 










240 


Glu 


Ser 


Tyr 


Ala 


Arg 


Asm 


Leu 


Gly 


Leu 


Asn 


Thr 


Arg 


Arg 


Ala 


Arg 


Leu 










245 










250 










255 




Leu 


Val 


Il» 


Ser 


Ser 


Ala 


Gly 


Leu 


Leu 


He 


Ala 


Thr 


Val 


Thr 


Ala 


Tyr 








260 










265 










270 






Cys 


Gly 


Pro 


lie 


Gly 


Phe 


Leu 


Gly Met 


Ala 


Val 


Pro 


His 


Leu 


Ala 


Arg 






275 










280 










285 








Val 


lis 


Fhe 


His 


Thr 


Ser 


Asp 


His 


Arg 


He 


Leu 


Met 


Pro 


Ala 


Thr 


cys 




290 










295 










300 










Leu 


He 


Gly 


Ser 


Ala 


Leu 


Ala 


Leu 


Phe 


Cys 


Asn 


He 


He 


Ala 


Arg 


Met 


305 










310 










315 










320 


Pro 


Gly 


Fhe 


Glu 


Gly 


Ala 


Leu 


Pro 


Val 


A.**n 


Ser 


Val 


Thr 


Ala 


Leu 


Val 










325 










330 










335 




Gly Ala 


Pro 


lis 


lie 


Val 


Thr 


Val 


Leu 


Phe 


Arg 


Arg 


Arg 


Arg 


Phe 


Lys 








34 0 










345 










350 






Glu 


Glu 


Thr 


Asp 






























355 
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(ix) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

il:<) FEATURE: 

(A) NAI IE/ray : misc^feature 

(B) LOCATION 1. . .757 

C;i> SEQL'F.NCE DESCRIPTION : SEQ ID I/O; 419 

Hex Arg Thr Lys Thr He Phe Phe Ala He He Ser Phe He Ala Leu 

. 5 10 15 

Leu Ser Ser Ser Leu Sor Ala Gin Ser Lys Ala Vai Leu Thr Gly Scr 

20 25 30 

Val Ser Asp Ala Glu Thr Gly Glu Pro Leu Ala Gly Ala Arg He Glu 

35 40 4£ 

Val T.yr HI? Thr Asn He Val Ala Gly Ala Asp Ala Gly Gly His Ph- 

^° 55 60 

talu ile Lys Asn Leu Pro Ala Gly Gin His Thr He He Cy* S=r L-u 
5? 70 75 8 n 

olv Gly Tyr Cly Gin Lys Glu Glu Val Val Ala He Glu Ala Gly Gin 

35 SJO <* 5 

Thr i-ys Thr He Scr Fhe Ala Leu Arg Leu Arg Thr Asm Asn Leu Glu 

100 105 110 

Glu Vai Val Val Thr Gly Thr Gly Thr Arg Tyr Arg Leu Val Asp Ala 

1x5 120 12 S 

Pro Vai Ala Thr Glu Val Leu Thr Ala Lys Asp He Ala Ser Phe Ser 

130 135 140 

Ala Pro Thr Ser Glu Ala Leu Leu Gin Gly Leu Ser Pro Ser Ph- Asp 
150 155 160 

Phe Gly Pro Asn Leu Met Gly Ser Phe Met Gin Leu Asn Gly L-u Ser 

165 170 175 

Ser Lys Tyr Ile Leu He Leu He Asp Gly Lys Arg Val Tyr Gly Asp 

180 18b 190 

Val Gly Gly Gin Ala Asp Leu Ser Arg Ile Ser Pro Asp Gin Ile Glu 

195 200 205 

Arg lie Glu Leu Val Lys Gly Ala Ser Ser Ser Leu Tyr Gly Ser Asp 

-10 215 220 

Ala He Ala Gly Val He Aon Val lie Thr Lys Lys Asn Thr Asn Arg 



230 



235 240 



Leu Scr Ala Tyr Thr Ser His Arg Ile Ser Lye Tyr Asn Asp Arg Sin 

245 250 255 

Thr Asn Thr Ser Leu Asp He A/in Ile Gly Lys Phe Ser Ser Asn Thr 

260 2*55 2 70 

Asn Tyr Phe Phe Tyr His Thr Asp Gly Trp Gin Asn Ser Pro Phe Glu 

275 280 285 

Ile Lys Lys Lys Lys Gly Ser Gly Glu Pro Val Leu Glu Glu Thr Tyr 

290 295 300 

Lys Lys Thr Phe Arg Ala Gin Glu Asn Gin Gly Val Ser Gin Ser Leu 
305 310 315 3 2 o 

Ser Tyr Tyr Ala Thr Asn Asn Leu Ser Phe S-r Gly Asn Val Gin Tyr 

325 330 335 

Asn Lys Arg Gin Ile Phe Thr Pro Thr Phe Ser Glu Lys Lys Ala Tyr 

340 345 350 

Asp Net Arp Tyr Arg Ala Leu Thr Ala Ser Lou Gly Thr A,n Tyr Leu 
355 3 6 o 36S 

Ph ~ ™ GAy LeU HiS Thr LSU Ser Phe Ala Va l Tyr Asp Arg 

^ i J 3 7 5 380 

Phe «£« Phe Gly Tyr Leu Tyr His Asp Lys A*p Ser Ser Glu Ser Leu 

r? 390 3 * 5 400 

Asn Gln G1 Y Gin Thr Glu Gin Pro Thr Phe Phe Pro Gly Gin 

405 410 415 

Leu Arg Am Lys Asn Asp Gin Ile Arg Tyr Thr Ala Clu Ala Arg Gly 

420 42b 430 

Val Phe Thr Leu Pro Tyr Ala Gin Lys Leu Thr Gly Gly Leu Glu Tyr 

43S 440 445 

Phe Arg Glu Glu Leu He Ser Pro Tyr Asn Leu He Thr Asp Lys Ala 

450 455 460 

Asp Ala Ser Thr Leu Ser Ala Tyr Val Gin Aso Glu Trp Lys Pro L-u 
470 4 /b ,_ 0 

As r Trp Phe Asn Met Thr Ala Gly Phe Arg Leu Val Hi* His Gin Glu 
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10 



15 



20 



25 



30 



35 



45 



50 



65 



70 



75 











485 










4 90 










495 




Phe 


Gly 


The 


Arg 


Met 


Thr 


Pro 


Lys 


Val 


Ser 


He 


Leu 


Ala 


hys 


Tyr 


Gly 








500 










505 










510 






Pro 


Leu 


Asn 


Phe 


Arg 


Ala 


Thr 


Tyr 


Ala 


Asn 


Gly 


Tyr 


Lys 


Thr 


Pro 


Thr 






515 










520 










525 








Leu 


Lys 


Glu 


Leu 


Phe 


Ala 


Arg 


Asn 


Glu 


Leu 


Thr 


Thr 


Met 


Gly 


Ser 


His 




530 










535 










540 










A^n 


Leu 


Tyr 


Leu 


Gly 


Asn 


Ala 


A£p 


I.pu 


Lys 


Pro 


Gin 


Met 


Ser 


Aep 


Tyr 


545 










550 










555 










560 


Tyr 


Ala 


Leu 


Gly 


Leu 


Glu 


Tyr 


Asn 


Gin 


Gly 


Pro 


He 


Ser 


Phe 


Ser 


Ala 










565 










S7 0 










575 




The 


Val 


Tyr 


Arp 


Asn 


Glu 


Leu 


Arg 


Asn 


Leu 


He 


Ser 


Phe 


Met 


Asp 


He 








580 










585 










590 






Pro 


Thx 


Ser 


Pro 


Glu 


HXs 


Glu 


Ala 


Gin 


Gly 


He 


Lys 


Lys 


Thr 


Lys 


Gin 






595 










600 










605 








Tyr 


Ala 


Asn 


lie 


Gly 


Lys 


Ala 


Arg 


ser 


Arg 


Gly 


Leu 


Asp 


Val 


Leu 


Cya 




610 










615 










620 










Asp 


Ala 


Ser 


lie 


Gly 


Trp 


Gly 


lie 


Lys 


Leu 


Gly Ala 


Gly 


Tyr 


Ser 


Leu 


625 










630 










635 










64 0 


Val 


Glu 


Ala 


Lys 


Asn 


Leu 


Gin 


Thr 


Asp 


Glu 


Trp 


Leu 


Glu 


Gly Ala 


Ala 










645 










650 










655 




Arg 


His 


Arg 


Ala 


Asn 


Val 


IUs 


Ala 


Asp 


Trp 


Val 


His 


Tytr 


Trp 


Gly 


Gin 








660 










665 










670 






Tyr 


Arc* 


Leu 


Gly Val 


Ser 


Leu 


Phe 


Gly 


Arg 


He 


Gin 


Ser 


Glu 


Arg 


Tyr 






675 










680 










605 








Tyr 


Lyr 


Arp 


Gly 


Asn 


Ala 


Pro 


Aip 


Tyr 


Thr 


Leu 


Trp 


Arg 


Leu 


Ala 


Thr 




690 










695 










700 










S-r 


His 


Arg 


Phe 


Ala 


Hi £7 


Phe 


Arq 


His 


lie 


11* 


Leu 


Asp 


Civ- 


Thr 


Leu 


705 










710 










715 










720 


Gly 


lie 


Asp 


Asn 


Leu 


Phe 


Asp 


Tyr 


Val 


Asp 


Asp 


Arg 


Pro 


Met 


Gly 


Val 










7P5 










730 










735 




Asn 


Tyr 


Ala 


Thr 


Val 


Thr 


Pro 


Gly 


Arg 


Thr 


Phe 


Pha 


Ala 


Gin 


He 


Ala 








740 










745 










750 






He 


Arg 


Phe 


Asn 


Asn 




























755 





























(2) INFORMATION FOR SEQ ID 110:420 



ti> SEQUENCE CHARACTERISTICS: 
40 <A) LENGTH: 331 amino acids 

(8) TYPE: amino acia 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(ili) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A} ORGAN I S1 1: Poxphyromonas gingival xs 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION 1. . .331 



55 {xl> SEQUENCE DESCRIPTION: SEQ ID N0:420 



60 



Met 


Thr 


Asp 


Asn 


Lys 


Gin 


Arg 


Asn 


lie 


Val 


Phe 


Pro 


Ala 


Phe 


Leu 


Leu 


1 








5 










10 










15 




Leu 


Leu 


Gly 


Val 


He 


Ala 


Val 


Val 


Thr 


He 


Val 


Gly 


Phe 


Phe 


Met 


Leu 








20 










25 










30 






Arg 


Tro 


Ala 


Glu 


Glu 


He 


He 


G)n 


Gly 


Gin 


He 


Glu 


Val 


Thr 


Glu 


Tyr 






35 










40 










45 






Arg 


Val 


Ser 


Ser 


Lys 


Val 


Pro 


Gly 


Arg 


Tie 


Ly* 


Glu 


Leu 


Arg 


Val 


Ser 




50 










55 










60 










Glu 


Gly 


Gin 


Gin 


Val 


Gin 


Ala 


Gly 


Asp 


Thr 


Leu 


Ala 


Val 


He 


Glu 


Ala 


65 










70 










75 










80 


Pro 


Aep 


Val 


Ala 


Ala 


Lys 


Met 


Glu 


Gin 


Ala 


Lys 


Ala 


Ala 


Glu 


Ala 


Ala 










85 










90 










95 




Ala 


Gin 


Ala 


Gin 


Asn 


Ala 


Lys 


Ala 


Leu 


Lys 


Gli 


Ala- 


Arg 


S-r 


Glu 


Gin 








100 










105 










110 






He 


Gin 


Ala 


Ala 


Tyr 


Glu 


Met 


Trp 


Gin 


Lys 


Ala 


Gin 


Ala 


Gly 


Val 


Ala 






115 










120 










125 








He 


Ala 


Thr 


Lys 


Thr 


His 


Gin 


Arg 


Val 


Gin 


Asn 


Leu 


Tyr Asp Gin 


Gly 




130 










135 










140 








Val 


Val 


Pro 


Ala 


Gin 


Lys 


Leu 


Asp 


Glu 


Ala 


Thr 


Ala 


Gin 


Arg 


Asp 


Ala 
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35 



45 



50 



55 



60 



65 



70 



75 



145 










150 










lbb 










160 


Ala 


Tie 


Ala 


Thr 


Gin 


Lys 


Ala 


Ala 


Glu 


Ala 


Gin 


Tyr 


Asn 


Met 


Ala 


Arg 










: 65 










no 










175 




Asn 


Gly 


Ala 


Glu 


Arg 


Glu 


Asp 


Lys 


Leu 


Ala 


Ala 


Ser 


Ala 


Leu 


Val 


Ar P 








180 










185 










190 






Arg 


Ala 


Arg 


Gly 


Ala 


Val 


Ala 


Glu 


Val 


Glu 


Ser 


Tyr 


He 


Asn 


Glu 


Thr 






195 










200 










205 








Tyr 


Leu 


He 


Ala 


Pro 


Arg 


Ala 


Gly 


Glu 


Val 


Sec 


Glu 


He 


Phe 


Pro 






210 










215 










220 










Ala 


Gly 


Glu 


Leu 


Val 


Gly 


Thr 


Gly 


Ala 


Pro 


lie 


Met 


Asn 


He 


Ala 


Glu 


225 










230 








235 










240 


Met 


Sly 


Asp 


Met 


Trp 


Ala 


Ser 


Phe 


Ala 


Val 


Arg 


Glu 


Asp 


Phe 


Leu 


Ser 










24 5 










250 










255 




Scr 


Met 


Thr 


Met 


Gly 


Ala 


Val 


Leu 


Glu 


Thr 


Val 


Val 


Pro 


Ala 


Leu 


Asn 








2 fir. 










265 










270 






Glu 


Glu 


Lys 


Val 


Arg 


Phe 


Lys 


He 


Thr 


Phe 


He 


Lys 


Asn 


Met 


Gly 


Thr 






275 










280 










285 








Tyr 


Ala 


Ala 


Trp 


Lys 


Ala 


Thr 


Lys 


Thr 


Thr 


Gly 


Gin 


Tyr 


Asp 


Leu 


Li E 




290 










2 95 










300 










Thr 


Phe 


Glu 


Val 


Lys 


Ala 


Thr 


Leu 


Ala 


Asp 


Lys 


Asp 


Lys 


Ala 


Gin 


Lys 


305 










310 










315 












Leu 


Arg 


Pro 


Gly Met 


Ser 


Val 


Tie 


He 


Arg 


Lye 




















325 










330 















10 



15 



20 



25 {2) II f FORMATION FOR SF.Q ID HO; 421 

(i > SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 267 amino acids 
(E) TYPE: amino acid 
30 ( D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 



(ix) FEATURE: 
40 (A) NAME/ KEY i mis cofeature 

{ D ) LOCATION 1. . .267 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 



Met 


Arg 


He 


Val 


Ser 


Asn 


Phe 


Leu 


Phe 


Val 


Ser 


Phe 


Ser 


Val 


Leu 


Leu 


1 








5 










10 










15 




Phe 


Ma 


Ser 


Cys 


Arg 


Ser 


Gin 


Arg 


Glu 


Lys 


Val 


Val 


Tyr 


Leu 


Gin 


Asp 








20 










25 










30 






He 


Gin 


Thr 


Phe 


Asn 


Arg 


Glu 


He 


He 


Ala 


Lys 


Pro 


Tyr Asp 


Val 


Lys 






35 










40 










45 








He 


Glu 


Lys 


Asp 


Asp 


Val 


Leu 


Asn 


lie 


Leu 


Val 


Ser 


Ser 


Axg 


Asp 


Pro 




50 










55 










60 








Glu 


Leu 


Ser 


Thr 


Pro 


Tyr 


Asn 


Gin 


Val 


Leu 


Thr 


Thr 


Arg 


Ala 


Leu 


Ala 


65 










70 










76 










SO 


Arg 


Asn 


Gly 


Tyr 


Gly 


Thr 


Asn 


S-r 


Asn 


Glu 


Gly 


Phe 


Leu 


Val 


Asp 


Ser 










85 










90 










95 




Lys 


Gly 


Tyr 


lis 


Asn 


Tyr 


Pro 


He 


Leu 


Gly 


Gin 


He 


Tyr Val 


Glu 


Giy 








100 










105 










110 






Leu 


Thr 


Arg 


Thr 


Glu 


Leu 


Glu 


Lys 


Glu 


He 


Gin 


Lys 


Axg 


He 


He 


S-r 






lis 










120 








12b 








Ser 


Gly 


Phe 


He 


Lys 


Asp 


Pro 


Thr 


Val 


Thr 


Val 


Gin 


Leu 


Gin 


Asn 


Fhe 




130 










135 










140 










Lys 


Val 


Ser 


Val 


Leu 


Gly 


Glu 


Val 


Asn 


His 


Pro 


Gly 


Ser 


Met 


Ser 


Val 


145 










150 










155 










160 


Lys 


Gly 


Glu 


Arg 


He 


Thr 


Leu 


Leu 


Glu 


Ala 


He 


Gly 


Met 


Ala 


Gly 


A*p 










165 










170 










175 




Leu 


Thr 


He 


Tyr 


Gly 


Arg 


Arg 


Asp 


Arg 


Val 


Phe 


Val 


He 


Arg 


Glu 


Thr 








180 










185 










190 






Asp 


Gly 


Hit 


Arg 


Glu 


Val 


Phe 


Gin 


Thr 


Asp 


Leu 


Arg 


Lys 


Ala 


Asp 


Leu 






195 










200 










205 








Leu 


Ala 


Ser 


Pio 


Val 


Tyr 


Tyr 


Leu 


His 


Gin 


Aen 


Asp 


Vul 


11- 


Tyr 


Val 




210 










2iS 










220 










Glu 


Pro 


Asn 


Asp 


Lys 


Lys 


Thr 


Gin 


Met 


Ser 


Glu 


He 


Asn 


Gin 


Acn 


Asn 


225 










230 










235 










2 "i 0 


A?n 


Val 


Asn 


Val 


Trp 


T.cu 


Ser 


Val 


Thr 


Scr 


Thr 


Leu 


Val 


Ser 


He 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



S-p Tii €?.v ^"^ ^" 



WO 99/29870 PCT/AU98/01023 

358/490 

245 250 255 

Thr Leu Thr lie Thr lie He Asp Lys Thr Lys 
260 265 

5 {2) HIFOWlATIOiJ FOR SEQ ID NO : 4 22 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 569 amino acids 
IB| TYPE: amino acid 
10 (l» TOPOLOGY: linear 

Uii MOLECULE TYPE: protein 

llli) HYPOTHETICAL : YES 



15 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vH ORIGI1IAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 



<ixj FEATURE: 
20 <A) MAME/KEY: misc_feature 

(B) LOCATION 1 . . ,569 



25 





(Kii 


! SEQUENCE DESCRIPTION: SEQ ID NO: 4 22 










Jlet 


Lys 


Ly&- 


Thr 


Asn 


Leu 


Phe 


Leu 


Ser 


Leu 


Leu 


Vttl 


He 


Phe 


He 


Thr 


1 








•* 










10 










15 




Gly 


Srr 


Phe 


I let 


Thr 


Al a 


Cys 


Ala 


Gin 


Lys 


Ser 


Lys 


Thr 


Asn 


Lye 


Leu 








20 




















30 






Thr 


Glu 


Glu 


Asp 


Arg 


Scr 


Arg 


Asn 


Glu 


Tyr 


Val 


Gin 


Ser 


Met 


Asp 


Val 






3* 










40 










45 








Leu 


Ser 




He 


He 


Gly 


Asn 


Val 


Arg 


Leu 


Tyr 


Phe 


Val 


Asp 


Thr 


He 




50 










55 










60 










Ser 


He 


Lys 


His 


Met 


Thr 


Arg 


Arg 


Gly 


He 


Asp 


Ala 


Met 


Leu 


Cly 


Gly 


65 










70 










75 










AO 


Leu 


Asp 


Pro 


Tyr 


Thr 


Glu 


Tyr 


He 


Pro 


Tyr 


Glu 


Glu 


Met 


Asp 


Glu 


Leu 










85 










90 










95 




Lys 


Leu 


I let 


Thr 


Thr 


Gly 


Glu 


Tyr 


Ala 


Gly 


Val 


Gly 


Ala 


He 


He 


Ser 








100 










ICS 










110 






Gin 


Arg 


Pro 


Asp 


Ser 


Ala 


Val 


He 


lie 


Gin 


Arg 


Pro 


Met 


Glu 


Gly Met 






115 










120 










125 








Pro 


Ala 


Asp 


Glu 


Ala 


Gly 


Leu 


He 


Ala 


Gly 


Asp 


Arg 


He 


Leu 


Thr 


He 




130 










135 










140 










Asp 


Gly 


Lys 


Asp 


Phe 


Arg 


Lys 


Scr 


Thr 


Thr 


Pro 


Lys 


Val 


Ser 


Gin 


Ala 


145 










150 










155 










160 


Leu 


Lys 


Gly 


lie 


Ala 


Gly 


Thr 


Val 


Ala 


Lys 


Val 


Thr 


Val 


Met 


Arg 


Tyr 










165 










170 














Giy 


Glu 


Thr 


Lys 


Pro 


Arg 


Thr 


Phe 


Ser 


Val 


Lys 


Arg 


Gin 


Lys 


Val 


11" 








130 










185 










190 






list 


Asn 


Ser 


Val 


Thr 


Tyr 


Ser 


Gly 


Met 


Leu 


Asp 


Gly 


Sec 


He 


Gly 


Tyr 
















POO 










205 






He 


Arg 


Leu 


Asn 


Asn 


Phe 


Thr 


Asp 


Lys 


Ser 


Ala 


<siu 


Glu 


Val 


Arg 


Thr 




210 










215 










220 








Ala 


Leu 


Leu 


Asp 


Leu 


Arg 


Asp 


Lys 


Gin 


Gly 


Ala 


Lys 


Gly Leu He 


Leu 


22S 










230 










235 










240 


Asp 


Leu 


Arg 


Gly 


Asn 


Gly 


Gly 


Gly 


Leu 


Met 


Gin 


Ala 


Ala 


He 


Glu 


He 










245 










250 










255 




Val 


Asn 


Leu 


Phe 


Val 


Pro 


Lys 


Gly 


Lys 


Glu 


Val 


Val 


Thr 


Thr 


Lys 


Gly 








260 










265 










27 0 






Acq 


He 


Ala 


Glu 


Ser 


Ala 


Ser 


Val 


Phe 


Arg 


Thr 


Leu 


Thr 


Glu 


Pro 


lie 






27 5 










280 










28b 








Asp 


Thr 


Lys 


Leu 


Pro 


He 


Val 


Val 


Leu 


lie 


Asp 


Gly 


Gin 


Ser 


Ala 


Ser 














295 










300 










S-r 


Ssr 


Glu 


He 


Val 


Ala 


Gl y 


Ala 


Leu 


Gin 


Asp 


Met 


Asp 


Arg 


A3 a 


Val 


305 










310 










315 










320 


Leu 


Met 


Gly 


Gin 


Lys 


Ser 


Tyr 


Gly 


Lys 


Gly 


I.eu 


Val 


Gin 


Thr 


Thr 


Arg 










325 










330 










335 


Gin 


Leu 


Pro 


Tyr 


Asn 


Gly 


Val 


He 


Lys 


Leu 


Thr 


Thr 


Ala 


Lys 


Tyr 


Tyr 








340 










3-15 










350 


He 


Pro 


Ser 


Gly 


Arg 


cya 


He 


Gin 


Arg 


I.eu 


Asp 


Tyr* 


Ser Arg 


Thr 


Asn 






355 










360 










365 








Arg 


Thr 


Gly 


Met 


Ala 


Thr 


A) a 


He 


Pro 


Asp 


Ser 


Leu 


His 


Lys 


He 


Phe 




370 










37 5 










380 










Tyr 


Thr 


Ala 


Ala 


Gly 


Arg 


Arg 


Val 


Glu 


Asp 


Ala 


Gly 


Gly 


He 


Leu 


Pro 


385 










390 








395 










400 


Asp 


He 


GLu 


Val 


Lys 


Gin 


Asp 


Thr 


Ala 


Alo 


Thr 


Leu 


Leu 


Tyr 


Tyr 


Met 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/010Z3 



359/490 











405 










410 










415 




Al a 


He 


Asn 


Asn 


Asp 


Val 


Phe 


Asp 


Phe 


Val 


Thr 


Gly 


Tyr 


Val 


Leu 


Lys 








420 










425 










430 






His 


Lys 


Thr 


He 


Ala 


Lys 


Pro 


Giu 


Asp 


Phe 


Ser 


Ile 


Thr 


Asn 


Glu 


Asp 






435 










440 










445 








Tyr 


Ala 


Ala 


Phe 


Cys 


Lys 


Mst 


Met 


GlU 


Glu 


Lys 


Lys 


Phe 


Asp 


Tyr 


Asp 




450 










4bb 










4 60 










Axg 


Gin 


Ser 


Gly 


Lys 


Met 


Leu 


Asp 


Lys 


Leu 


Gl u 


Glu 


Leu 


ALa 


Lys 


He 


4 65 










470 










475 










480 


GXu 


Gly 


Tyr 


Leu 


Pro 


Glu 


Ala 


Asn 


Ser 


Glu 


Leu 


Lys 


Ala 


Leu 


Axg 


Glu 










485 










490 










495 




Lys 


Leu 


Lys 


Pro 


Arn 


Leu 


Ser 


Arg 


Asp 


Leu 


Leu 


Arg 


Phe 


Lys 


I.ys 


Glu 








500 










505 










510 






He 


Thr 


Asn 


Tyr 


Leu 


Asn 


Arn 


Glu 


He 


Val 


Thr 


Axg 


Tyr 


Tyr 


Tyr 


Glu 






515 










5?0 










525 








Arg 


Gly 


scr 


He 


Arq 


Gin 


Ser 


Leu 


Pro 


Glu 


Asp 


Lys 


Val 


Val 


Lys 


Giu 




530 










535 










540 










Ala 


He 


Lys 


Leu 


Leu 


Lys 


Arp 


His 


Pro 


Glu 


Gin 


He 


Arg 


Gin 


lie 


Leu 


545 










550 










555 










560 


Ala 


Ala 


Pro 


Lys 


Ala 


GlU 


Asn 


Lys 


Gly 

















565 



(2) INFORMATION FOR SEQ ID tiO:423 

|i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 amino acids 
<R} TYPE: amino acid 
(D) TOPOLOGV: linear 

(li) MOLECULE TYPE: protein 

(ili) HYPOTHETICAL: YES 

(Vl ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ixj FEATURE i 

(A) NAME /KEY: misr:_f eature 
IB) LOCATION 1. . . 981 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 423* 



Het 


Gin 


Asn 


Lys 


Gly 


Phe 


Val 


He 


Val 


He 


Thr 


Ser 


Ala 


Leu 


Ala 


He 


1 








5 










10 










IS 




He 


Cys 


Ala 


Phe 


Tyr 


Leu 


Ser 


Phe 


Ser 


Phe 


Val 


Thr 


Asn 


Arg 


Tyr 


Glu 








20 










25 










30 






Lys 


Lys 


Ala 


Lys 


Ala 


Het 


Gly 


Asp 


Val 


Ala 


Gly 


HeL 


Ala 


Tyr 


I.Gll 


Asp 






35 










40 










45 








Ser 


Met 


Set- 


Asn 


GlU 


Lys 


Vai 


Trp 


Phe 


Giy 


Tyr 


Thr 


Leu 


Lys 


Glu 


Ala 




50 










55 








60 










Gin 


Ala 


Gin 


Gin 


He 


Gly 


Leu 


Gly 


Leu 


Asp 


Leu 


Lys 


Gly Gly 


Mot 


Asn 


65 










70 










7S 










80 


Val 


He 


Leu 


Lys 


Leu 


Asn 


Ala 


Ser 


Asp 


Leu 


Leu 


Arg 


Asn 


Leu 


Ser 


Asn 










85 










90 










95 




Lys 


Ser 


Leu 


Asp 


Pro 


Asn 


Phe 


Asn 


Lys 


Ala 


Leu 


Glu 


Asn 


Ala 


Ala 


Lys 








100 










105 










110 




Ser 


Thr 


Glu 


Gin 


Ser 


Asp 


the 


lie 


Asp 


He 


Phe 


Val 


Lys 


GlU 


Tyr 


Arg 






115 










120 










125 








Lys 


Leu 


Asp 


Pro 


Asn 


Gly 


Arg 


Leu 


Ala 


Val 


He 


Phe 


Gly 


Ser 


Gly 


Asp 




130 










135 










140 










Leu 


Arg 


Asp 


Gin 


He 


Thr 


Ala 


Lys 


Ser 


Thr 


Asp 


Ala 


Asp 


Val 


Val 


Arg 


145 










150 










155 










160 


T.*u 


Leu 


Lys 


Glu 


Lys 


Tyr 


A.^n 


Ser 


Ala 


Val 


Glu 


Ala 


Ser 


Phe 


A^n 


Val 










165 










170 










175 




Leu 


Arg 


Ala 


Arg 


lie 


Asp 


Ala 


Phe 


Gly 


Val 


Val 


Ala 


Pro 


Asn 


Leu 


Gin 








180 










185 










190 






Arg 


Leu 


Glu 


Gly 


Gin 


Gly 


Arg 


He 


Leu 


Val 


31u 


Leu 


Pro 


Gly 


Val 


Lys 






195 










200 










205 








Asp 


Pro 


Glu 


Arg 


Val 


Arg 


Thr 


Leu 


Leu 


Gin 


Arg 


Str 


Ala 


Asn 


Leu 


Gin 




210 










215 








220 










Phe 


Trp 


Arg 


Thr 


Tyr 


Lys 


Phe 


Glu 


Glu 


Val 


Ser 


Gly 


Asp 


Leu 


He 


Ala 


225 










230 










235 






240 


Aid 


Asn 


Asp 


Arg 


Leu 


Ser 


Glu 


Leu 


Ala 


Met 


Asn 


Asn 


Thr 


Asp 


Ala 


Thr 










245 










250 








255 




Pro 


Glu 


Thr 


Glu 


Pro 


Ala 


Thr 


Thr 


A^p 


Ser 


Val 


Ala 


Ala 


Thr 


Ala 


Arp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



260 



Ser 


Ala 


Ala 
27 5 


Val 


Gin 


Ala 


Val 


Glu 


Ala 
290 


Lys 


Asp 


Ala 


Thr 


Arg 
295 


Pro 


,r al 


Asn 


Arg 


Gly 


Gly 


Ala 


305 










310 




Met 


Ala 


Gin 


Tie 


Ser 
325 


Glu 


Met 


Thr 


Arq 


Glu 


Asp 
340 


Val 


Lfcu 


Phe 


Pro 


Glu 


Thr 

355 


i.ys 


Lys 


Glu 


Thr 


Thr 


Asn 
37 0 


Arg 


Thr 


Gl y 




Pro 
375 


Ala 


Lys 


Scr 


ASp 


Ile 


Gin 


Asn 


335 










390 




Ser 


Met 


Thr 


Met 


Asn 
405 


Glu 


Glu 


Lys 


Asp 


Asn 


Val 
420 


Gly 


Arg 


Ala 


Tyr 


i'er 


Ala 
4 35 


Pro 


Asn 


Vol 


Asn 


He 


Ser 
450 


Gly 


HI:? 


Phe 


Thr 


Val 
455 


Leu 


Asn 


Ser 


Gly 


Lys 


l!et 


Asp 


-165 










470 




Val 


He 


Gly 


Pro 


Thr 
405 


Leu 


Gly 


Ser 


Ph^ 


Leu 


Leu 
500 


Ala 


Leu 


Val 


Tyr 


Gly 


Phe 
515 




Pro 


Gly 


Leu 


Ser 


Phe 

530 


Phe 


Thr 


Leu 


Gly 


Val 
535 


Leu 


Ser 


Gly 


He 


Ala 


Gly 


Leu 


545 










550 




Ala 


Asn 


Val 


Leu 


He 
565 


Phe 


Glu 


Lys 


Thr 


Pro 


Tie 
580 


Arg 


Ala 


Val 


Ala 


He 


Phe 
595 


Asp 


Ser 


Asn 


Val 


Phe 


Leu 
610 


Tyr 


Gly 


Thr 


Gly 


Pro 
615 


He 


Gly 


Leu 


He 


Ala 


Ser 


Phe 


625 










630 




Val 


Phe 


Glu 


Lys 


Leu 
645 


Ala 


L>s 


Thr 


Thr 


Stir 


He 
660 


Thr 


Arg 


Asn 


Leu 


Gly 


Lys 
675 


Arg 


Lys 


Thr 


Gly 


Leu 


Gly 
S90 


Leu 


lie 


Ala 


Ser 


Phe 
695 


Phe 


Ser 


Gly 


Gly 


Arg 


Asn 


Tyr 


705 










710 




Sen 


GlU 


Ala 


Val 


Arg 

725 


Ser 


Ala 


Leu 


Val 


Thr 


ser 
740 


lie 


Gly 


Thr 


Asn 


Tyr 


Lyt 
755 


He 


Gin 


Glu 


Glu 


Asp 


Lyz 
770 


Leu 


Tyr 


Gin 


Ser 


Leu 
77b 


Ala 


Asp 


Gin 


Phe 


Leu 


As p 


Asn 


785 










790 




Ser 


I1et 


Ser 


Ser 


Asp 
005 


He 


Thr 


Ser 


Met 


Hr 


Phe 
820 


Met 


Ala 


He 


Set 


Phe 


Ser 
835 


Ala 


Gly 


Val 


Phe 


He 


He 
BSO 


Ala 


Leu 


Tyr 


Ala 


Leu 
855 



360/490 





265 










270 






Ala 


Asp 




Ala 


Thr 


Val 


Ala 


Gin 


Lys 


280 










285 








Lys 


Asp 


Ala 


Leu 


Phe 


Ser 


Leu 


Leu 


Thr 










300 










Val 


Val 


Gly 


Val 


Ala 


Arg 


Arg 


Ala 


Asn 








315 










320 


Leu 


Gin 


Gin 


Ala 


His 


Asp 


Leu 


Lys 


Val 






330 










335 




Lou 


Trp 


Gly 


Ala 


Lys 


Ala 


He 


Glu 


Asp 




345 










350 






Asp 


Leu 


Tyr 


Glu 


Leu 


Tyr 


Ala 


He 


Arg 


360 










365 








Asp 


Leu 


Gly 


Gly 


Asp 


Val 


Val 


Thr 


Ser 










380 










Asp 


Phe 


Gly 


Arg 


Ser 


Glu 


Pro 


He 


Val 








395 










400 


Gly Al=» 


Arg 


Lys 


Trp 


Ala 


Arg 


He 


Thr 






410 










415 




He 


Ala 


He 


Val 


Leu 


Acp 


GJy 


Val 


Val 














4 30 






P 


Gl Ll 


He 


Thr 


Gly 


Gly 


Arg 


Ser 


Gin 


4 40 










445 








Glu 


V? u 


Al* 


Gly 


A^p 


Leu 


Ala 


Asn 


Val 










460 










Ala 


Thr 


Val 


$et 


He 


Glu 


Gin 


Glu 


Asn 








475 










480 


Ala 


Glu 


Ser 


He 


Lys 


Ala 


Gly 


Phe 


Leu 






490 










495 




He 


Leu 


Met 


Cys 


Tyr 


Met 


Cys 


Leu 


Ala 




505 










510 






lie 


Ala 


Asn 


Gly 


Ala 


Leu 


He 


Val 


Asn 


520 










525 








Leu 


Ala 


Ser 


Phe 


His 


Ala 


Val 


Leu 


Thr 










54 0 










Val 


Leu 


Thr 


Leu 


Gly 


Met 


Ala 


Val 


Asp 








555 










560 


Arg 


He 


Lys 


Glu 


Glu 


Leu 


Arg 


Ala 


Gly 






570 










575 




Thr 


Asp 


Gly 


Tyr 


Gly Asn 


Ala 


Phe 


Ser 




585 










590 






Thr 


Thr 


He 


lie 


Thr 


Gly He 


He 


Leu 


600 










605 








He 


Arg 


Gly 


Phe 


Ala 


Thr 


Thr 


Leu 


He 






620 










He 


Thr 


Ala 


Val 


Phe 


Leu 


Thr 


Arg 


He 








635 










64 0 


Lys 


Gly 


Arg 


Leu 


Asp 


Lys 


He 


Thr 


Phe 






650 










655 




Leu 


Leu 


Val 


Asn 


Pro 


Ser 


Tyr 


Asn 


He 




665 










670 






Phe 


He 


He 


Pro 


Val 


He 


He 


He 


Val 


680 










685 








Thr 


He 


Gly 


Leu 


Asa 


Arg 


Gly 


He 


Glu 










700 










Val 


Val 


Lys 


Phe 


Asp 


Gin 


Pro 


Val 


Ser 








715 










720 


Leu 


Ser 


Ser 


Pro 


Leu 


Gin 


Glu 


Lys 


Val 






730 










735 




Glu 


Gly 


Thr 


Glu 


Val 


Arg 


He 


Ser 


Thr 




745 










750 






Ser 


Glu 


Glu 


Tht 


Glu 


Ala 


Glu 


He 


Thr 


760 










7 65 








Lys 


Gly 


Phe 


Tyr 


Thr 


Gin 


Gin 


Pro 


Thr 










7B0 










He 


lie 


Ser 


Ser 


Gin 


Lys 


Val 


Ser 


Pro 








795 










800 


Arg 


Gly 


Ala 


He 


Trp 


Ala 


Val 


Leu 


Leu 






810 










815 




Tyr 


He 


Leu 


He 


Arg 


Phe 


Arg 


Asp 


He 




825 










830 






Val 


Ser 


Val 


Ala 


Ala 


Thr 


Thr 


Phe 


Cys 


840 










845 








Leu 


Trp 


Lys 


lie 


Leu 


Pro 


Phe 


Thr 


Met 



860 
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Glu lie Asp Gin Asn Phe lie Ala Ala lie Leu Ala lie 11a Gly Tyr 
865 870 875 889 

Ser Leu Asn A.^p Thr Val Val Val Phe Asp Arg lie Arg Glu Thr Met 
885 890 995 

5 Lys Leu Tyr Pro Asn Arg Asp Arq Tyr Gin Val lie Asn Asp Ala Leu 
900 905 91Q 

Asn Ser Thr Leu Gly Arg Thr Leu Asn Thr Ser Leu Thr Thr Phe He 

915 920 925 

Val IIet Leu Val lie Phe lie Phe Gly Gly Ala Thr Met Arg Ser Phe 
10 930 935 940 

Thr Phe Ser He L*u Leu Gly He Val lie Gly Thr Tyr Ser Thr Leu 
945 950 955 960 

Phe Val Ala The Pro Leu Ala Tyr Glu He Gin Lys Arg Lys Leu Asn 
965 97 0 975 

15 Lys Ala Ala Lys Lys 
980 

(2) INFORMATION FOR SEQ ID NO: 4 24 

20 ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1017 amino acids 
(B> TYPE: amino acid 
(O) TOPOLOGY: linear 

25 <ii) HOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 
30 (A) ORGANISll: Porphyroroonas gingival!:* 

(ix> FEATURE: 

(A) NAME/ KEY: mi?c_f satuce 
{ B) LOCATION 1...1017 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(Xl) SEQUENCE DESCRIPTION: SEQ ID HO: 424 



Met 


Lys 


Arg 


Met 


Thr 


Leu 


Phe 


Phe 


Leu 


Cys 


Leu 


Leu 


Thr 


Ser 


He 


Gly 


1 








5 










10 










15 




Trp Ala 


Met 


Ala 


Gin 


Asn 


Arg 


Thr 


Val 


Lys 


Gly 


Thr 


Val 


He 


Ser 


Ser 








20 










25 










30 






Glu 


Asp 


Asn 


Glu 


Pro 


Leu 


He 


Gly 


Ala 


Asn 


Val 


Val 


Val 


Val 


Gly Asn 






35 










40 










45 








Thr 


Thr 


tie 


Gly 


Ala 


Ala 


Thr 


Asp 


Leu 


Asp 


Gly 


Asn 


Phe 


Thr 


Leu 


Ser 




50 










55 










60 










Val 


Pro 


Ala 


Asn 


Ala 


Lys 


H^t 


Leu 


Arg 


Val 


Ser 


Tyr 


Ser 


Gly 


Met 


Thr 


65 










70 










75 










8C 


Thr 


Lys 


Glu 


Val 


Ala 


He 


Ala 


Asn 


Val 


Met 


Lys 


He 


Val 


Leu 


Aep 


Pro 










85 










90 










95 




Asp 


Ser 


Lys 


Val 


Leu 


Glu 


Gin 


Val 


Val 


Val 


Leu 


Gly 


Tyr 


Gly Thr 


Gly 








100 










105 










110 






Gin Lys 


Leu 


Ser 


Thr 


Val 


Ser 


Gly 


Ser 


Val 


Ala 


Lys 


Val 


Ser 


Ser 


Glu 






115 










120 










125 








Lys 


Leu 


Ala 


Glu 


Lys 


Pro 


Val 


Ala 


Asn 


He 


Met 


Asp 


Ala 


Leu 


Gin 


Gly 




130 










135 










140 








Gin 


Val 


Ala 


Gly 


Met 


Gin 


Val 


Met 


Thr 


Thr 


Ser 


Gly 


Asp 


Pro 


Thr 


Ala 


145 










150 










155 










160 


Val 


Ala 


Ser 


Val 


Glu 


He 


His 


Gly 


Thr 


Gly 


Ser 


Leu 


Gly 


Ala 


Ser 


Set 










165 










170 










175 




Ala 


Pro 


Leu 


Tyr 


He 


Val 


Asp 


Gly 


Het 


Gin 


Thr 


Ser 


Leu 


Asp Val 


Val 








i en 










185 










190 






Ala 


The 


Het 


Asn 


Pro 


Asn 


Asp 


Phe 


Glu 


Ser 


llet 


Scr 


Val 


L-u 


Lys 


Asp 






195 










200 










205 






Ala 


Scr 


Ala 


Thr 


Ser 


He 


Tyr 


Gly 


Ala 


Arg 


Ala 


Ala 


Asn 


Gly Val Val 




210 










215 








220 










Phe 


He 


Gin 


Thr 


Lys 


Lys 


Gly 


Lys 


list 


Ser 


Glu 


Arg 


Gly 


Arg 


He 


Thr 


22S 










230 










235 










240 


Phe 


Asn 


Ala 


Ser 


Tyr 


Gly 


He 


Ser 


Gin 


He 


Leu 


Asn 


Thr 


Lyr 


Pro 


Leu 










245 










250 








255 




Asp 


Asn 


Met 


net 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Afp 


Phe 


Gin 


Val 


Lys 


Ala 








260 










265 










27 C 






Gly 


Phe 


Trp 


Gly 


Asn 


Asn 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lys 


Asp 


Met 


He 






275 










280 








285 








Leu 


Ala 


Gly 


Ala 


Glu 


Asp 


Leu 


Tyr 


Gly 


Asn 


Tyr 


Asp 


Ser 


Leu 


Lys 


Asp 




2?n 










295 










300 
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Glu 


Tyr 


f.ly 


Lys 


Thr 


Leu 


Phe 


Pro 


305 










310 






Trp 


Leu 


L^i* 


Al a 


Leu 


Phe 


Lys 


Thr 










325 








Ser 


Ph?* 


Ser 


Gly 


Gly 


Ser 


Gin 


Gly 


















Tyr 


Phe 


Asp 


Gin 


Glu 


Gly 


Het 


Ala 






355 










360 


Tyr 


Ser 


Gly 


Arg 


Leu 


Asn 


Phe 


Glu 




370 










375 




Vol 


Gly 


Ala 


Asn 


Leu 


Ser Gly Ala 


3es 










330 






Tyr 


Phe 


Gl y 


Lys 


Tyr 


Tyr 


Met 


Gly 










4 05 








Me L 


Pro 


Arg 


Tyr 


Tyr 


Asn 


Pro 


?ho 








420 










Val 


Tyr 


Tyr 


Met 


T\r 


Gly 


Ala 


Thr 
















440 


Phe 


Ala 


Lye 


I1?t 


Arq 


Pro 


Phe 


Ser 




450 










455 




A~n 


Gl y 


Phe 


Ala 


Gin 


He 


Thr 


Pro 


465 










470 






Gin 


Ala 


Gly 


Val 


Acp 


He 


Thr 


Asn 










485 








Pro 


Asn 


Asn 


Pro 


Tyr 


Asp 


Ser 


Thr 








500 










Ala 


Tyr 


Ary 


Asp 


Val 


Ser 


Ly* 


Ser 






SIS 










520 


Pho 


ser 


lie 


Asp 


Glu 


Lye 


His 


Asp 




530 










535 




Tyr 


lie 


Glu 


Tyr 


Glu 


Gly 


Asp 


Val 












550 






Glu 


Ser 


Asp 


Lys 


Leu 


11st 


Leu 


Leu 










565 








Leu 


Ser 


l.eu 


Pro 


Glu 


His 


Arg 


Val 








580 








phe 


Ser 


Arg 


Phe 


A3n 


Tyr 


Gly 


Phe 






595 










600 


Sec 


Val 


Arg 


A*?n 


Asp 


Gin 


Ser 


Ser 




610 










615 




Ala 


Trp 


Phe 


Tyr 


Ser 


Val 


Gly Gly 


625 










530 






lie 


Gin 


Glu 


Ser 


Asn 


Trp Leu 


Ser 










645 








Gl y 


Thr 


Thr 


Gly Asn 


Ser 


Glu 


He 








660 










Val 


Thr 


Val 


Asn 


Asn 


Tyr 


Thr 


Glu 






67 5 










600 


The 


Al a 


Gly 


Asn 


Pro 


Aep 


Leu 


Ser 














695 




Phe 


Gly 


Lett 


Ala 


Ala 


Gly Ala 


Phe 


70b 










710 






A*p 


Phe 


Tyr 


V*l 


Arg 


Thr 


Thr 


Asrv 










725 








Pro 


Tyr 


lie 


Ser 


Gly 


Phe 


Phe 


S~r 








740 










Ly* 


Asn 


Thr 


Gly Val 


Asp 


Leu 


Ser 






755 










760 


Lys 


Asp 


Trp 


Asn 


Val 


Tyr 


Ala 


Ser 




77 0 










775 




Glu 


He 


Thr 


Lys 


Leu 


Fhe 


Phe 


Gly 


785 










790 




Thr 


Gly 


Thr 


He 


Trp 


Glu 


He 


Gly 










805 






Glu 


Tyr 


Ala 


Gly 


Tie 


Asp 


Lys 


Lys 








820 










Pro 


Gly 


Gin 


Val 


Asp 


Al3 


Asp 


Gly 






835 










840 


Ser 


Ala 


Aep 


Leu 


Glu 


Thr 


Arg 


lie 




650 










855 




Thr 


Gly 


Gly 


Phe 


Ser 


Leu 


Cly Ala 


865 










870 






Ala 


Asp 


Phe 


Ala 


Tyr 


He 


Val 


Gly 










885 








Tyr 


Phe 


Thr 


Glu 


Asn 


Ala 


Gly Gly 



362/ 490 



Val 


Asp 


Phe 


Affn 


His 


Asp 


Ala 


Asp 






315 










320 


Ala 


Pro 


Thr 


Ser 


Gin 


Gl y 


Asp 


He 




330 










335 




Thr 


Ser 


Tyr 


Tyr 


Ala 


Ser 


He 


Gly 


345 










350 






Arg 


Glu 


Pro 


Ala 


Asn 


Phe 


Lys 


Ary 










365 








Ser 


Arg 


He 


Asn 


Glu 


Trp 


Leu 


Lys 








380 










He 


Ala 


Asn 


Arg 


Arg 


Ser 


Ala 


Asp 






395 










400 


Ser 


Gly 


Thr 


Phe 


Gly 


Val 


Leu 


Thr 




410 










415 




Aep 


Val 


Asn 


Gly 


Asp 


Leu 


Ala 


Aep 


425 










430 






Arg 


Pro 


Sol 


IteL 


Thr 


GJ U 


Pro 


Tyr 










445 








Ser 


Glu 


Ser 


His 


Gin 


Ala 


Asn 


Val 








4 60 










He 


Lys 


Gly 


Leu 


Thr 


Lou 


I.ys> 


Ala 






475 










480 


Thr 


Arq 


Thr 


Ser 


Ser 


Lys 


Arg 


M«t 




4 9C 










4?5 




Pro 


Leu 


Gly 


Glu 


Arg 


Atg 


Glu 


Arg 


505 










510 






Phe 


Thr 


Asn 


Tnr 


Ala 


Glu 


Tyr 


Lys 










525 








Leu 


Thr 


Ala 


Leu 


/let 


Gly 


His 


Glu 








540 










He 


Gly 


Ala 


Ser 


Ser 


Lys 


Gly 


Phe 






555 










560 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly Asn 


Ser 




570 










57 5 




Ala 


GlU 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


585 










590 






Asp 


Lys 


Trp 


Met 


Tyr 


He 


Asp 


Phe 










605 








Arg 


Phe 


Gly 


Ser 


Asn 


A&n 


Arg 


Ser 








620 










Met 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 






63b 










640 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser Tyr 




650 










655 




Gly Asn 


Tyr 


Asn 


His 


Gin 


Ala 


Leu 


665 










670 






A^p 


Ala 


f let 


Gly 


Leu 


Ser 


He 


Ser 










685 








Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 


















Asn 


Asn 


Arg 


Leu 


Sei 


Ala 


Glu 


Val 






■us 










720 


Asp 




Leu 


He 


Acp 


Val 


Pro 


llet 




730 










735 




Gin 


Tyr 


Gin 


Asn 


Val 


Gly Ser 


Met 


745 










750 






Leu 


Lys 


Gly 


Thr 


He 


Tyr 


Gin 


Asn 










765 








Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 








700 










Leu 


Asn 


Lys 


Tyr 


I let 


Leu 


Pro 


Asn 






795 










800 


Tyr 


Pr? 


Asn 


Ser 


Phe 


Tyr Met 


Ala 




810 










015 




Thr 


Gly 


Lys 


Gin 


Leu 


Trp Tyr 


Val 


825 










B30 






Asn 


Lys 


Val 


Thr 


The 


Ser 


Gin 


Tyr 


Asp 


Lys 


Ser 


Val 


845 
Thr 


Pro 


Pro 


He 








860 










Ser 


Trp 


Lye 


ciy 


Leu 


Ser 


Leu 


Asp 






875 










080 


Lys 


Trp 


Met 


He 


Asn 


Asn 


Asp 


Arg 




830 










8 95 




Leu 


Met 


Gin 


Leu 


A^Ji 


Lys 


As-p 


Lys 
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900 905 910 

I let Leu l.rn Asn Ala Trp Thr Glu Asp Asn Lys Glu Thr Asp Val Pro 

915 920 925 

Lys Leu Gly Gin Sei Pro Gin Phe Asp Thr His Leu Leu Glu Asn Ala 
5 930 935 940 

Ser Phe L»u Arg Leu Lys Asn Leu Lys Leu Thr Tyr Val Leu Pro Asn 
945 950 955 960 

Ser Leu Phe Ala Gly Gin Asn Val lie Gly Gly Ala Arg Val Tyr Leu 
965 970 975 

10 tlet Ala A rn Asn Leu Leu Thr Val Thr Lys Tyr Lys Gly Ph~ Asp Pro 
980 985 990 

Glu Ala Gly Gly Asn Val Gly Lys Asn Gin Tyr Pro Asn Ser Lys Gin 

995 1000 1005 

Tyr Val Ala 'i)y He Gin Leu Ser Phe 
15 1010 1015 

(2) IHFORtlATION FOR SEQ ID HO: 4 25 

(i) SEQUENCE CHARACTERISTICS: 
20 tAl LENGTH: 1014 amino acid.? 

(DJ TYPE : amino acid 
ID) TOPOLOGY: linear 



25 



30 



45 



50 



55 



60 



65 



70 



75 



<ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAN l Si t : e<=rphyromanas gingival ic 



{ ijc ) FEATURE : 

(A) NAT IE /KEY: misc_f eat u re 

(B) LOCATION 1...1014 

35 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 425 



40 



Met 


Thr 


Leu 


Phe 


Phe 


Leu 


Cys 


Leu 


Leu 


Thr 


Ser 


He 


Gly 


Trp 


Ala 


Het 


1 








5 










10 








15 




Ala 


Gln 


Asn 


Arq 


Thr 


Val 


Lys 


Gly 


Thr 


Val 


He 


Ser 


Ser 


Glu Asp 


Asn 








20 










25 










30 






Glu 


Pro 


Leu 


He 


Gly 


Ala 


Asn 


Val 


Val 


Val 


Val 


Gly Asn 


Thr 


Thr 


He 






35 










40 










45 








Gly 


Ala 


Ala 


Thr 


Asp 


Leu 


Asp 


Gly 


Asn 


Phe 


Thr 


Leu 


Ser 


Val 


Pro 


-Ala 




50 










55 










60 










Asn 


Ala 


Lys 


ifet 


Leu 


Arg 


Val 


Ser 


Tyr 


Ser 


Gly 


Met 


Thr 


Thr 


Lys 


Glu 


65 










70 










75 










80 


Val 


Ala 


He 


Ala 


Asn 


Vol 


Met 


Lys 


He 


Val 


Leu 


Asp 


Pro 


Ar»p 


Ser 


Lys 










85 










9C 








95 




Val 


Leu 


Glu 


Gin 


Val 


Val 


Val 


I.C-U 


Gly 


Tyr 


Gly 


Thr 


Gly 


Gin 


L ys 


Leu 








100 










105 










110 






Ser 


Thr 


Val 


Ser 


ciy 


ssr 


Val 


Ala 


Lys 


Val 


Ser 


Ser 


Glu 


Lys 


Leu 


Ala 






115 










120 










125 






Glu 


Lys 


Pro 


Val 


Ala 


Asn 


He 


Met 


Asp 


Ala 


Leu 


Gin 


Gly 


Gin 


Val 


Ala 




130 










135 










140 








Gly 


He I 


Gin 


Val 


n~t 


Thr 


Thr 


Ser 


Gly 


Asp 


Pio 


Thr 


Ala 


Val 


Ala 


Ser 


145 










150 










155 










160 


Val 


Glu 


He 


His 


Gly 


Thr 


Gly 


Ser 


Leu 


Gly 


Ala 


Ser 


Ser 


Ala 


Pro 


Leu 










165 










170 










175 




Tyr 


He 


Val 


Asp 


Gly 


(let 


Gin 


Thr 


Ser 


Leu 


Asp 


Val 


Val 


Ala 


Thr 


Met 








180 










185 










190 






Asn 


Pro 


Asn 


Asp 


Phe 


Glu 


Ser 


Met 


Ser 


Val 


Leu 


Lys 


Asp 


Ala 


Ser 


7vla 






195 










200 










205 








Thr 


Ser 


He 


Tyr 


Gly 


Ala 


Arg 


Ala 


Ala 


Asn 


Gly 


Val 


Val 


Phe 


He 


Gin 




210 










215 










220 










Thr 


Lys 


Lys 


Gly 


Lys 


Met 


Ser 


Glu 


Arg 


GLy 


Arg 


He 


Thr 


Phe 


Asn 


Ala 


225 










230 










235 










240 


Ser 


Tyr 


01 y 


He 


Ser 


Gin 


He 


Leu 


Asn 


Thr 


Lys 


Pro 


Leu 


Asp 


Asn 


Met 










245 










250 










255 




Met 


Thr 


Gly 


Asp 


Glu 


I.eu 


Leu 


Asp 


Phe 


Gin 


Val 


Lys 


Ala 


Gly 


Phe 


Trp 








260 










265 










27 0 






Gly Asn 


Asn 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lys 


Asp 


Met 


He 


Leu 


Ala 


siy 






275 










280 










285 






Ala 


Glu 


Asp 


Leu 


Tyr 


Gly 


Asn 


Tyr 


Asp 


Ser 


Leu 


Lys 


Asp 


Glu 


Tyr 


Gly 




290 










295 










300 










Lys 


Thr 


Leu 


Phe 


Pro 


Val 


Asp 


Phe 


A.*n 


His 


Asp 


Ala 


Asp 


Trp 


Leu 


Lys 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



3Cb 










310 










315 










320 


Ala 


Leu 


Phe 


Lys 


Thr 


Ala 


Pro 


Thr 


Ser 


Gin 


Gly 


Acp 


Tie 


Ser 


Phe 


Ser 










325 










330 










335 




111 J 


Giv 


Ser 


Gin 




Thr 


Se r 


_ 

i yr 


Tyr 


Ala 


Ser 


lie 


Gly Tyr 


Phe 


sp 








340 










345 










3b0 






Gin 


Glu 


Gly Met 


Ala 


Ar 


Glu 


Pro 


Ala 


Aen 


Phe 


i .ys 


Arg 


Tyr 


Ser 


Gl>* 






355 










360 










365 








A a 


L*»u 




Phe 


Glu 


Ser 


Arg 


He 


Asn 


Glu 


Trp 


Leu 


Lys 


Val 


Gl v 


Ala 




370 










37 5 










380 










Asn 




Ser 


Gly 


Ala 


He 


Ala 




Arg 


Arg 


Ser 


Ala 


Asp 


Tyr 


Phe 


Gly 


385 










3 90 










395 










4 00 


Ly~ 




Tyr 


Met 


Gly 


Ser 


Gly 


Thr 


Phe 


Gly Val 


Leu 


Thr 


Met 


Pro 


Arg 










405 










410 










4 15 






Tvr 


Asn 


Pro 


Phe 


As p 


Val 


Asn 


Gly Asp 


Leu 


Ala 


Asp Val 


Tvr 


Tvr 








420 










425 










430 






I let 


Tyr 


Gly Ala 


Thr 


Aro 


Pro 


Ser 


Met 


Thr 


GlU 


Pro 


Tyr 


Phe 


Ala 


Lys 






435 










440 










445 








M & t 


Arg 


Pro 


Phe 


Ser 


Ser 


Glu 


Ser 


His 


Gin 


Ala 


Asn 


Val 


Asn 


Gly 


Phe 




4 50 










4 55 










460 










Ala 


Gin 


He 


Thr 


ro 




Lys 


Gly 


Leu 


Thr 


Leu 


Lys 


Ala 


Gin 


Ala 


Gly 












470 










4 75 












Va JL 




He 


Thr 


Aen 


Th r 




Thr 


Ser 


Ser 


Lys 


Arg 


Met 


Pro 


Asn 


Asn 










4 85 










4 90 










4 95 




Pro 


iir 


Asp 


S?r 


Thr 


Pro 


Leu 


Gly 


Glu 


Arg 


Arg 


Glu 


Arg 


Ala 


Tyr 










500 










505 










510 






A«D 


Val 


Ser 


Lys 


Ser 


Phe 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Lys 


Phe 


Ser 


He 






S15 










520 










525 








As 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 


Leu 


Het 


Gly 


His 


Glu 


Tyr 


He 






530 










535 










540 










( yr 


Glu 


Gly 


Asp 


Val 


He 


bl V 


Ala 


Ser 


er 


Lys 


Gly 


Phe 


Glu 


Ser 


Asp 


54 5 










550 










5tb 










560 


Lys 




Hst 


Leu 


Leu 


Ser 


Gl n 


Gly 


Lys 




Gly Asn 


Ser 


Leu 


Ser 


L**u 










565 










0 










575 




Fro 




Hi^ 


Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arg 








580 










S8b 










590 






Ph 0 


Asn 


Tyr 


Gly 


Phe 


Asp 


Lys 


Trp 


Met 


iyr 


He 


Asp 


Phe 


Ser 


Val 


g 






595 










600 










605 








Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly Ser 


Asn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 




610 










615 










620 










Tyr 


Ser 


Val 


Gly 


Gly 


Met 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 


He 


Gin 


Glu 


625 










630 










635 










640 


Ser 


Asn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser 


Tyr Gly Thr 


Thr 










645 










650 










655 




Gly 


Asn 


Ser 


Glu 


He 


Gly 


Asn 


Tyr 


Asn 


His 


Gin 


Ala 


Leu 


Val 


Thr 


Val 








660 










665 










670 






Asn 


Asn 


Tyr 


Thr 


Glu 


Asp 


Ala 


Het 


Gly 


Leu 


S=r 


He 


Ser 


Thr 


Ala 


Gly 






67 S 










600 










685 








Asn 


Pro 


Asp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Aen 


Phe 


Gly 


Leu 




690 










695 










"00 










Ala 


Ala 


Gly 


Ala 


Phe 


Asn 


Asn 


Arg 


Leu 


Ser 


Ala 


Glu 


Val 


Asp 


Phe 


Tyr 


705 










710 










715 










720 


Val 


Arg 


Thr 


Thr 


Asn. Asp 


Ust 


Leu 


He 


Asp 


Val 


Pro 


Met 


Pro 


Tyr 


He 










725 










730 










735 




Ser 


Cli- 


Phe 


Phe 


Ser 


Gin 


Tyr 


Gin 


Asn 


Val 


Gly Ser 


Met 


Lys 


Asn 


Thr 








740 










745 










750 






Gly 


val 


Asp Leu 


Ser 


Leu 


Lys 


Gly Thr 


He 


Tyr 


Gin 


Asn 


Lys 


Asp 


Trp 






755 










760 










7 65 








Asn 


Val 


Tyr 


Ala 


Ser 


Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 




770 










775 










780 










Lys 


Leu 


Phe 


Phe 


Gly Leu 


Asn 


Lys 


Tyr 


Met 


Leu 


Pro 


Asn 


Thr 


Gly 


Thr 


785 










7 90 










795 










800 


He 


Trp 


Glu 


lie 


Gly 


Tyr 


Pro 


Asn 


Ser 


Phe 


Tyr 


Het 


Ala 


Glu 


Tyr 


Ala 










805 










810 










815 




Cli- 


lis 


Asp 


Lys 


Lys 


Thr 


Gly 


Lys 


Gin 


Ltiu 


Trp Tyr 


Val 


Pro 


Gly 


Gin 








820 










825 










830 






val 


Atrp 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


Thr 


Thr 


Ser 


Gin 


Tyx 


Ser 


Ala 


Asp 






835 










840 










045 








Leu 


Glu 


Thr 


Arg 


He 


Asp 


L,s 


Ser 


Val 


Thr 


Pro 


Pro 


He 


Thr 


Gly 


Gly 




850 










855 










860- 








Phe 


Ser 


Leu 


Gly 


Ala 


Ser 


Trp 


Lys 


Gly 


Leu 


Ser 


Leu 


Asp 


Ala 


Asp 


Phe 


865 










870 










07 S 










880 


Ala 


Tyr 


He 


Val 


Gly Lys 


Trp 


Het 


He 


Asn 


Asn 


Asp 


Arg 


Tyr 


Phe 


Thr 










885 










890 










895 




Glu 


Asn 


Ala 


Gly 


Gly 


Leu 


Met 


Gin 


Leu 


A^n 


Lys 


Asp 


Lys 


Met 


Leu 


Leu 








900 










905 










910 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



365/ 490 



Pi£T\ Ala Trp Thr Giu Asp Asn Lys Glu Thr Asp Val Pro Lys Leu Gly 

915 920 92S 

Gin Ser Pro Gin Phe Asp Thr His Leu Leu Glu A^n Ala Ser Phe Leu 
930 935 ?<10 

5 Arg Leu l.yr Asn Leu Lys Leu Thr Tyr Val Leu Pro Asn Ser Leu Phe 
945 950 955 960 

Ala Gly Gin Asn Val lie Gly Gly Ala Arg Val Tyr Leu Met Ala Arg 

965 970 «7S 

A&n Lou Leu Thr Val Vhr Lys Tyr Lys Gly Phe Ajtd Pro Glu Ala fily 
10 960 985 990 

Gly Asn Val GI3 Lys Asn Gin Tyr Pro Asn Ser Lys Gin Tyr Val Ala 

995 1000 1005 

Gly lie Gin Leu Ser Phe 
1010 



15 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) III FORI IATIOU FOR SEQ ID HO: 4 26 



Ul SEQUENCE CHARACTERISTICS: 

(Al LEMGT H ; 321 amino acids 
20 (PJ TYPE: amino acid 

ID} TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

25 liii) HYPOTHETICAL: YES 

t vi > OR1G1 HAL SOURCE : 

(A) ORGANISM: Porphyromonas gingival 

30 ti:-:) FEATURE: 

(A) IfAHE/KEY : miscjeature 

(B) LOCATION 1 . . . B21 



<xi) SEQUENrE DESCRIPTION: SEQ ID NO: 4 26 



llet 


Lys 


Lys 


Lys 


Asn 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Phe 


Val 


Ala 


Leu 


Leu 


1 








5 










10 










15 




Thr 


Phe 


He 


G) y Ser Met 


Gin 


Ala 


Gin 


Gin 


Ala 


Lys 


Asp 


Tyr 


Phe 


Asn 








20 










25 










30 






Phe 


Asp 


'Jlu 


Arg Gly Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lye 


Val 


Pro 


Asp 


Arg 






35 










40 










45 








Ala 


Val 


Leu 


Gin 


Glu 


Lsu 


Ala 


Leu 


lie 


Met 


Ser 


He 


Asp 


Glu 


Phe 


Asp 




50 










55 










60 










Pro 


Val 


Thr 


Asn 


Glu 


Ala 


He 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu 


Glu 


Phe 


Glu 


65 










70 










75 










80 


Ala 


Phe 


Leu 


Arg 


Tyr 


Gly 


Leu 


Lys 


Pro 


Thr 


Phe 


Leu 


Thr 


Pro 


Pro 


Ser 










85 










90 










95 




Not 


Gin 


Arg 


Ala 


Val 


Glu 


Met 


Phe 


Asp 


Tyr 


Arg 


Ser 


Gly 


Glu 


Lys 


Tyr 








100 










105 










110 






Glu 


Trp 


Aen 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


He 


Ser 


Met 


Ke!. 


Glu 






115 










120 










125 








Glu 


Phe 


Gin 


Thr 


Lys 


Tyr 


Pro 


Ser 


Leu 


Cys 


Thr 


Thr 


Ser 


Val 


He 


Gly 




130 










135 










140 










Lys 


Ser 


Val 


Lys 


Asp 


Arg 


Lys 


Leu 


Met 


lie 


Cys 


Lys 


Leu 


Thr 


Ser 


Ser 


145 










150 










155 










160 


Ala 


Asn 


Thr 


Gly Lys 


Lys 


Pro 


Arg 


Val 


Leu 


Tyr 


Thr 


Ser 


Ihr 


Met 


His 










165 










nc 










175 




Gly 


Asp 


Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


He 


Asp 


Hi-*! 








180 










185 










190 






I.cu 


Leu 


Ser 


Asn 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


He 


i.yr 


Asm 


He 


Leu 


Asp 






195 










200 










205 








Lys 


Thr 


G"i 0 


Val 


Trp 


lie 


Cys 


Pro 


Leu 


Thr 


Asn 


Pro 


Asp 


Gly Ala 


Tyr 




210 










215 










220 










Arg 


Ala 


Gly 


Acn 


His 


Thr 


Val 


Gin 


Gly 


Ala 


Thr 


Arg 


Tyr 


Asn 


Ala 


Asn 


225 










230 










235 








240 


Asn 


V*l 


Arp 


Leu 


Asn 


Arg 


Asn 


Phe 


Lys 


Asp 


Asp 


VdI 


Ala 


Gly 


Asp 


Hxs 










24b 










25U 










2 55 




Pro 


Asp Gly 


Lys 


Pro 


Trp 


Gin 


Pro 


Glu 


Ala 


Thr 


Ala 


Phe 


Het 


As>p 


IiC J 








260 










265 










270 






Glu 


Gly 


Afn 


Thr 


Ser 


Phe 


Val 


Leu 


Gly 


Ala 


Asn 


He 


His 


Gly 


Gly 


Thr 






275 










280 








285 






Glu 


Val 


Val 


Asn 


Tyr 


Pro 


Trp 


Asp 


Asn 


Lys 


Lys 


Glu 


Arg 


His 


Ala 


A^-p 




290 










295 










300 










Asp 


Glu 


Trp 


Tyr 


Lys 


Leu 


lie 


Ser 


Arg 


A*n 


Tyr 


Ala 


Ala 


Ala 


Cys 


Gin 


305 










310 










315 








320 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



366/490 



Set 


lie 


Ser 


Al a 


Ser 


Tyr 


Met 


Thr 


Ser 


Glu 


Thr 


Asn 


Ser 


Gly 


He 


He 










325 










330 










335 




Asn 


Gly 


Ser 


Asp 


Trp 


Tyr 


Val 


He 


Arg 


Gly 


Ser 


Arg 


Gin 


Asp 


Asn 


Ala 








340 










345 










350 






Asn 


Tyc 


Phe 


Hl£ 


Arg 


Leu 


Arg 


Glu 


He 


Thr 


Leu 


Glu 


He 


Ser 


Asn 


Thr 






355 










360 










365 








Lye 


Leu 


Val 


Pro 


Ala 


Ser 


Gin 


Lou 


Pro 


r^- • =- 


Tyr 


Trp 


Asn 


Leu 


Asn 


Lys 




370 










37 5 










300 








Glu 


S*r 


Leu 


I.PU 


Ala 


Leu 


He 


Glu 


Glu 


Ser 


Leu 


Tyr 


Gly 


He 


His 


Gly 












390 










395 










40C 


Thr 


Val 


Thr 


Ser 


Ala 


Aid 


Asn 


Gly 


Gin 


Pro 


Leu 


Lys 


Cys 


Gin 


He 


Levi 










405 










410 










415 




lie 


Glu 


Asn 


His 


Asp 


Lys 


Arg 


Asn 


Ser Asp 


Val 


Tyr 


Ser 


Asp 


Ala 


Thr 








420 










425 










430 






Thr 


Gly 


Tyr 


Tyr 


Val 


Arg 


Pro 


He 


Lys 


Ala 


<uy 


Thr 


Tyr 


Thr 


Val 


Lys 






4 35 










440 










445 








Tyr 


Lys 


Ala 


Glu 


Gly Tyr 


Pro 


Glu 


Ala 


Thr 


Arg 


Thr 


He 


Thr 


He 


Lys 




4$0 










4 55 










-160 








Arp 


Lys 


Glu 


Thr 


Val. 


He 


list 


Asp 


He 


Ala 


Leu 


Gly 


Asn 


Ser 


Val 


Pro 


465 










470 










475 










480 


Leu 


Pro 


Val 


Pro 


Asp 


Phe 


Thr 


Ala 


Ser 


Pro 


Met 


Thr 


He 


Ser 


val 


Gly 










485 










490 










4 95 




Glu 


Ser 


Val 


Gin 


Phe 


Gin 


Arp 


Gin 


Thr 


Thr 


Asn 


Asn 


Pro 


Thr 


A-^n 


Trp 








500 










505 










510 






Glu 


Trp 


Thr 


Phr 


Glu 


siy 


Gly 


Gin 


Pro 


Ala 


;iet 


Ser 


Thr 


Glu 


Gin 


Asn 






515 










520 










525 








Pro 


L-5U 


Val 


Ser 


Tyr 


Ser 


Hie 


Pro 


Gly 


Gin 


Tyr 


Asp 


Val 


Thr 


Lou 


Lys 




530 










535 










540 










Val 


Trp 


Asn 


Ala 


Ser 


Gly 


S^r 


Asn 


Thr 


He 


Thr 


Lys 


Glu 


Lys 


Phe 


He 


545 










550 










555 










560 


Thr 


Val 


Asn 


Ala 


Val 


Met 


Pro 


Val 


Ala 


Glu 


Phe 


Val 


Gly Thr 


Pro 


Thr 










565 










57 0 










575 




Glu 


lie 


Glu 


Glu 


Gly Gin Thr Val 


Ser 


Phe 


Gin 


Asn 


Gin 


Ser 


Thr 


Asn 








580 










585 










590 






Ala 


Thr 


Asn 


Tyr 


Val 


Trp 


Tie 


Phe 


Asp Gly 


Gly 


Thr 


Pro 


Ala 


Thr 


Ser 






59S 










600 










505 








Glu 


Asp 


Glu 


Arn 


Pro 


Thr 


Val 


Leu 


Tyr 


Ser 


Lys 


Ala 


Gly 


Gin 


Tyr Asp 




610 










615 










620 










Val 


The 


Leu 


Lys 


Ala 


He 


Ser 


Ala 


Sec 


Gly 


Glu 


Thr 


Val 


Lys 


Thr 


Lys 


625 










630 










635 








640 


Glu 


Lys 


Tyr 


Tie 


Thr 


Val 


Lys 


Lys 


Ala 


Pro 


Val 


Pro 


Ala 


Pro 


Val 


Ala 










6-15 










650 










655 




Asp 


Phe 


Glu 


Gly 


Thr 


Pro Arg 


Lys 


Val 


Lys 


Lys 


Gly 


Glu 


Thr 


Val 


Thr 








660 










665 










67 0 






Phe 


Lys 


Asp 


Leu 


Ser 


Thr 


Asn 


Asn 


Pro 


Thr 


Ser 


Trp 


Leu 


Trp 


Val 


Phe 






675 










680 










685 








Glu 


Gly 


Gly 


Ser 


Fio 


Ala 


Thr 


Ser 


Thr 


Glu 


Gin 


Asn 


Pro 


Val 


Val 


Thr 




690 










695 










700 










Tyr 


Aen 


Glu 


Thr 


Gly 


Lys 


Tyr 


Asp 


Val 


Gin 


Ticu 


Thr 


Ala 


Thr 


Asn 


Glu 


70S 










710 










715 










720 


Gly 


Gly 


Ser 


Asn 


Val 


Lys 


Lys 


Ala Glu Asp 


Tyr 


Ile 


Glu 


Val 


He 


Leu 










725 










730 










735 




Asp 


Asp 


Ser 


Val 


Glu 


Asp 


He 


Val 


Ala 


Gin 


Thr 


Gly 


He 


Val 


He 


Arg 








740 










745 










7 50 






Pro 


Gin 


Asn 


Gly Thr 


Lys 


Gin 


lie 


Leu 


lie 


Glu 


Ala 


Asn 


Ala 


Ala 


He 






755 










760 










765 








Lys 


A1 A 


He 


Val 


Leu Tyr Asp 


He 


Asn 


Gly Arg 


Val 


Val 


Leu 


Lys 


Thr 




770 










775 










700 








Thr 


Pro 


Asn 


Gin 


Leu 


Arg 


ser 


Thr 


Val 


Aep 


Leu 


ser 


He 


Leu 


Pro 


Glu 


785 










790 










7 95 










800 


Gly 


lie 


Tyr 


Thr 


He 


Asn 


He 


Lys 


Thr 


Glu 


Lys 


Ser 


Ala 


Arg 


Thr 


Glu 










COS 










eio 








815 




Lys 


Il- 


Hie 


lie 


Gly 

























820 



(2) IUfTORIIATIOlI FOR SEQ ID HO:4 27 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7€ amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



SUBS T IT U T E SHEET (Rule 26) (RO/AU) 



ILJi C3> *£"t* U ii« 8 



10 



WO 99/29870 PCT/AU98/G1023 

367/490 



tvi) ORIGINAL. SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix> FEATURE: 

(A( HAKE/ KEY: mtscjeaturs 
(6 ) LOCATION 1 ... 76 

SEQUENCE DESCRIPTION: SEQ TD 110:4 27 



Met Arg Leu lie Lys Ala Phe Leu Val Gin Leu L^u Leu Leu Pro He 
15 10 15 

Phe Phs Tyr Lys Arg Phe Iia Ser Pro Leu Thr Pro Pro Ser Cys Arg 
70 25 30 

15 Phe Thr Pro Ser Cys Ser Sec Tyc Ala lie Glu Ala Leu Arg Lys Tyr 
35 40 45 

(11 y Pro Gly Lys Gly Leu Leu Leu Ser Tie Lys Ar<j He Leu Arg Cys 

50 55 60 

His Pro Trp Gly Gly Ssr Gly Tyr Asp Pro Val Pro 
20 65 7 0 75 

(2) IH FORI (ATI OH FOR SKQ ID UO: 429 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 859 amino acids 

(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



35 



50 



55 



60 



65 



70 



75 



(ii) MOLECULE. TYPE: protein 

(ili) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

<ix> FEATURE: 

(A) 1IAI JE/KEY: misc_feature 
( B } LOCATION 1...859 



40 SEQUENCE DESCRIPTION: SEQ ID NO: 428 



45 



Hot 


Ala 


Tyr 


Asp 


Fhe 


Thr 


Gin 


Thr 


Phe 


Arg 


Asn 


Ser 


Leu 


Glu 


Tyr 


Ser 


1 








5 










10 










15 




Tyr 


Glr. 


Glu 


Ala 


Thr 


Arg 


Leu 


Gly 


Val 


Val 


Ala 


Val 


Thr 


Gin 


Asp 


Met 








20 










25 










30 






Leu 


Val 


Leu 


Gly 


He 


He 


Arg 


Asp 


Gly 


Afp 


Asn 


Gly 


Ala 


He 


Asp 


He 






35 










4C 










45 








Het 


Arg 


His 


Tyr 


Gly 


Tie 


Asn 


Leu 


Tyr 


Glu 


Leu 


Lys 


Arg 


Leu 


He 


Glu 




50 










55 










60 










Leu 


Glu 


Ala 


He 


Ala 


Glu 


Ser 


Leu 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 


Ser 


Pro 


65 










70 










75 










00 


lie 


Phe 


Thr 


Pro 


Ser 


Ala 


Arg 


Glu 


Ala 


lie 


Asp 


7\cp 


Ala 


Thr 


Asp 


He 










85 










90 










95 




Cys 


Ala 


Asp 


Met 


Glu 


Asp 


Glu 


Ala 


Val 


Ser 


Pro 


Vai 


His 


Leu 


Leu 


Leu 








100 










105 










110 






Ser 


lie 


Leu 


Asn 


Ser 


Thr 


Gin 


Glu 


Ser 


Leu 


Val 


Gin 


Lya 


He 


Fhe 


Met 






115 










120 










125 








Lys 


Gin 


Gly 


He 


Lys 


Tyr 


Asp 


Thr 


He 


Leu 


Ser 


Asp 


Tyr 


Phe 


Gly 


Gin 




130 










135 










14 0 










Arg 


Afn 


Pro 


Ser 


Glu 


Gly 


Lys 


Ser 


Pro 


Ser 


Glu 


net 


Glu 


He 


Leu 


Asp 


145 










150 










155 










160 


Gly 


Tyr 


Gin 


Asp 


Asn 


Asp 


Phe 


Asp 


Asp 


Glu 


Glu 


Asp 


Glu 


Ser 


Ser 


Pro 










165 










170 










17 b 




Pro 


Ser 


Gly 


Ann 


Ser 


Gly 


Thr 


Gly 


Gly 


Gly 


Ser 


Gly 


AJJp 


Ala 


Pro 


Glu 








180 










185 










19C 






Gin 


Asn 


Thr 


Gly 


Gly 


Gly 


Asp 


Thr 


Thr 


Thr 


Thr 


Thr 


Arg 


Ser 


Gly Gly 






155 










200 










205 








Asp 


Thr 


Pro 


Ala 


Leu 


Asp 


Thr 


Phe 


Gly 


Thr 


Asp 


He 


Thr 


Ala 


Het 


Ala 




210 










215 










22-0 










Ala 


Ala 


Gly 


Lye 


Liu 


Asp 


Pro 


Val 


Val 


Gly 


Arg 


Glu 


Gin 


Glu 


Tl e 


Glu 


225 










230 










235 










240 


Arg 


Vol 


lis 


Gin 


He 


Leu 


Ser 


Aig 


Arg 


Lys 


Lys 


A^n 


Asn 


Pro 


Val 


Leu 










245 










250 










255 




lie 


Gly 


Glu 


Pro 


Gly 


Val 




Lys 


Ser 


Ala 


Tie 


Val 


Glu 


Ciy 


Leu 


Ala 








260 










265 










?70 
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Glu 


Aj:g 


He 


Val 


Arn 


Arg 


Lys 




















lie 


Tl ? 


Ser 


Leu 


Asp 


Leu 


Ala 






290 










295 


5 


Gly 


Gin 


Phe 


Glu 


Glu 


Arg 


I.CU 




305 










310 






Asn 


Pro 


-^)n 


He 


He 


Leu 


Phe 












325 






10 


Ala 


Gly 


Ser 


Ala 


Ala 


Gly 


Ser 








340 








Ala 


Leu 


Ala 


Arg 


Gly 


Gin 


Val 








355 












Glu 


Tyr 


Ary 


Lys 


A^n 


lie 


Glu 


15 




37 0 










375 


Gin 


Lys 


Val 


Pro 


He 


Ala 


Pro 




385 










390 






Leu 


Gin 


AtJi 


He 


Lys 


GlU 


Lys 












4 05 






20 


Thr 


A.sp 


Glu 


Ala 


He 


Lys 


Ala 






420 








Set 




Arg 


Phe 


Phe 


Pro 


Asp 








435 












Gly 


Ala 


Ser 


Val 


His 


He 


Thr 


25 




450 










455 


lie 


Leu 


Glu 


Ala 


Glu 


Leu 


Ala 




4 65 










470 






Val 


Lys 


Ala 


Gin 


Asn 


Tyr 


Glu 












485 






30 


Aig 


Arg 


Thr 


Gin 


Gin 


Gin 


He 




500 










Gin 


Met 


Ser 


Lys 


His 


Arg 


Glu 








515 












Vol 


Val 


Ala 


Leu 


Met 


Thr 


Gly 


35 




530 










535 


Glu 


Gly 


Glu 


Arg 


Leu 


Axg 


Thr 




545 










550 






Val 


Gly 


Gin 


Asp 


Thr 


Ala 


He 












565 






40 


Asn 


Arg 


Leu 


Gly 


Leu 


Arg 


Asn 








580 










Phe 


Leu 


Gly 


Pro 


Thr 


Gly 


Val 








595 












Ala 


Glu 


Tyr 


Leu 


Phe 


Glu 


Asp 


45 




610 










615 


Ser 


Glu 


Tyr 


Met 


Glu 


Lys 


Phe 




625 










630 






Pro 


Gly 


Tyr 


Val 


Gly 


Tyr 


Glu 












645 






50 


Arg 


Arg 


Lyr 


Pro 


Tyr 


Ser 


Val 






660 








Ills 


Ala 


Arp 


Val 


Plie 


Asn 


Leu 








67 5 












Leu 


Thr 


Asp 


s«r 


Leu 


Gly 


Arg 


55 




690 










695 


lie 


lie 


Thr 


Ser 


Asn 


Val 


Gly 




705 










710 




Gly 


lie 


Giy 


Phe 


Arg 


Ser 


GlU 












725 






60 


Ser 


Arg 


Ser 


Val 


He 


Gin 


Lys 








740 








Phe 


Leu 


Arn 


Arg 


Leu 


Asp 


Asp 








755 












Thr 


Glu 


He 


Arg 


Arg 


Met 


Val 


65 




770 










775 


Arg 


lie 


His 


Arg 


Ala 


Gly 


Tyr 














790 






Asp 


Val 


He 


Ala 


Thr 


Lys 


Gly 












805 






70 


Leu 


Lys 


Arg 


Thr 


Leu 


Gin 


Asn 






820 










lie 


Leu 


Ser 


Gly 


Gin 


He 


Glu 








835 












Arg 


AFp 


Gly 


Glu 


He 


He 


Val 






Q50 










855 


75 
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Val 


Ser 


Arg 


He 


Leu 


Phe 


Asp 


Lys 


Arg 


2B0 










285 








Gin 


Het 


Val 


Ala 


Gl y 


Thr 


Lys 


Tyr 


Arg 










300 










Lys 


Ala 


Val 


Leu 


Asp 


Glu 


Leu 


Lys 


Lys 








315 










320 


He 


Asp 


Glu 


11 = 


His 


Thr 


He 


Val 


Gly 






330 










335 




Met 


Asp 


Thr 


Ala 


Asn 


Het 


Leu 


Lys 


Pro 




34 5 










350 






Gin 


Cyt 


He 


Gly 


Ala 


Thr 


Thr 


Leu 


Asp 


360 










365 






Lys 


Asp 


Gly 


Ala 


Leu 


Glu 


Arg 


Arg 


Phe 










380 










Ser 


Thr 


Ala 


Glu 


Glu 


Thr 


Leu 


Thr 


lie 








395 










400 


Tyr 


Glu 


Asp 


Tyr 


His 


Gly 


Val 


Arg 


Tyr 






410 










415 




Ala 


Val 


Glu 


Leu 


Thr 


Asp 


Arg 


Tyr 


Val 




425 










430 






Lys 


Ala 


He 


Asp 


Ala 


Het 


Asp 


Glu 


At a 


440 










445 








Asn 


Val 


Val 


Ala 


Fro 


Lys 


Glu 


He 


Gl u 










460 










Ser 


Val 


Arg 


Glu 


Asn 


Lys 


Leu 


Ser 


Ala 








4 75 










4 80 


Lsu 


Ala 


Ala 


Ser 


Phe 


Arg 


Asp 


Gin 


Glu 






490 










4 95 




Ala 


Glu 


Glu 


Lys 


Lys 


Lys Trp 


rau 


Glu 




505 










510 






Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 


Ala 


His 


520 










525 








Val 


Pro 


Ala 


Glu 


Aru 


Leu 


Ser 


Thr 


Gl y 










540 










Het 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


Lys 


Val 








555 










560 


Glu 


Lys 


I let 


Val 


His 


Ala 


lie 


Gin 


Arg 






570 










57 5 




Glu 


Lye 


Lys 


Pro 


11 = 


Gly 


Ser 


Phe 


Leu 




585 










590 






Gly 


Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 


Lys 


Leu 


600 










605 








GlU 


Asn 


Ala 


Met 


He 


Arg 


Val 


Asp 


Met 










620 










Ser 


Val 


Ssr 


Arg 


Leu 


Val 


Gly Ala 


Pro 








635 










640 


Glu 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 


Arg 


Val 






650 










655 




Val 


Leu 


Leu 


Asp 


Glu 


Ho 


Glu 


Lys 


Ala 




665 










67 0 






Leu 


Leu 


Gin 


Val 


Met 


Asp Glu Gly Gin 


680 










685 








Arg 


Val 


Asn 


Phe 


Lys 


Asn 


Thr 


Val 


He 










700 










Thr 


Axg 


Gin 


Leu 


Lys 


Asp 


Phe Gly Gin 








715 










720 


Lys 


Afc-p 


Glu 


Glu 


Ala 


Asn 


Lys 


Glu 


His 






730 










735 




Ala 


Leu 


Asn 


Lys 


Thr 


Phe 


Ser 


Pro 


Glu 




745 










750 






He 


He 


Leu 


Phe 


Asp 


Gin 


Leu 


Gly 


Lys 


760 










765 








Asp 


He 


Glu 


Leu 


Lys 


Ala 


Val 


Leu 


Ala 










780 










Asp 


Leu 


Val 


Leu 


Thr 


Asp 


Glu 


Ala 


Lys 








7 95 










000 


Tyr 


Asp 


Leu 


Gin 


Tyr 


Gly 


Ala 


Arg 


Fro 






810 










815 




Glu 


Val 


Glu 


A.«?p 


Arg 


Leu 


Thr 


Asp 


Leu 




825 










830 






Lys 


Gly 


Gin 


Thr 


Leu 


Thr 


Leu 


Ser 


Ala 


840 










845 








Gin 


Glu 


Gin 


Ala 
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(2) IIJFORMATIOH FOR SEQ ID MO: 129 

(i> SEQUENCE CHARACTERISTICS: 

<A} LENGTH: 417 amino acids 
(P) TYPE: ammo acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(nil HYPOTHETICAL: YES 

{vij ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s ginqivali? 

(i:-:> FEATURE: 

(A) DAHE/KEY: misc feature 

(B) LOCATION 1. . .-117 

C.i) SEQUENCE DESCRIPTION: SEQ ID 140: -129 
rier Asn Tyr Leu Tyr lie Leu Tic Thr Leu Le 



1 5 

Ser Gly Ala Gl j Tie Ala Phe Leu S 

20 



Leu Leu Ser Gly Phe Phe 
15 

er Ser Asp Lys Leu Arg Leu Glu 
25 30 



T.eu A-p Arg Am Arg Gly Asp Leu Thr Gly Arg Ala Leu Asn Leu Leu 

35 4c 45 

Tyr Arg His Pro Asp Gin Leu Val Thr Thr Leu Leu Val Gly Asn Asn 

b0 55 go 

lie Val Leu Val Vol Tyr Gly Leu Leu Het Ala Gly Leu Leu Ala Ala 
60 70 75 an 

Pro Leu Ala Gin Tr P lie Asp Asn Asp Ala Met Tie Val Val Leu Gin 

85 go g5 

Ser Val Leu Ser Thr lie He He Leu Pne Thr G1 Glu phe Leu pro 

100 105 no 

Lys Ala lie Phe Lys Thr Asn Ala Asn Met Met Met Arg Val Phe Ala 

115 120 125 

Leu Pro lie Val Ala lie Tyr Tyr Leu Leu Tyr Pro Leu Ser Lys Leu 

130 i35 140 

Phe Thr Gly Leu Ser Arg Ser Phe lie Arg Leu Val Asp Lys Asn Tyr 
145 150 155 160 

Asp His Tyr Leu 



Val Pro Thr Thr Val Gly Leu Gly Arq Val Asp Leu 



165 



170 175 



Ala blu Asn Met Ser Gly Glu Asn Glu Gin Asn Asp Leu Thr Thr Glu 
, r . t 180 Itfi 190 

Val Lys lie He Gin Asn Ala Leu Asp Phe Ser Gly He Gin Val Ara 

195 200 205 

Asp Cys net Tie Pro Arq Asn Glu Met He Ala Cys Glu Leu Gin Thr 

210 215 220 

Asp He Glu Val Leu Lys Thr Thr Phe lie Asp Thr Gly Leu Ser Lys 
230 235 240 

lie He He Tyr Arg Gin Asn He Asp Asp Val Val Gly Tyr He His 

245 250 255 

Ser Ser Glu Met Phe Arg Gly Gin Asp Trp Gin Lys Arg He Asn Thr 

260 265 270 

Thr Val Phj Val Pro Glu Ser Met Tyr Ala Asn Lys Leu Met Arg Leu 

275 280 285 

Leu Met Gin Arg Lys Lys Ser He Ala He Val Tic Asp Glu Leu Gly 

290 295 3oq 

Gly Thr Ala Gly Met Val Thr Leu Glu Asp Leu Val Glu Glu He Ph- 
310 

Cly Asp He Glu A»p Glu His Asp Thr Arg Lys He He Ala Lys Gin 

Leu Gly Pro His Thr Tyr Leu Val Ser Gly Arg Met Glu He Asp Asp 

340 345 350 

Val Asn Glu Aty ?he Gly Leu Ser Leu Pro Glu Ser Asp Asd Tyr Leu 

3j5 360 355 

Thr Val Ala Gly Phe He Leu Asn Ser His Gin Asn He Pro Gin Ala 

•* 0 375 ago 

Asn Glu Val Val Glu lie Ala Pro Tyr Thr Phe Thr He Leu Arq S-r 
I 8 " 390 395 403 

Ser Scr Thr Lys He Glu Leu Val Lys Met Ser He Asp Asp Gin s-r 

'° 5 410 «5 
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\?) 1 1 ' FORI IATI OH FOR SEQ TD NO:430 

fi) 3EQUENCF CHARACTERISTICS - 

(A) LENGTH: 293 amino acids 
(BJ TYPE; amino acid 
(O) TOPOLOGY: linear 

(iij MOLECULE TYPF: protein 

( 1 i i ) Hi POTHETI CAL : YES 

(vi) ORIGINAL. SOURCE: 

(A) ORGANISM: Porphyromonw gingivalis 
(ix) FEATURE: 

(A) MAT- IE /KEY: mi sc feature 

(BJ LOCATION 1. . .293 

ir.i) SEQUENCE DESCRIPTION: SEQ ID NO: 430 
Met Lys Gin Ann Tyr Pho Ly* Arc, Val Cy» Ser Leu Leu Trp Leu Val 
Leu Pro M,t Leu l le Met P ro Leu Glu V a °i Ala Ala Gln Qlu J*, ^ 
Pro A,n Glu Glu Val L.u clu ? er Leu Thr P h , Val Ala Pro Val Glu 



10 

25 1 30 
Leu Thr Ph* Val Ala Pro 

si- Th, Ala ile Glu G1 0 u Va) Glu Ala Leu « n aiu iie ^ 

Ala Thr Olu Glu Ile Ala Glu al „ Ala Vfll ftrg |° t Tyr ^ 

V^l T> ^n ». ~, - - - 7 ^ or, 



Val T> - r Arg A ., p „ y Val LyB ^ t ^ p ^ 

Phe Leu Asp Glu tep v ,l Pro Ala Sly OXu His Thr Tyr Cy s Val Glu 

Val Gl„ Tyr GlB Glv Gly val ^ ^ ^ 110 ^ 

120 12 e 



Val Lys Asp ehe Lys p „ Va , Thr , rhr ^ 125 ^ ^ 

Asp Glu Val Ser le u Asp Trp Asp Gly yal Clu ™ Lys flla 
Pr ° S " ^ & Ma V ^ -« *yr *.» "l 5 Tyr Ly S A sn ci y J£ 



155 

Leu Ilt c,y Asn i Br ^ Glu Thr »» ^ ^ ^ ^ 175 ^ 



*« aiv Tjr Tyr „. Tyr Glu ^ Glu Vii hys Tj . c ?ro - 

Thr Val Thr 

Arn Val Asp Gly «ln Ala ^ Tyr Thr Leu Ac 220 



Z05 
Ser 

; - s . al) „ in „^ Pro Tyr Thr Leu ^ Val Glu ^ L yff 



S " Ly * "» V3i ™f Val ™" «»1 Thr A,n ler Ser Leu Ser 



ne Ala oiu A ia 51,' GJ y H=t n . Thr f« ^ ^ ^ ^ „0 
*r 9 Thr Vai Ala Va! Ala Pr0 Asn Ar, 22 «„ Tyr M« Ala S Th I 
GlV Phe Tyr Ala Val Ar g P he Asp Val^ Gly A^n Lys Hi, ^ val s»r 

Li^ He Gin Val Aro 285 

2 90 

(2 J INFORMATION FOR SEQ ID NO: 4 31 

UJ SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 312 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginoxvalis 
( iX ) FEATURE : 

(A) NAME/ KEY: misc^f eat ur- 
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(B ) LOCATION 1 . . . 312 



<xi» SEQUENCE DESCRIPTION: SEQ ID NO: 431 



Met 


lie 


Pro 


Leu 


Ser 


Glu 


Ser 


Phe 


1 








5 








Lys 


Thr 


He 


Asp 
20 


Ala 


Asp 


Gly 


Asp 


Asn 


Phe 


Thr 


Gly 


Gin 


Ser 


Gly 


Leu 






35 










40 


Gly 


61 y 


Val 


Gly 


Air. 


Leu 


Thr 


Pro 




50 










55 




Leu 


Lys 


Leu 


Pro 


Thr 


Asp 


Ala 


Leu 


65 










70 






Thr 


Gin 


Asp 


Leu 


Thr 


Ala 


Pro 


Ser 










85 








Ser 


Thr 


Gly 


A&n 


Asn 


Ala 


Ala 


Asp 








100 










Thr 


Leu 


Thr 


Ala 


Lys 


Arq 


He 


Gin 






115 










120 


Axg 


Thr 


Gin 


Gly 


Val 


Tip 


Tyr 


Gin 




130 










135 




Thr 


Lys 


Tyr 


Val 


Ala 


Phe 


Arq 


His 


145 










150 






Leu 


Asn 


Leu 


Asp 


Giu 


Val 


Sex 


He 










165 








Ala 


Pro 


C"2 


Pro 


His 


rro 


Gly 


Gly 








1B0 










Gly 


Gin 


Lys 


lie 


Ala 


Ser 


Gly 


Leu 






195 










200 


Asp Val 


Pro 


Tyr 


Gly 


Thr 


Gin 


Asp 




210 










215 




Gin 


Gly 


Asp 


Ser 


Tyr 


Lys 


Val 


Cys 


725 










230 






Ala 


Asn 


lie 


Tyr 


Gly 


Ala 


Asp 


Lys 










245 








Lys 


Thr 


He 


Val 


Ala 


Ser 


Ala 


Phe 








260 










He 


Axg 


Gly 


Arg 


I.eu 


He 


Ala 


Ser 






275 










280 


Ala 


Glu 


Asn 


Gly 


Phe 


Tyr 


Leu 


He 




2 90 










295 




Tyr 


Thr 


Glu 


I.ys 


He 


-31 n 


He 


Gin 


305 










310 







Glu 


Ser 


Gly 


He 


Pro 


Ala 


He 


Trp 




10 










15 




Gly 


Tyr 


Asn 


Trp 


Het 


His 


Leu 


Thr 


25 










30 






eye 


Val 


Ser 


Ser 


Ala 

45 


Ser 


Tyr 


He 


Asp 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Glu 








60 










Val 


Glu 


1 1 e 


Tie 


Tyr 


Trp 


Val 


Cys 






75 












Glu 


His 


Tyr 


Aid 


Val 


Tyr 


Ser 


Ser 




90 










95 




Phe 


Val 


Asn 


Leu 


Leu 


Tyr 


Glu 


Glu 


105 










110 






Ser 


Pro 


Glu 


Leu 


He 


Arg 


Gly 


Ann 










125 








Arg 


Lys 


Val 


Val 


Leu 


Pro 


Asn 


Asp 








140 










Phe 


Asn 


Ser 


Thr 


A&p 


Asn 


Phe 


Trp 






155 










160 


Leu 


Tyr 


Thr 


Pro 


Leu 


Pro 


Arg 


Arg 




170 










175 




Tyr 


Thr 


Tyr 


Ser 


Val 


Phe 


Arg 


Arp 


185 










190 






Ser 


Ala 


Leu 


Ala 


Tyr 


He 


Asp 


Thr 










205 








Tyr 


Cys 


Val 


Gin 


Val 


Asn 


Tyr 


Leu 








220 










Lys 


Asn 


Tl- 


Val 


Val 


Ala 


Asn 


Ser 






235 










240 


Pro 


Phe 


Ala 


Leu 


Thr 


Val 


Val 


Gly 




250 










255 




Lys 


Gly 


Glu 


He 


Thr 


Leu 


Tyr 


ASp 


265 










270 






Gly Cys 


Asp 


Thr 


Leu 


Axg 


Tyr 


Lys 










28b 








Lys 


Ile 


Gin 


Val 


Asn 


Gly Thr 


Val 








300 











(2) INFORMATION FOR SEQ ID MO: 4 32 

(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

Ui] MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonas qingivalis 

(ix) FEATURE: 

(A) NAT IE /KEY: mlsc feature 
IB J LOCATION 1. . .843 

1;:1) SEQUENCE DESCRIPTION: SEQ ID NO: 4 32 

Met Lys Lys Sex Phe Leu Leu Ala He Val tlet Leu Phe Gly He Ala 

15 10 15 

Met. Glu Gly His Ser Ala Pro Val Thr Lys Glu Arg Ala Leu Ser Leu 

20 25 „ 30 

Ala Arg Leu Ala L&u Arg Gin Val Ser Leu Arg Met Gly Gin Thr Ala 

35 40 45 

Val Ser Asp Lys He Set He Asp Tyr Val Tyr Arg Gin Gly Asp Ala 

50 55 60 

Glu Axy Gly He Thr Ser Gin Glu Glu Gly Ser Pro Ala Tyr Phe Tyr 
65 70 75 90 
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Asn Gly Hi, Phe Ala Asn Asp Pro Met Arg Trp A*n Gin Gly Tyr Pec 
Trp Asn Asn Ly, Glu Pro Lou Leu Pro £n Gly His Ala Tyr Thr 
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Val Ala A,n Arg Gly Aen A*n Glu Gly Tyr Ala Leu Val Ala Ala Asp 

85 90 a c 

Asp Arg II, Pro Thr 11, Leu Ala Tyr Ser Pro „. G R £ 

Met ASP ^ HSt P " *«P A*n I.cu Arg Met Trp Leu Gin lie Tyr Asp 
Gin Glu lie Gl y Leu He Leu Ser Gly Lys Al. Gin Leu Asn Glu Glu 

10 J« ?ft Val ?r ° Ala G1U Val Ala Leu net A.p 

150 15S 16 q 

Tyr 
175 
Tyr 

15 Gly cys Val Ala Thr Ala Ala Ala Gin lie Met Arg Tyr H fs Ser Trp 

200 20"! 
Pro Leu Gl„ Gly Glu Gly Ser Phe Asp Tyr His Ala Gly Ser Leu Val 

20 MS A """ ^ G1V ?™ Ph " Gly G1U " at T ' r Tr P «« A.n Met 

Pro Gly Asn Pro Asp Leu Asp Asn Leu Thr Gin Ser Gin Val Asp Ala 
245 250 25% 

25 Tyr Glu Asn oly Ser Gly Thr Tyr Ser Val Tyr Val Val 11° Aia L „, 
Arg a™ Asn Phe Arg Tyr Lys Irg Sec l=u Cln I.eu Hi" Val Arg Ala 

30 30S *" G1 " " is As P M«t 11- Arg Gly Glu Leu Ala 

Ser Gly Arg Pro Val Tyr Tyr Ala Gly Asn Aen Gin Scr He Gly Ml" 
Ala Phe Val Cys Asp 01y Tyr flla ^ ^ ^ ^ 335 ^ 

35 Trp Gly Trp Gly Gly Val Ser Asn lly Phe Tyr Lys Lou Thr Leu Leu 

000 360 



Tyr Ala Thr Leu Met Arg Asp Val Ser Ala Ser Val Ser Hat Ser Phe 
2o ° 265 



ser Pro Thr S«r Leu Gly He Gly Gly Glu Gly n s g" eh e Thr He 

3/5 300 
Tyr Gin Glu He He Thr Gly He Glu Pro Ala Ly S Thr Pro Ala Glu 

Ala Gly Thr Asp Ala Leu Pro He Leu Ala lit Ly, Asp He CU„ A^a 



Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val Gly Tyr Ser He Tyr Asn 



<1° 415 
Val Gly Tyr Ser He 

45 Thr Gly Glu G Vu Gln s „ Asn Leu Asp Leu Gly Tyr Arg Leu Asn Lys 
Ala Asp Gly Glu v„l He Glu Ly., Thr Ser Ser He Asn He Ser 

50 lis" T/C G1V ' Tyr sl " G ^ HlE Pro Clu s « «" A,a Pro Asn 

Gin Leu Ser Gin Gly u. Asn Thr He Thr Leu Leu Tyr Arg Arg Thr 

4 90 aqg. 
Gly Thr Glu Gin Trp Glu Pro Val Arg His Ala Gin Gly Gly Tyr Val 

55 Asn Ser lie Lys Val Asn Thr Thr A^p Pro Asn Asn Val Val Val Thr 
Val Asp Asn Asn Glu Gly Lys £2 Ser He Val Pro "n Scr Phe Val 

60 £45 ^ P L * U *"* S " *£J G1U H1S S « Thr He Thr Val Gi „ Phe Asn 
Ser Asp Ser Pre Asp Glu He Arg Thr Pro v" Ala Phe Ala Leu tlr 
Thr Gly Ala Thr Ala Asp Asp Val He J™ L eu Gly Trp Val 5" Ala 
65 Clu Val Pro C1 y Gly Ser Ser Asn Tyr Pro Val Val Trp L ? r Lys Asp 
Val Leu Thr Leu Ser Glu Gly A,p Tyr Thr Leu Trp lyr Arg Phe Ser 

70 I2I AE " AS " G1 " L/3 Glu ^ H= ""/ Ser Val Ser Val 

Lys Thr Pro Thr Glu Tyr Thr His Pro Leu Phe Glu Val Gly His Asn 

o45 gen 
Gin Thr Ser Thr Tyr Thr Leu Asp Her Ala His Asn Arg Val Leu Pro 

75 Asp Phe Thr Leu Lys Asn Leu Gly Leu Pro Phe Asn Gly "u Leu Val 
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67f) 680 



~ " " oo<j 685 

Val Val Phe Arg Gin Thr Gin Ser Ser Ser Gly Ser Leu Trp Ala Al 



Val Val Glu Gly Pro lie" Pro Asp Gly Ser T^r Arg Ala Thr Leu 

Asn Gly Gin Gin Gin Lsu 
740 745 



Ala Phe Val Ajn GVy Gin Gin Gin Lc-u lyr Leu Lys Gly Lys Irg Asn 



Gin Glu Thr Val His lie Lys Gin Gly Glu Thr Php Val Tyr Lys Pro 

Arg Ala 
Lys Gly 

Tyr Thr Val Lys 1 1 o Val Asn Gly Thr Ala Val Glu Ala He GU, Ser 

/jb 760 765 

Ser Glu Glu He Arg Val Phe Pro Asn Pro Ala Ara Asp T yr Val Glu 

775 7 fl0 
lie Ser Ala Pro Cys He Pro Gin Glu Thr Ser lie He Lou Phe Asp 
790 795 aon 

Leu Ser Gly Lys He Val Met Lys Asn Ser Leu Ser Ala Gly His Gl- 
80b 810 315 J 

Arg IJct Asp val Ser Arg Leu P ro Asn Gly Ala Tyr He L-u Lvs Val 

820 825 g 30 

Asp Gly Tyr Tht Thr Lye Tie Asn He Val lixs 
835 840 

12) INFORMATION FOR SF.Q ID NO: 433 

(1) SEQUENCL CHARACTERISTICS: 

(A) LENGTH: 2 90 amino acxos 

(B) TYFE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

fix) FEATURE: 

(A) NAME/ KEY i mire teatuie 
(B} LOCATION 1 _ .290 

(xi) SEQUENCE DESCRIPTION: SEQ TD 1 JO: 4 33 

Net Lye Lys Lou Phe Leu Ser Leu Thr Ser Leu Val Met Val Phe Ala 

Val Ala Ser Cys Asp lie He Asp L y* Asp Gin Thr Leu Leu Pro Ala 

Fro Thr Asn Val Thr Pro Asp Asn Pro Asp Asp Asn Pro ler Glu lie 

Asp He Thr Gin Thr Hi* Thr Glu Lys Tyr Val Leu Ara Glu Glu Ph- 

5)0 55 eo 

Thr Gly Gin Lys Cys Leu Asn Cys Pro Lye Gly His Arg Lye Leu Ala 

Ala Leu Lys Glu Gin Tyr Gly I.y* Axg Leu Thr Val Val Gly He Hi 
85 90 9s 



Tyr Lys Thr Trp Asp Val Pro He A) a Glu Gin Met Glu Gin Lys Ala 

-*- S0 155 i t 

Lye II, Asn He Phe Ala Val Ala Glu Tyr Thr Asp Thr Gin Lys il- 
ls- 170 17S 



Ala Gly Pro Gly Ser Leu Val Pro Pro LeU Phe Arg Thr Glu Ala Gly 

100 105 110 

Asp Ala Tyr Tyr Sex Lys Phe Ala A* n Asn Thr Pro Leu Pre Ala Leu 

1 120 12 s 

Met val Ser Arg Lys Lys Phe Gly Ser Ser Tyr Val Tvr A^p Lys S -r 
135 

Met 
Asp 

Lys Val Thr Val Lys" Gly Lys 11= Leu Glu Gly Asn Thr Leu III Lys 

1°° 185 190 * 

Ser Met Val Gin Val Tyr Leu Leu Glu Asp Lys Leu lie Ala Pro Gin 

1?5 200 20 e 

Val Asp Gly Asn Thr Thr Val Glu Asn Tyr Glu Hi R Aen Hie Val Leu 

■ £ - LU 215 °2.0 

Arg Gly Ala Val Arn Gly He Trp Gly Glu Glu Phe Val Asn Leu Ly b - 

23^ 240 
Asp Tyr Leu Tyr Thr Tyr Ala Val Glu Pro Leu* Ser Gly Met Ser Phe 

Val Ala Glu Asn l£ Ser He Val Ala Phe v.l Tyr Asp Val "n Thr 
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260 265 



Phe Glu VI Tyr Asp Va! Val His V.X Lys Tie A,„ Pro Gin Ser A.p 

Gly Ly, " ' 
290 

(2) INFORMATION FOR SEQ ID NO: 4 34 

(1) SEQUENCE CHARACTERISTICS* 

CA» LENGTH : 223 amino acids 
{B) TYPE; amino acid 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(.1) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona^ gingivals 

( 1 *c ) FEATURE : 

(A) NAME/ KEY: misc^f eature 

(B) LOCATION 1 . . .223 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 434 
Met Ly, Ly« Ser Ser Vol Val Ala Sc-r Val Leu AU Val Ala I,u Val 
Phe Ala Gly cys Gly L eu Asn Arn Met Ala Lys Gly Gly Leu iL Gly 
Ala Gly Val Gly Gly Ala He Gly aL Gly Val Gly Asn Val Ala Gly 
Arn Thr Ala Val Gly Ala lie Val Gly Thr Ala Val Gly Gly Ala Ala 
Gly Ala Leu lie Gly Lys Lys M . t Asp Ly , ty . Lys ^ ^ ^ 

Ala Ala Val Pro A=p Al 5 Thr He Gin Thr vfl Asn Asp Gly Glu IL 
lie Leu Val Thr Phe Asp Ser Gly ji e '° u p„ e H; Thr ^ j» r ^ 
Ssr 

115 120 izs 

net Asn Ly t Asn Pro Asp Thr Asp lie Arg II* Val Gly Hi.. Thr Asp 

A.,n Thr Gly Ser Asp Lys lie Asn A.sp Pro Leu Ser Glu Arg Arg Ala 

15U 



Thr Leu ser Pro Asn Ser Arg Thr Ala Leu Thr Lys Phe A^a Ala Asn 



Ala ser Val Tyr Ser Phe Leu Asn Ser «„ Gl y Val Ser „« a „ a" 
Met Ala Ala Glu Gly Arg Gly Ser His Glu Pro Val Ala A, P Aln Ser 

3 8 5 1 an 

Thr Val Ala Gly Arg Ser Ala Asn Ar, Arg Val Clu Val Tyr He Leu 

Pro Asr. Ala Lys MeL He Glu Gi°n Ala Gin Gin Gly n,l L e U Lys 

215 220 
(2) INFORMATION FOR SEQ ID NO: 4 35 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH : 337 amino acids 
(ej TYPE: ammo acid 
<D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona c gingival!* 

i±:<) FEATURE: 

(A) NAME/KE/: mj.sc feature 

(B) LOCATION 1. . .337 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 35 
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Iter Ser Lys Lys Ser lis Leu Leu Lsu Cys Cys Ser Leu Cys Ph» II- 

1 5 10 15 

Ser Ala Thr Lys Ala Val Thr Pro Vsl Ary Asn Val Arg Asn Ssr Gin 

20 25 30 

Val Asn Ser Lys Ala Lys Thr Glu Arg Thr Lys Pro Ser Asp Ser Val 

35 40 45 

Arg Tyr Tie Ser Asn Met He Ala Asp Arg Leu Glu Phe Arg Asn Ly- 

50 55 60 

lie Ser Ser Glu Lys Glu Val Arg Lys Ala Glu Tyr Glu Asn Arg L~u 
65 70 -75 so 

Ala (1st Glu Ala Leu Asn Tyr Pro Ala He Asp Lou Tyr Gly Glu Asp 

85 90 95 

S-r Trp SeL Glu Tyr Val Asn Pro Phe Val Gly Ala Gly Thr Asp Val 

100 105 no 

Glu T1~ Pro Asn Ser Tyr Asp He Asp Cys Ser Ser Phe Val Met Pro 

115 120 125 

Val Glu Asp Lys Gin Val Thr Ser Gin Phe Gly Tyr Arg Arg Arg Ph- 

3 30 135 ijo 

Gly Arg H~t His Tyr Gly II e Asp Leu Ser Val Asn Ara Gly Asp Thr 
145 150 155 " 160 

He Arg Ala Ala Phe Asp Gly Lys Val Arg Val Ara Ser Tyr Glu Ala 

165 170 " 175 

Arg Gly Tyr Gly Tyr Tyr He Val Leu Arg His Pro Asn Gly Leu Glu 

180 ins 19p 

Thr Val Tyr Gly His Met Ser Arg Gin Leu Val Aso Glu Asn Gin H~ 

l ?s 200 " 205 

Val Arg Ala Gly Gin Pro He Gly Leu Gly Gly Ser Thr Cly Arg Ser 

210 215 220 

Thr Gly Pro His Leu His Phe Glu Thr Arq Fhe Met Gly He Pro Il» 
? 25 230 £35 240 

Asn Pro Ser Thr He He Asp Phe Asp Asn Gly Val Pro Leu Arg Asp 

245 250 2S5 

He Tyr Thr Phe Lys Arg Gly Ser Asn Ser Arg Tyr Ala Lys Ala Ser 

260 265 270 

Lys Thr Ser Ser Arg Tyr Ala Lys Lys Gly Lys Lys Gly Arg Gin Ala 

275 280 285 

Ser Ser Pro Met Thr Tyr Arg He Lys Lys Gly Asp Thr Leu Glu Thr 

290 295 300 

He Ala Lys Arg His Gly Thr Ser Val Gin Lys Leu Cy 3 Ala Thr Asn 
305 310 3 i5 320 

Gly He Gly Lys Ser Lys He Leu Thr Pro Gly Lys Ala Leu Arg 11 = 
325 330 335 

Lys 

(2> INTORIIATION FOR SEQ ID NO: 436 

(1) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH : 151 amino acids 
(B> TYFE: amino acid 
(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

|vi) ORIGINAL SOURCE: 

<A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: roisc_f eaturs 

(B) LOCATION 1...151 

«Xl) SEQUENCE DESCRIPTION: SEQ JTi NO: 436 

Met Ala Lys, He Asn Ph- Tyr Ala Glu Gly Val Ser Leu Pro Arg II- 

1 5 10 15 

Arg Arg Arg He Val Gly Lys Trp He Ala Glu Val Cys Ser Arg Tyr 

20 25 30 

Gly Lys Ala Val Gly Glu He Ser Tyr Leu Phe Cys Asp Asp Glu T^r 

35 40 45 

He Leu Lys Ala Asn Gin Glu Phe Leu Asp His Asp Tyr Tyr Thr Asp 

50 55 60 

lie He Thr The Asp Ser Cys Glu Ala Asp Thr Val Asn Gly Asp Leu 
65 7C -- 



75 80 
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Lau lie S*?r Leu Asp Thr Val Arg Ser Asn Ala Arg Ala Leu Asp Leu 

85 90 9b 

Arg Tyr Glu Asp Glu Leu His Arg Val lis Tlf His Gly He Leu His 

100 105 110 

Leu Cys Gly Leu Lys Asp Lys Ser Lys Lys Asp Glu Ala Gin fist: Arg 

115 120 125 

Ala Ala Glu Glu Lys Ala Leu Val Met Leu Arg Glu Thr He Gly Ser 

130 135 140 

Glu Leu Ser Leu Leu His Thr 
145 150 



12) INFORMATION FOR SEQ ID HO: 4 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

til) IIOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



tvi) OR7GINAL SOURCE: 

(A) ORGANISM: PorphyromonaE gingivalis 

lix) FEATURE: 

(A) NAME/KEY: mxscfeature 

(B) LOCATION 1. . . 391 

(si) SEQUENCE DESCRIPTION: SEQ ID NO: 437 



Met 


Lys 


Val 




Tyr 


Leu 


Met 


Leu 


1 








5 








Asn 


Ala 


Ser 


Ala 


Gin 


Glu 


Asn 


Thr 








20 










Ala 


Lys 


Asn 


Val 


Ala 


Phe 


Ala 


Arg 




35 










40 


Val 


'thr 


Leu 


Gin 


Gly 


Gly 


Val 


Ala 




50 " 










55 




Asn 


Lys 


Asp 


Leu 


Het 


Asp 


Arg 


Leu 


65 










70 






Gly Lys 


Tyr 


His 


Ser 


Pro 


Phe 


Phe 










as 








Gly 


Gin 


Ala 


Hi 3 


Thr 


Phe 


Leu 


Gly 








100 










Thr 


Asn 


Phe 


Gly 


Ala 


Ala 


His 


Phe 






115 










120 


Tyr 


Phe 


Ala 


Pro 


Tyr 


Arq 


Glu 


Asn 




130 










135 




Val 


Gly 


Val 


Gly 


Tyr 


Gin 


His 


T,ys 


145 










IbQ 






Asp Asn 


Val 


Glu 


Ser 


Leu 


Thr 


Ala 










165 








Arq 


Leu 


Gly 


Lys 


Arg 


Val 


Asp 


Phe 








ISO 










Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 






195 










200 


Phe 


Glu 


Asp 


Pro 


Ala 


Gly 


Arg 


Tyr 




210 










215 




Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


225 










230 






Unfc 


Pro 


Met 


Asp 


Tyr 


Ala 


Leu 


lie 










2-15 








Arg 


Leu 


Arg 


Ser 


Glu 


Val 


Glu 


Glu 








260 










Pro 


Glu 


cys 


Pro 


Glu 


Val 


Thr 


Pro 






275 










280 


Thr 


Glu 


Lys 


Ala 


Val 


Leu 


Phe 


Arg 




290 










2 95 




Asp 


Gin 


Leu 


lie 


Asn 


Leu 


Tyr 


Asp 


30S 










310 






Asn 


Glu 


Pro 


He 


Thr 


Val 


Val 


Gly 










325 








Gin 


Tyr 


Asn 


Glu 


Ly* 


Leu 


Ser 


Glu 








34 0 











Thr 


Leu 


Val 


Gly Ala 


He 


Ala 


Leu 




10 










15 




Val 


Pro 


Ala 


Thr 


Gly 


Gin 


Leu 


Pro 


25 










30 






Asn 


Lys Ala 


Gly 


Ser 


Asn 


Trp 


Phe 










4,5 








Ala 


Gin 


Phe 


Leu 


Asn 


Asp 


Asn 


Asn 








60 










Gly 


Ala 


lie 


Gly 


Ser 


Leu 


Ser 


Val 






75 










80 


Ala 


Thr 


Arg 


Leu 


Gin 


He 


Asn 


Gly 




90 










95 




Lys 


Asn 


Gu- 


G) u 


Gin 


Glu 


He 


Asn 


105 










110 






Asp 


Phe 


llet 


Phe 


Asp 


Val 


Val 


Asn 










125 








Ary 


Phe 


Phe 


Hir 




He 


Pro 


Trp 








1 10 










Phe 


He 


Gly 


Ser 


Glu 


Trp 


Ser 


Lys 






155 










160 


Asn 


Val 


Gly 


Val 


Met 


Met 


Ala 


Phe 




17 0 








175 




Val 


He 


Glu 


Ala 


Gin 


Ala 


Ala 


His 












190 






Tyr 


Asn 


Ala 


Lys 


L.ys 


Thr 


Pro 


Val 










2 05 








Tyr 


Asn 


Gly 


Phe 


Gin 


Gly Het 


Ala 








220 










Gly 


Ala 


Val 


Gly 


Phe 


Asn 


Ala 


He 






235 










240 


Asn 


Asp Leu 


Asn 


Gly 


Gin 


He 


Asn 




250 










255 




Leu 


Ser 


Lys 


Arg 


Pro 


Val 


Ser 


Cys 


265 










27 0 






Val 


Thr 


Lys 


Thr 


Glu 


Asn 


He 


Leu 










285 








Phe 


Asp 


Ser 


His 


Val 


Val 


Asp 


Lys 








30a 










Val 


Ala 


Gin 


Phe 


Val 


Lys 


Glu 


Thr 






315 










320 


Tyr 


Ala 


Asp 


Pro 


Thr 


Gly Asn 


Thr 




330 










335 




Arg 


Arg 


At a 


Lys 


Ala 


Val 


Val 


Asp 


345 










350 
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Val Leu Thr Gly Lyp Tyr Gly Val Pro Ser Glu Leu Il e Scr Val Glu 

360 36h 
Trp Ly- Gly Asp Ser Thr Gin Pro Phe Ser Lys Lys Ala Trp Asn Arg 

370 37b 3Q0 

Val Val He Val Arg Scr Lys 
385 390 

(2) INFORMATION FOR SEQ ID NO:4J8 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 305 amino acids 
tB) TYPE: amino acid 
iD) TOPOLOGY: linear 

(ixl MOLECULF. TYPE; protein 

(iia) HYPOTHETICAL: YES 

<V1 ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

U::) FEATURE: 

(A) NAME/ KEY: miscfeature 

(B) LOCATION 1...305 

{xi> SEQUENCE DESCRIPTION: SEQ ID MO: 4 38 

Met Thr Tyr Arg He Met Lys Ala Lys Ser Leu Leu Leu Ala Leu Ala 

15 10 15 

Gly Leu Ala Cys Thr Phe Ser Ala Thr Ala Gin Glu Ala Thr Thr Gin 

20 25 30 

Asn Lys Ala Gly Met. His Thr Ala Phe Gin Arg Asp Lys Ala Ser Asp 

3- 40 45 

His Trp Phe He Asp He Ala Gly Gly Ala Gly Ilet Ala Leu Ser Gly 

SO 55 60 

Trp Asn Asn Asp Val Asp Phe Val Asp Arg Leu Ser He Val Pro Thr 
65 * 70 75 80 

Phe Gly He Gly Lys Trp His Glu Pro Tyr Phe Gly Thr Arg Leu Gin 

85 g 0 q 5 

Phe Thr Gly Phe Asp He Tyr Gly Phe Pro Gin Gly Scr Lys Glu Arq 

100 105 no 

Asn His Asn Tyr Phe Gly Asn Ala His Leu Asp Phe Met Phe Asp L-u 

115 120 125 

Thr Asn Tyr Phe Gly Val Tyr Arg Pro Asn Arg Val Phe His He H« 

130 135 ld0 

Pro Trp Ala Gly He Gly Phe Gly Tyr Lys Phe His Ser Glu Asn Ala 
* 45 , ISO lfj5 16 n 

Asn Gly Glu Lys Val Gly Ser Lys Asp Asp Met Thr Gly Thr Val Asn 

165 170 175 

Val Gly Leu Met Leu Lys Phc Arg Leu Ser Ara Val Val Asp Phe Asn 

180 1 85 " i*c 

He Glu Gly Gin Ala Phe Ala Gly Lys Met Asn Phe He Gly Thr Ly« 

195 200 205 

Arg Gly Lys Ala Asp Phe Pro Val Met Ala Thr Ala Gly Leu Thr Phe 

210 215 220 

Asn Leu Gly Lys Thr Glu Trp Thr Glu He Val Pro Met Asp Tyr Ala 
225 230 235 240 

Leu Val Asn Asp Leu Asn Asn Gin He Asn Ser Leu Arg Gly Gin Val 

245 250 255 

Glu Glu Leu Ser Arg Arg Pro Val s-r Cys Pro Glu Cys Pro Glu Pro 

260 265 27n 

Thr Gin Pro Thr Val Thr Arg Val Val Val Asp Asn Val Val Tyr *h- 
2,b 280 285 

Ar9 Hi ASn Ser Ala Ly * Ile Ar 9 A£n Sin Glu He Asn Val Tyr 

290 295 300 

Asn Thr Ala Glu Tyr Ala Lys Thr Asn Asn Ala Pro He Lys Val Val 
05 110 315 -a-o 

Gly Tyr Ala Asp Glu Lys Thr Gly Thr Ala Ala Tyr Asn Met Lys Leu 

325 330 335 

Ser Glu Arg Arg Ala Lys Ala Val Ala Lys MeL Leu Glu L"* Tyr Gl - 

340 345 350 

Val Ser Ala Asp Arg He Thr lie Glu Trp Lys Gly Ser Ser Glu Gin 

355 360 365 

Ile Tyr Glu Glu Asn Ala Trp Asn Arg He Val Val Met Thr Ala Ala 
310 375 380 
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Glu 

385 



(Z) INFORMATION FOR SEQ ID HO* 4 3° 

(1> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 
IB ) TYPE: amino acid 
<D) TOPOLOGY: linear 

UU MOLECULE TYPE: protein 

(Hi) HY POTHET I CAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAN 1 SI I : p or phyromonas gingival!- 

(ix) FEATURE: 

(A) WANE/ KEY: misc feature 
{Q> LOCATION 1...190 

(xi) SEQUENCE DESCRTPTIOM: SEQ ID NO: 4 39 

Ilet Glu Fhe Phe Net Leu Phe lie Ala AJ a Val Fh- Val Asn Asn Val 

5 10 15 

Val Leu Ser Gin Phe Leu Gly He cys Pro Phe Leu Gly Val s-c L\- 

20 25 30 

Lys Val Asp Thr Ser He Gly Het Gly Ala Ala Val Thr Phe Val Leu 

Ala Leu Ala Thr Leu Val Thr Phe Leu lie Gin L' s Phe Val Leu Asp 

Arg Fhe Gly Leu Gly Phe Met Gin Thr He Ala Phe He Leu Val He 

Ala Ala Leu Val Gin Met Val Glu He lie Leu Lys Lys Val Ser Pro 

85 q 0 0 ci 

Pro Leu Tyr Gin Ala Leu Gly Val Phe Leu Pro Leu He Thr Thr Asn 

100 105 no 

Cys C iE Val Leu Gly Val Ala He Leu Val He Gin Lys Asp Tyr Thr 

115 120 125 

Leu Leu Gin Ser Phe Val Tyr Ala He Ser Thr Ala He Gly Phe Thr 

130 135 no 

Leu Ala Met Val Thr Phe Ala Gly He Arg Glu Gin Leu Asp Met Thr 



A*n Leu Pro Ly S Ala Mei Lys Gly Tie Pro Ser Ala Leu Leu Ala Ara 

170 175 
G1-, I_e Leu Ala Ilet Ala Phe Met Gly Phe Ser Gly He Ala 



180 



185 190 



(2) INFORMATION FOR SEQ ID NO: 4 -10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonns glngivalls 

< i x ) FEATURE : 

(AJ NAME /KEY: miac_f ea tur e 
(BJ LOCATION 1. . . 186 

(>:1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 4 4 6 

Met Leu Phe He Ala A!a Val Phe Val Asn Asn Val Val Leu Ser Gin 

Phe Leu Gly He Cys Pro Phe Leu Gly Val Scr Ly? Lys Val Asp Thr 

Ser He Gly Met Gly Ala Ala Val Thr Phe Val Leu Ala Leu Ala Thr 

Leu Val Thr Phe Leu lie Gin Lys Phe Val Leu Asp Arg Phe Gly Leu 
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SO 55 60 



Gly 


Phe 


Met 


Gin 


Thr 


He 


Ala 


Phe 


Ile 


Leu 


Val 


Tie 


Ala 


Ala 


Leu 


Val 


65 










70 










75 










80 


Gin 


MeL 


Val 


Glu 


He 


Ho. 


Leu 


Lys 


Lys 


Val 


Ser 


Pro 


Pro 


Leu 


Tyr 


Gin 










85 










90 










95 




Ala 


Leu 


Gly 


Val 


Phe 


Leu 


Pro 


Leu 


He 


Thr 


Thr 


Asn 


Cy3 


Cys 


Val 


Leu 








100 










105 










110 






Gly 


Val 


Ala 


He 


Leu 


Val 


He 


Gin 


Lys 


Asp 


Tyr 


Thr 


Leu 


Leu 


Gin 


Ser 




115 










120 










12S 








Phe 


Val 


Tyr 


Ala 


He 


Ser 


Thr 


Ala 


He 


Gly 


Phe 


Thr 


Leu 


Ala 


Met 


Val 




130 










135 










140 










Thr 


Phe 


Al~ 


Gly 


He 


Arg 


Glu 


Gin 


Leu 


Asp 


(let 


Thr 


Asn 


Leu 


Pro 


Lys 


145 










ISO 










155 










160 


Al D 


net. 


Lys 


Gly 


He 


Pro 


Ser 


Ala 


Leu 


Leu 


Alo 


Ala 


Gly 


He 


Leu 


Ala 










165 










17 0 










175 




Uet 


Ala 


Phe 


Uet 


Gly 


Phe 


Ser 


Gly Il- 


Ala 















180 185 



(2) ItlFORI IATION FOR SEQ ID NO: 441 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acid- 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui} MOLECULE TYPE: protein 

liii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphy tomonas gingival la 

Ux} FEATURE: 

(A) IIAME/KEY: mis? cofeature 
( D > LOCATION 1 . . . 833 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 41 



Met 


Lys 


Gin 


Leu 


Asn 


He 


He 


Ser 


Phe 


He 


He 


Ala 


Phe 


Leu 


Phe 


Leu 


1 








5 










10 










15 




Gly Thr 


Ser 


Ala 


Ser 


Ala 


Gin 


Gin 


Ser 


Gly 


Gly 


Ser 


Val 


Thr 


Gly Thr 








20 










25 










30 






Val 


Val 


Asp 


Lya 


Ser 


Ser 


Lys 


Glu 


Pro 


He 


Ala 


Tyr 


Val 


G] n 


Val 


Phe 






35 










40 










45 








Val 


Lys 


Gly 


Thr 


Thr 


Leu 


Gly 


Thr 


Ser 


Thr 


Asp 


Ala 


Asn 


Gly Asn 


Tyr 




50 










55 










60 










Ser 


He 


Lys 


Gly 


He 


Pro 


Ser 


Gly 


Asn 


Gin 


Thr 


He 


Val 


Ala 


Arg 


Leu 


65 










7o 










75 










80 


Met 


Gly 


Tyr 


Ser 


Thr 


Cys 


Glu 


Glu 


Lys 


Val 


His 


He 


Glu 


Lys 


Gly Gly 










85 










90 










95 




Ser 


Arg 


His 


Val 


Asp 


Leu 


Tyr 


Leu 


Thr 


Glu 


Glu 


He 


Leu 


Ser 


Leu 


Asp 






100 










105 










130 






Gly Val 


Val 


Val 


Ser 


Ala 


Asn 


Arg 


Asn 


Glu 


Thr 


Phe 


Arg Arg 


Gin 


Ala 






115 










120 










125 








Pro 


Ser 


Leu 


Val 


Thr 


Val 


Leu 


Ser 


Pro 


Glu 


Leu 


Phe 


Leu 


Lys 


Thr 


Asn 




130 










135 










140 










Ser 


Thr 


Asn 


Leu 


Ser 


Gin 


Gly 


Leu 


Lys 


Phe 


Gin 


Pro 


Gly Leu 


Arg 


Val 


145 










150 










15b 










160 


Glu 


Asp 


Asn 


Cys 


Gin 


Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 


Arg 


He Asn Gly 










165 










170 










175 




Leu 


Glu 


Gly 


Ala 


Tyr 


Ser 


Gin 


He 


Leu 


He 


Asp 


Ser 


His 


Pro 


He 


Phe 








180 










185 










190 






Ser 


Ser 


Leu 


Ala 


Gly 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


Met 


Pro 


Ala 


Asn 


Met 






195 










200 










205 








He 


Gl\i 


Arg 


val 


Glu 


Val 


He 


Arg 


Gly 


Gly 


Gly 


Ser 


Ala 


Leu 


Phe 


Gly 




210 










215 










220 










Ser 


Asn 


Ala 


Val 


Gly 


Gly 


Val 


lie 


Asn 


Val 


Ile 


Thr 


Lys 


Glu 


Pro 


Leu 


225 










230 










235 










240 


Arg 


Asn 


Scr 


Ala 


Glu 


He 


Ser 


His 


Ser 


Thr 


Uet 


Thr. 


_Phe 


Asp 


His 


Ala 










245 










250 










255 




Lys 


Gly 


Trp 


Gly 


Ser 


Phe 


Gin 


Asn 


Thr 


Thr 


Gin 


Phe 


Asn Gly 


Ser 


Met 








260 










265 










270 






Lou 


Thr 


Glu 


Asp 


Arg 


Lys 


Ala 


Gly 


Val 


Met 


Val 


Phe 


Gly Gin 


His 


Asn 






275 










280 










285 








Tyr 


Arg 


Pro, 


»Gly 


Gin 


Asp 


Tie 


Asp 


Gly 


As?p 


A^n 


Phe 


Thr 


Glu 


Leu 


Pro 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



380/ 490 





290 










2*5 










300 










Asn 


Leu 


Arg 


Asn 


Arg 


Ser 


Leu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Tyr 


Lys 


Thr 


Gly 


305 










310 










315 










320 


Leu 


T>r 


Ssr 


Lys 


Ala 


Thr 


Leu 


Glu 


Tyr 


His 


Ser 


Het 


Gin 


Glu 


Tyr 


Arg 










325 










330 










335 




Arg 


Gly 


Gly 


Asp 


Arg 


Leu 


Asp 


Asn 


Pro 


Pro 


Phe 


Glu 


Ala 


Gin 


He 


Al a 








340 










345 










350 






Glu 


Tyr 


Leu 


G)n 


His 


Tyr 


He 


Asn 


Gl/ 


Gly 


Ser 


Phe 


Lys 


Phe 


Afp 


Gin 






356 










360 










365 








Gly 


Phe 


Ser 


Gly 


Gly 


Lys 


Asp 


Phe- 


Phe 


Ser 


T,eu 


Tyr 


Ala 


Ser 


Ala 


Gin 




37 0 










37 5 










380 










Asp 


Val 


Gin 


Arg 


Arg 


S~r 


Tyr 


Tyr 


Gly 


Gly 


Gly Acp 


Tyr 


Thr 


Glu 


Asn 


385 










390 










395 










40O 


Leu 


Leu 


Am 




Ala 


Val 


Gin 


Ser 


Gly 


Sor 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 










4Q5 










41C 










415 




Am 


Asp 


Ala 


Phe 


Thr 


Ala 


Leu 


Thr 


Ser 


Tyr 


Gly 


Thr 


Thr 


Lys 


Gly 


Phe 








420 










42S 








430 






Asp 


Leu 


Gin 


Gly Gly 


Gl> 


Met 


Tyr 


Arg 


His 


Thr 


Phe 


Gly 


Glu 


Asn 


Trp 






435 










440 










445 








Asp 


Phe 


Thr 


Gly 


illy 


Leu 


Glu 


Tyr 


lie 


Tyr 


Gly 


Gin 


Leu 


Asp Asp Arg 




4 50 










455 










460 










Ser 


Gly 


Tyr 


Arg 


Pro 


Set 


Lys 


He 


Asp 


Gin 


Am 


Thr 


Ser 


Thr 


Phe 


Ser 


4 AS 










470 










4~5 










430 


Gin 


Tyr 


Arp 


Gin 


Leu 


Glu 


Tyr 


Lys 


Thr. 


Glu 


Lys 


Leu 


Ser 


Ala 


Leu 


He 










4 85 










4 90 










4 95 




Gly 


Ala 


Arg 


He 


Asp 


Tyr 


Val 


Leu 


Lsu 


Asn 


Gin 


Aso 


Gly 


Lys 


Arg 


Tyr 








500 










505 










510 






T1 p 


A.«p 


Pro 


Leu 


Phe 


He 


Phe 


Ser 


Pro 


Arg 


Ala 


Asn 


Val 


Arg 


Tyr 


Asn 






515 










520 










S25 






Pio 


A&n 


Lye 


Asn 


Leu 


Scr 


Phe 


Axq 


Leu 


Ser 


* j 


Ser 


Glu 


Gly 


Phe 


Arg 




530 










535 










54 0 










Ala 


Pro 


Gin 


Tyr 


Che 


Asp 


Glu 


Asp 


Leu 


His 


Val 


Glu 


Leu 


Ala 


Gly Gly 


545 










£50 










555 










560 


Thr 


Pro 


He 


Ser 


Arg 


Val 


Lou 


Ser 


Pro 


Asn 


Leu 


Lys 


Glu 


Glu 


Arg 


Ser 










565 










570 










575 




Arg; 


Ser 


Tie 


Ser 


Ala 


Ser 


Phe 


Asp 


Tyr 


Tyr 


His 


Arg 


Ala 


Asp 


Glu 


Trp 








580 










585 










590 






Gin 


Phe 


Attn 


He 


Ilet 


Gly 


Glu 


Ala 


Phe 


Ser 


Thr 


Phe 


He 


Ser 


Asn 


Gin 






595 










600 










605 








Phe 


Lys 


Pro 


Ser 


Asp 


Lys 


Val 


Glu 


Thr 


Thr 


Ser 


Asp 


Gly Lys 


Glu 


Trp 




610 










615 










620 










He 


He 


Arg 


Thr 


He 


Tyr 


Asn 


Asp 


I.ya 


Asp 


Gly 


Val 


Ser 


Lys 


Val 


Tyr 


625 










630 










635 










640 


Gly 


Val 


Am 


L~u 


Glu 


Gly 


Arg 


He 


Ala 


Tyr 


Asn 


Lys 


Ser 


Phe 


Asp 


Leu 










645 










650 










655 




Gln 


Leu 


Gly 


Gly 


Thr 


Trp 


Gin 


Arg 


Ser 


Arg 


Tyr 


Gly 


ser 


He 


Tyr 


Thr 








660 










665 










67 0 






Ala 


Val 


Glu 


Ala 


Asp 


Lys 


Thr 


Thr 


Gly 


Gin 


Ale 


Glu 


He 


Ser 


Val 


Lyc 






67 5 










680 










6S5 








Asp 


Tyr 


Val 


Arg 


Thr 


Pro 


Asn 


Leu 


Tyr 


Gly 


Tyr 


The 


Val 


Ala 


Thr 


Val 




6 90 










695 










700 










Arg 


Pro 


Thr 


Glu 


His 


Phe 


Ala 


He 


Asn 


Leu 


Ser 


Gly 


Thr 


Fhe 


Thr Gly 


705 










710 










71S 










720 


Lys 


liet 


Asp 


Val 


Val 


His 


Glu 


Ala 


Tyr 


Glu 


Giy 


Asp 


He 


Pro 


Al<* 


Glu 










725 










730 










73S 




His 


lie 


Ala 


Pro Asp 


Gly 


See 


Phe 


Asp 


Phe 


Glu 


Met 


Asn 


Gly 


Gin 


Gin 








740 










745 










750 






Pfcc 


Lys 


Gly 


Leu 


Ala 


Glu 


Gly 


His 


Ala 


Lys 


Leu 


Val 


Lys 


Thr 


Pro 


Ala 






755 










760 










7 65 








Phe 


Ala 


Asp 


He Asp 


Leu 


Lye 


Leu 


Ser 


Hi* 


Asp 


Phe 


His 


Leu 


Ala 


Ser 




770 










775 








780 










Thr 


Het 


Thr 


Leu 


Glu 


Leu 


Asn 


Ala 


Gly 


He 


Gin 


Asn 


lie 


Phe 


Asn 


Sor 


785 










790 








7 95 










800 


Tyr 


Gin 


Lys 


Asp 


Thr 


Acp 


Lys 


Gly 


Pro 


Cly 


Arg 


Ala 


ser 


Thr 


Tyr 


Val 










905 










810 










815 




Tyr 


Gly 


Pro 


Hef 


Gin 


Pro 


Arg 


Arg 


He 


Phe 


Val 


Gly Thr 


Lys 


He 


Asn 








820 










925 










8J0 






Phe 

































(2) II I FORI IAT J ON FOR SEQ ID NO: 4 42 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 091 amino acids 

(B) TYPE: amino acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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381/490 

{ D} TOPOLOGY: linear 

(ii> MOLECULhl TYPE: piotein 

liii) HYPOTHETICAL: YES 

(vl ) ORIGINAL SOURCE: 

(A) ORGAN I SI I: Porphyrons nas gingival is 

(ix) FEATURE: 

(A) MAI IE/ KEY: mlsc_featute 
(3) LOCATION 1 . . .891 

(xi> SEQUENCE DE3CRIPTIOH: SEQ 10 HO:442 

Met Tyr Lys Lys lie He Ala Val Ala Ala Leu Phe Cyt> Ala Ser II- 
1 _ 5 10 15 

Gly xle Leu Lys Gly Gin Ser Ser Asp Leu Thr Pro Gin Asp Th- 11- 
20 25 30 * " 

Tyr Ser Pro Glu Ho Ser Tyr Ala Lys- Pro He His Lye Thr He Al ^ 

35 40 4t, 

Ser He Glu lit Glu Gly Met Arg Ser Phe Asp Asp Fhe Val Leu Arg 

50 55 6C 

A.«r> Leu Ser Gly Leu Ala Val Gly Asp Glu Val Leu He Pro Gly A-p 
65 70 75 eo 

Ala Ilet S 61 Ala Ala Val Asn Arg II- Met Arg Gin Gly Tyr Fhe S-r 

85 90 a- 

Asn Val Arg He He Ala Asp Lys T>r Val Gly Asn T.ys Val Tyr Leu 

100 105 no 

Lys He He Val Thr Glu Arg Pro Arg He Ser Lys Val Thr Phe S-r 

115 120 125 

Gly Val Lys l.yr Ser Glu Arg Glu Asp Leu Glu Het Lys He Gly Leu 

130 135 14 y 

Arg Glu Gly He Gin Met Thr Arg Asn Asn Glu Asp Lys Val Arg Gin 



145 150 



155 160 



He Val Gin Lys Tyr Phe Ser Glu Lys Gly Tyr Arg Asp Ala Ser II- 

165 170 i7s 

Arg He Thr Gin Glu Pro Asp Leu Ser Lys Asp Gly fc»he Val Asn Val 

100 185 190 

Leu He Ser lie Glu Lys Lys Ser Lys Thr Lys Val Asn Glu He Tyr 

155 200 205 

Phe Ser Gly Asn Lys Ala Leu Ser Asn His Lys Leu Arg Met Ala M-t 

210 215 220 

Lys Asn Thr Asn Ala Lys Phe Ser Leu Arg Lys H1f He Arg Ser S~r 
225 230 235 240 

Phe Leu Lys Leu Phe See Thr His Lys Phe Val Glu Glu Ser Tyr Arg 

245 250 255 

Glu Asp Leu Val Arq Leu He Glu Lys Tyr Gin Glu Tyr Gly Tyr Arq 

260 265 270 

Asp Ala Glu He Leu Thr Asp £cr Val Val Lys Ala Pro Asp Gly Lvs 

275 280 285 

Arg Val Asp He Tyr Leu Asn He Glu Glu Gly Gin Lys Tyr Tyr II- 

290 2?5 3 oo 

Lys Asp Val Asn Phe V*l Gly Asn Ser Gin Tyr Pro Ser Glu T"r L*u 
305 310 31b 

Glu Arg Val Leu Gly He Lys Ser Gly Asp Val Tyr Asn Gin Arg Arg 

325 330 33 ! 

Leu Ala Lys Arg Leu Asn Glu Asp Glu Asp Ala Val Gly Asn T.eu Tyr 

340 345 350 

Tyr Asn Asn Gly Tyr He Phe Ala Trp Val Asp Pro Val Glu Thr Asn 

355 36O 3 6 5 

Val Val Gly Asp Ser Val Ser Leu Asp He Arg He Ala Glu Gly Lys 

370 375 380 

G)n Ala Asn He Asn Lis Val He He Lys Gly Asn Thr Val Val Tyr 
390 395 400 

Glu Asp Val Val Arg Arg Glu Leu Tyr Thr Lys Pro Gly Gin Leu Phe 

405 410 415 

Ser Arq Glu Asp He He Asn Ser He Arg Leu He Asn Gin L~u G1-" 

420 425 , 43 n 

His Phe Asp Ma Glu Lys Ser He Pro Aiy Pro He Pro A«>n Pro Glu 

■135 440 445 

Thr Gly Thr Vai Asp He Glu Tyr Asp Leu Val Pro Arg Ser Ser Asp 

450 455 4 C o 

Gin Leu Glu Leu Ser Val Gly Trp Ser Gin Ser Gly Leu Leu Ph~ Arg 
465 470 475 " 48O 
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Gly 


Ala 


lie 


Lys 


Phe 


Thr 


Aen 


Phe 


Ser 


Val 


Gly 


Asn 


Leu 


Leu 


His 


Pro 










485 










4 90 










4 95 




Ser 


Met 


Tyr 


Lys 


Lys 


Gly 


He 


lie 


Pro 


Gin 


Gly 


Asp 


Gly 


Gin 


Thr 


Leu 








500 










505 










510 






Ser 


Leu 


Ser 


Ala 


Gin 


Thr 


Asn 


Gly 


Lys 


Tyr 


Tyr 


Gin 


Gin 


Tyr 


Ser 


Val 






515 










52U 








52 5 






Thr 


Phe 


Met 


Asp 


Pro 


Trp 


Phe 


Gly 


Gly 


Lys 


Arg 


Pro 


Asp 


Met 


Phe 


Ser 




530 










535 










540 










The 


Ser 


Ala 


Phs 


Tyr 


Ser 


Lys 


Thr 


The 


Ala 


He 


Asp 


Ser 


Lys 


Phe 


Tyr 


545 










550 










555 










560 


Asn 


Ser 


Asn 


Ala 


Gly 


Asn 


Tyr 


Tyr 


Asn 


Ala 


Tyr 


Tyr 


Asn 


Ser 


Tyr 


Tyr 










56b 










57 0 










575 


Asn 


Asn 


Tyr 


Asn 


Ser 


Tyr 


Tyr 


Asn 


Gly 


Met 


Ser 


Asn 


Tyr 


Thr 


Gly Asp 








500 










505 








590 






Leu 


Tyr 


Thr 


Gin 


Ala 


Ser 


A^p 


Pro 


Asp 


Arj 


Ser 


Leu 


Gin 


Met 


Leu 


Gly 






595 










600 










60S 






Thr 


Ser 


lie 


Gly Tyr Gly Lys 


Arg 


Leu 


Thr 


Trp 


Pro 


Asp 


Asn 


Trp 


Phe 




610 










615 










620 










Gin 


lie 


Tyr 


Thr 


Ser 


Leu 


Asn 


Tyr 


Thr 


Tyr 


Tyr 


Arg 


Leu 


Arg Asn 


Trp 


625 










630 










635 










610 


Ser 


Tyr 


Asn 


Thr 


Phe 


Gin 


Arn 


Phe 


HI s 


HlF 


Gly 


Ser 


Ala 


Asn 


Asp 


Leu 










645 










650 










655 




Asn 


Leu 


Glu 


Leu 


Arq 


Leu 


ser 


Arg 


Thr 


Sei 


He 


Asp 


Asn 


Pro 


lie 


Tyr 








660 










665 










670 






Thr 


Arg 


Ser 


Gly 


Ser 


Asp 


Phe 


Het 


Vsl 


Ser 


Val 


Ala 


Ala 


Thr 


Leu 


Pro 






675 










680 










685 








Tyr 


Scr 


Leu 


Trp 


Asp 


Asn 


His 


Asp 


Tyr 


Al« 


Ser 


Gin 


Asn 


Leu 


Ser 


Val 




690 










695 










700 










Ser 


Asp 


Arg 


Tyr 


Arg 


Phe 


He 


Glu 


Tyr 


His 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


7 05 










710 










715 










720 


Arg 


Val 


Phe 


Thr 


Pro 


Leu 


Leu 


Asn 


Pro 


Ala 


Thr 


His 


Lys 


Tyr 


The 


Pro 










725 










7 30 










735 




Val 


Leu 


Met 


Ser 


Arg 


Val 


Glu 


Gly Ala Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 








740 










7-15 










750 






Asn 


Lys 


Lys 


S<=r 


Pro 


Phe 


Gly Thr 


Phe 


Tyr 


net 


Gly Gly 


Asp 


Gly Met 






755 










760 










765 








Ser 


Ser 


Tyr 


Tyr 


Gly 


Gly 


Tyr 


Met 


Asn 


Glu 


Thr 


He 


Gly 


Leu 


Arg 


Gli- 




770 










775 










7B0 










Tyr 


Lys 


Asn 


Gly 


Ser 


lie 


Ala 


Gly 


Asn 


Asn 


Tyr 


Asp 


Tyr Ala 


Tyr 


Ala 


785 










790 










795 










800 


T>T 


I let 


Arg 


Leu 


Thr 


Met 


Glu 


Leu 


Arg 


Phe 


Pro 


He 


Leu 


Phe 


Glu 


Asn 










805 










81 0 










815 




Ser 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 


Phe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp 








820 










825 










830 






Arg 


Ser 


lie 


Asp 


Asn 


Tyr 


Asn 


Pro 


Phe 


Asn 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly 






835 










840 










845 






Val 


Gly 


Leu 


Arg 


Val 


Thr 


Leu 


Pro 


Met 


Val 


Gly 


Met 


Leu 


Gly 


He 


ASp 




eso 










855 










860 










Trp 


Gly 


Tyr 


Gly 


Phe 


Asp 


Arg 


Pro 


Asp 


Asn 


Ser 


Leu 


Gin 


Arg 


Gly 


Gly 


865 










870 










875 










BOO 


Ser 


Asn 


Val 


His 


Fhe 


Val 


Leu 


Gly Gin 


Glu 


Phe 













885 8 90 



(2) INFORMATION FOR SEO ID NO: 443 

(1) SEQUENCE CHARACTERISTICS: 

(A J LEI1GTH: 17 4 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE : protein 

{ 1 J 1 > HYPOTHETICAL; YES 

Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

I A) NAME/ KEY : rtusc feature 

(B) LOCATION 1, . . 174 

{xil SEQUENCE DESCRIPTION : SEQ ID DO: 443 

Met Asn Gly Asp Uet Lye Arq Phe Leu He Leu He Gly Phe Ala Leu 
3 5 1C 15 
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Ala Val Ala Phe Ser Gly Phe Ser Gin Lys Phe Ala Leu Val Asp Met 

20 25 30 

olu Tyr He Leu Arg Asn He Pro Asp Tyr Glu HeL Met Asn Glu Gin 

1S 4 0 45 

Leu Glu Gin Val Scr Lys T.y? Trp Gin Asn Glu He Glu Ala Leu Glu 

50 55 60 

Asn Glu Ala Gin Ser Met Tyr Lys Lye Tyr Gin Ser Asp Leu Val Phe 
65 70 75 80 

Leu Ser Ala Ala Gin Lys Lys Thr Gin Glu Glu Ala lie Val Lys Lys 

85 an 55 

Glu Gin Gin Ala Ser Glu Leu Lys Arg Lys Tyr Phe Gly Pro Glu Gly 

105 HO 
Glu Leu Tyr Lys Lys Arg Ser Asp Leu Met Lys Pro He Gin Asp Glu 

115 120 125 

He Trp Asn Ala lie Lys Glu He Ala Lys Arg Asn Asn Tyr Gin Met 

130 135 14Q 

Val Leu Asp Arg Gly Thr Ser Gly He II- Phe Ala Ser Pro Ser He 
14i> 150 155 160 

Asp He Ser Asp Leu Val Leu Ser Lys Hot Gly Phe Scr Lys 
1G5 170 

(2) III FORI 1ATION FOR SEQ ID HO: 44 4 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino aci<U 

(B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(it) MOLECULE TYPE: protein 

U 1 x I HY POTHET ICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona? qingivalis 

(1x> FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1...170 

<xi } SEQUENCE DESCRIPTION: SEQ ID NO: 414 



Met 


Lys 


Arg 


Phe 


Leu 


He 


Leu 


He 


Gly 


Phe 


Ala 


Leu 


Ala 


Val 


Ala 


Phe 


1 


Gly 






5 










10 










15 


Ser 


Phe 


Ser 


Gin 


Lys 


Phe 


Ala 


Leu 


Val 


Asp 


Mot 


Glu 


Tyr 


lie 


Leu 


Arg 






20 










25 










30 






Akii 


He 


Pro 


Asp 


Tyr 


Glu 


Met 


Met 


Asn 


Glu 


Gin 


Leu 


Glu 


Gin 


Val 






35 










40 










45 






Ser 


Lys 


Lys 


Trp 


Gin 


Asn 


Glu 


lie 


Glu 


A1 a 


Leu 


Glu 


Asn 


Glu 


Ala 


Gin 




50 










55 










60 






Ser 


Met 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Ser 


Asp 


Leu 


Val 


Phe 


Leu 


Ser 


Aid 


Ala 


65 










70 










75 








80 


Gin 


Lys 


Lys 


Thr 


Gin 


Glu 


Glu 


Ala 


11* 


Val 


Lys 


Lys 


Glu 


Gin 


Gin 


Ala 


Ser 


Glu 






85 










9C 










95 


Leu 


I.ys 


Arg 


Lys 


Tyr 


Phe 


Gly 


Pro 


Glu 


Gly Glu 


Leu 


Tyr 


Lys 


Lys 






100 










105 










110 


Arq 


Ser 


Asp 


Leu 


Met 


Lys 


Pro 


He 


Gin 


Asp Glu 


11- 


Trp 


Asn 


Ala 


He 




115 










120 










125 






Lys 
130 


Glu 


He 


Ala 


Lys 


Arg 
135 


Asn 


Asn 


Tyr 


Gin 


Met 

igo 


Val 


Leu 


Asp 


Arg 


Gly 
145 


Thr 


Ser 


Gly 


He 


He 
150 


Phe 


Ala 


Ser 


Pro 


Ser 
155 


He 


Asp 


He 


Ser 


Asp 

IfiO 


Leu 


Val 


Leu 


Ser 


Lys 


Met 


Gly 


Phe 


Ser 


Lys 













165 170 

(2) INFORMATION FOR SEQ ID NO: 445 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 163 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: p rot sin 

{Hi) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(vj. ) ORIGINAL SOURCE: 

(AJ ORGANISM; Po tphyromonafl gingivalis 

iix) FEATURR: 

(A) NAME/ KEY: misc_f ea ture 
<B) LOCATIOIJ 1. . .163 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: '145 



llet 


Lys 


Lys 


Phe 


Phe 


Leu 


Met 


Leu 


Leu 


Hst Ala 


Leu 


Pre 


Leu 


Ser 


Leu 


1 








5 










10 








15 




Lou 


A J a 


Gin 


Lys 


Val 


Ala 


Val 


Val 


Asn 


Thr Glu 


Glu 


He 


He 


Ser 


Lys 








20 










25 








30 






Met 


Pro 


Glu 


Gin 


Val 


Ala 


A3 a 


Thr 


Lys 


Gin Leu 


Asn 


Glu 


Leu 


Ala 


Glu 






35 










40 








45 








Lys 


Tyr 


Arg 


Leu 


A^p 


Leu 


Lys 


Ser 


Met 


Asp Asp 


Glu 


Phe 


Ala 


Lys 


Lys 




50 










55 








60 










Thr 


Glu 


Glu 


Phe 


Val 


Lys 


Glu 


Lys 


Asp 


Ser Leu 


Leu 


Glu 


Arn 


He 


Arg 


65 










70 








75 












Asn 


Arq 


Arq 


Gin 


Gin 


Glu 


Leu 


Gin 


Asp 


He Gin 


Thr 


Arg 


Tyr 


Gin 


Gin 










as 










90 








<>5 




Ser 


Tyx 


Gin 


Thr 


Met 


Gin 


Glu 


Asp 


Leu 


Gin Lys 


Aig 


Gin 


Gin 


Gin 


Lsu 








100 










105 








110 






rhs 


Ala 


Tro 


lie 


Gin 


Gin 


Lys 


Val 


Aid 


Asp Ala 


lie 


T.y.» 


Lys 


Val 


Gly 






115 










120 








125 








Asp 


Glu 


Glu 


Rsn 


C/e- 


Ala 


Tyr 


Tl » 


J let 


Glu Ala 


Gl y 


Met 


Met 


L&u 


Tyr 




130 










135 








1 JO 










Thr 


Gly 


A1 a 


Thr 


Ala 


lie 


Arp 


Leu 


Thr 


Al* Lys 


Val 


Lys 


Ala 


Lys 


Leu 


1 15 










1 SO 








155 










160 


Gly 


lie 


Lys 



























(2) INFORMATION FOR SEQ ID 110:446 

II) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 amino acids 

(B) TYPF.: amino acid , 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

<iiil HYPOTHETICAL: YF.S 

(vi) ORIGINAL SOURCE: 

(A| ORGANISM: Porphyromonas gingivalis 

( ix 1 fc'fclATURE : 

(A} HAME/KKY: miscjeature 
iB) LOCATION 1 . . ,827 



txi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 46 



Met 


Lye 


Glu 


Ala 


Ile 


Pro 


Arg 


Lys 


Asn 


Lys 


Tyr 


He 


Lys 


Leu 


Asn 


Gly 


1 








5 










10 










15 




He 


Tyr 


Arg 


Leu 


Ser 


Phs 


He 


Leu 


Leu 


Cys 


Cys 


Leu 


Leu 


Cys 


Sec 


Gin 








20 










25 










30 






Ala 


Ala 


Met 


Ala 


Gin 


Gly 


Val 


Arg 


Val 


Ser 


Gly 


Tyr 


Val 


Leu 


Asp 


Arg 






35 










40 










4 5 








Gly 


Glu 


Lys 


Pro 


Tie 


Pro 


Ph» 


Ala 


Gly 


Val 


Lys 


V**l 


Arg 


Gly Thr 


Gly 




50 










55 










60 








Thr Gly Ala 


Thr 


Thr 


Asn 


Leu 


Lys 


Gly 


Tyr 


Tyr 


Glu 


Phe 


Arg Met 


Lys 












70 










75 










80 


Ala 


Thr 


Thr 


Asp 


Ser 


He 


Thr 


He 


Glu 


Phe 


Ser 


Ser 


Met 


Gly 


Tyr 


Gin 










85 










90 








?5 




Sly 


Va) 


Ser 


Arg 


a»r 


Phe 


Pro 


Ser 


Leu 


Thr 


Lys- 


Asp 


Thr 


Arq 


Leu 


Asn 








1 0^ 










105 










110 






Val 


Arg 


Leu 


Ala 


Glu 


Ala 


Glu 


Iter 


Glu 


Leu 


Ser 


Ser 


Val 


Thr 


Val 


Gin 






115 










120 










125 








Ala 


Thr 


I.y* 


Arg 


Arq 


Leu 


Asn 


Thr 


Met 


Glu 


Arg 


VaU 


Asn 


Thr 


Arg 


Asp 




130 










135 










140 










Leu 


Arg 


Val 


Asn 


Ala 


Gly 


Pro 


Thr 


Gly 


Gly 


Val 


Glu 


Set 


Leu 


He 


Ser 


145 










150 










15b 










160 


Thr 


Tyr 


Ala 


Gly 


Val 


Thr 


Gin 




Asn 


Glu 


Leu 


Ser 


Ser 


Gin 


Tyr 


Ser 










165 










170 










175 




Val 


Arg 


Gly 


Gly 


Ser 


Tyr 


Ab-p 


Glu 


Asn 


Met 


Val 


Tyr 


Val 


Asn 


Gly 


Vol 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



100 



Glu 


\al 


Tyr 
1 95 


Arg 


Pro 


Leu 


Lsu 


''GO 


Ser 


Phe 
210 


Val 


Asn 


Pro 


Asp 


Ltu 

215 


Thr 


Gly 


Plie 


Thr 


Ala 


Asp 


Tyr 


Gly 


Asp 


225 










230 






Arg 


Tyr 


Lys 


Gin 


Pro 
24 5 


Gin 


Glu 


i*ys 


Leu 


Gin 


Ser 


Ssr 
260 


Al s 


Tyr 


Tyr 


Gly 


lie 


Thr 


Gly 

275 


Val 


Arg 


Tyr 


Lys 


Ser 
280 


Anp 


Thr 


Lys 


Ala 


Glu 


Tyr 


Asp 
295 


Pro 


Met 


Thr 


Tyr 


Arg 


Phe 


Ser 


Pro 


Lys 


305 










31 0 






71* 


Ser 


Gin 


Thr 


Arg 

325 


Tyr 


L>s 


Pne 


Phe 


Gly 


Thr 


Lt;u 
34 0 


Ser 


Asp 


Ala 


Lys 


Gin 


Glu 


CI n 
355 


Asp 


Arg 


Phe 


Leu 


Thr 
360 


phe 


Val 


Fro 


Asp 


Asp 


Lys 


Gin 

37 5 


Arg 


Asn 


Ser 


A:n 


Glu 


Arg 


Glu 


Thr 


Tyr 












390 






Asn 


Asp 


Val 


Gin 


Leu 

405 


Gly 


Ala 


Asp 


Sex 


Glu 


Acn 


Ser 
420 


Asn 


Gly 


Leu 


Gly 


Arg 


Asn 


Arg 
435 


Leu 


Ser 


Tyr 


Axg 


Vsl 
440 


MsL 


Lys 
450 


Leu 


Asn 


Glu 


Lys 


His 
455 


Arg 


Met 


Glu 


Lys 


He 


Ala 


Asp 


His 


He 


4€S 










47 0 






Val 


Gly 


Tyr 


Asn 


Leu 
4 85 


Pro 


His 


Ser 


Asn 


Leu 


Tyr 


Ala 
500 


Asp 


Thr 


Gin 


11* t 


Val 


CI n 


Asp 
515 


Acg 


Phe 


Asn 


Phe 


Ser 

520 


He 


Pro 


Gly 


lie 


Arg 


Ala 


Ser 
535 


Trp 


Val 


Ser 


Pro 


Arg 


He 


Ser 


Val 


Gly 


54 5 










550 






Leu 


Val 


Leu 


Arg 


Ala 

565 


Ala 


Ala 


siy 


Lys 


Glu 


Leu 


Arg 
58 0 


Gin 


Thr 


His 




Val 


Leu 


At; n 
595 


Glu 


Lys 


He 


Arg 


Ser 
600 


Gly 


Ala 
610 


A.=p 


Tyr 


Thr 


Phe 


Glu 

615 


l-'.et 


Ala 


Glu 


Ala 


Tyr 


Tyr 


Lys 


Ser 


Leu 


625 










630 






Glu 


Asn 


Val 


Lys 


He 
645 


Arg 


Tyr 


Leu 


Ala 


Ala 


Gly 


He 
660 


Asp 


Leu 


V -e 


Leu 


Asp 


Ser 


Trp 

675 


Leu 


Thr 


Ala 


Ser 


He 

680 


Gly 


Tyr 
690 


Gly 


Ser 


l.cu 


Pro 


Leu 
695 


:iet 


Pha 


Phe 


Leu 


Gin 


Glu 


Tyr 


Val 


Pro 


7 0S 










710 






Leu 


Arg 


Ala 


Ala 


Leu 
725 


Ser 


Gly 


Gly 


Gly 


Leu 


Ser 


Ser 
740 


Pro 


Ala 


Phe 


Thr 


Leu 


Gly 


Val 
755 


Met 


Tyr 


Lys 


Trp 


Leu 
760 


Axg 


Ser 
770 


Lys 


Trp 


Leu 


Met 


Gly 

775 


Val 



385/ 490 



185 










190 






Arg 


er 


a 


n 


Gin 


Glu 


Gly 


Leu 










205 








tain 


er 


V 


GJ n 


Phe 


Ser 


Ala 


Gly 








gf ° 










ys 




S«*r 


er 


Va 1 


Leu 


Asp 


He 






235 










240 


Glu 


Gly 


Al a 


Vu 1 


Leu 


Leu 


Gly net 




250 










255 






Ser 


Ala 


Gly 


Ala 


Phe 


Ser 


Gin 












27 0 






Ala 


Lys 


Ser 


Leu 


Leu 


Gly Thr 


Thr 










285 








He 


Tyr 


Ala 


Asp 


Gly 


Gin 


Thr 


Phe 








300 










Leu 


Ser 


Val 


Ser 


Phe 


Leu Gly 


Asn 






315 










320 


Val 


Pro 


Gl n 


Tnr 


Arg 


Glu 


Thr 


Ser 




330 










335 




Lys 


Leu 


Lys 


He 


Phe 


Phe 


Asp 


Gly 


34 5 










350 






Tyr 


Phe 


Gly 


Ala 


Phe 


Ser 


Met 


Asn 










365 








His 


Thr 




Thr 


Leu 


Ser 


Ala 


Pre 


















Asp 


He 




Gly 


^ u 


Tyi 


Phe 


Leu 






305 

r 










4 C r J 


Gly 


Thr 


Ala 


Ser 


Met 




Ser 


Gly 




410 










415 




He 


Gly 


Axg 


Asn 


His 


Glu 


His 


Ala 


4 25 










430 






Leu 


Asn 


1 let 


Gly 


Tyr 


Arg 


Gly 


Glu 










4 45 








Leu 


Gin 


Ala 


Gly 


Val 


Ser 


Ala 


Gin 








4 60 










Ser 


Glu 


Trp 


Gl u 


Arg 


Arg 


Asp 


Ser 






4 7 5 










480 


lu 


Thr 


Val 


LeU 


Leu 


Met Tyr 


Asn 




4 90 










4 95 




Arg 


Gly 


Thr 


Arg 


_ 

x*eu 


Ser 


Ala 


Phe 


505 










510 






Met 


Gly 


Gly 


Gly 


Tnr 


Phe 


Ser 


.on 










525 








Trp 


er 


Ph ft 




Lys 


Glu 


Leu 


Leu 




















Ser 


Pro 


lu 


er 


Asn 


Pro 


Ala 






555 










550 


eu 


T"r 


Tyr 


0 n 


Al a 


Pro 


Phe 


Tyr 




570 










575 




Asp 


Ala 


Glu 




Asn 


Ar>n 


Val 


Val 


585 










590 






Gin 


Gly 


Ala 


Phe 


His 


He 


Leu 


Ala 










605 








Gly 


Gly 


Arg 




Tyr 


Lys 


Phe 


Thr 








620 










Phe 


Asn 


He 


Asn 


Pro 


Tyr 


He 


He 






635 










640 


Gly 


Glu 


Asn 


He 


Gly 


Ser 


Gly 


Tyr 




650 










655 




Phe 


Gly 


Glu 


Leu 


Val 


?ro 


Gly 


Val 


665 










670 






He 


Lys 


Ala 


Arg 


Gin 


Lys 


Leu 


Asp 










685 








Asn 


Ala 


Pro 


Thr 


Tyr 


Asn 


Phe 


Ser 








700 










Gly 


A-sn 


Lys 


Arg 


He 


Thr 


Ala 


Thr 






715 










720 


Leu 


Pro 


Gin 


Leu 


Asn 


Pro 


Ser 


Lys 




730 










735 




Ala 


Pro 


Ala 


Tyr 


Lys 


Ara 


Val 


Asp 


745 










750 






Aep 


Pro 


Asp 


Acp 


Scr 


Phe 


Ala 


Gly 










765 








Lye 


Gly 


Ala 


Tyr 


He 


Gly 


Ala 


Asp 








730 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



386/ 490 



Leu phe Am Leu phe Asp Het Thr Asn V*l Asn Sei Tyt Tyr Trp Val 
785 790 795 800 

Set Asp Al« Tyr Gin Gin Gin Tyr Ala Val Fro Asn Tyr Leu The Arg 

BOS 810 815 

Arg Gin Phe Asn Leu Arg Leu Leu Val Glu Phe 
820 825 



{Z) IHFORIIATION FOR SEP ID NO: 4 47 

(t) SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 67? amino acids 
[B> TYPE: amino acid 
ID) TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival ls» 

(ix) FEATURE: 

(A) NAME/ KEY: mico_feature 
fB) LOCATION 1. . .672 



SEQUENCE DESCRIPTION: SEO. ID HO: 4 47 



Met 


Tyr 


Ser 


Gly 


His 


His 


Lys 


He 


His 


Tyr 


Pro 


Phe 


Leu 


He 


Leu 


Leu 


1 








5 










10 










15 




Val 


Cys 


Lsu 


Ala 


Phe 


Ala 


Ala 


Cys 


Lys 


ser 


Val 


Lys 


Leu 


Lys 


Asp 


Ala 








20 










25 










30 






Glu 


Lys 


Ala 


His 


Asp 


Arg 


Gin 


Glu 


Tyr 


Thr 


Lys 


Ala 


Ala 


Asp 


Met 


Tyr 






35 










40 










45 








Afn 


Thr 


Leu 


Tyr 


Arg 


Arg 


Thr 


Arg 


Arg 


Lys 


Gin 


Val 


Glu 


MeL 


Lys 


Ala 




SG 










55 










60 










Tyr 


Thr 


Ala 


Phe 


Arg 


Ser 


Gly 


Glu 


Asn 


Tyr 


Arg 


Ala 


Ala 


Gly Arg 


Gin 


65 










70 










75 










80 


Ala 


Lys 


Ala 


LeU 


Arq 


Gly 


Tyc 


Leu 


Asn 


Ala 


Arg 


Arg 


Tyr 


Gly Tyr 


Pro 










85 










90 










95 




Asp 


Ser 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


Al a 


Gin 


Thr 


Tyr 


Gin 


Gin 


Gly 


Gly 








100 










105 










110 






Asn 


Tyr 


Lys 


Glu 


Ala 


Glu 


Val 


Leu 


Phe 


Arg 


Gly 


Tyr 


Leu 


Glu 


Ala 


Tyr 






US 










120 










125 








Pro 


Lys 


Ser 


Tyr 


Phe 


Al a 


Ala 


He 


Gly 


Leu 


Glu 


Gly 


Cys 


Leu 


Phe 


Ala 




130 










135 










140 










Arg 


Gin 


Gin 


Lys 


Glu 


Tyr 


Pro 


Thr 


Arg 


Tyr 


Arg 


He 


Arg 


Arg 


Ala 


Ala 


145 










150 










155 










1 60 


Glu 


Trp 


Asn 


Ser 


Als 


Arg 


Cly 


Asp 


Phe 


Gly 


Pro 


Ala 


Tyr 


Ala 


Pro 


Asp 










165 










17C 










175 




Ala 


Ser 


Ala 


Leu 


Tyr 


Phe 


Thr 


Ser 


Ser 


Arg 


Ser 


Lys 


Asp 


Asp 


Gly 


l.f u 








180 










IBS 










190 






Asp 


Asn 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Lys 


Pro 


Asn 


Asp 


He 


Tyr 


He 


He 






I9S 










200 










205 








Lys 


Arg 


Anp 


Ala 


Gin 


Gly 


Arg 


Trp 


Gly 


Arg 


Pro 


Asp 


Ser 


Val 


Ser 


Gly 




210 










215 










220 










Gly 


lie 


Asn 


Thr 


Pro 


Trp 


Asp 


Glu 


Gly 


Val 


Pro 


Thr 


He 


Thr 


Pro 


Asp 


225 










230 










235 










240 


Gly Ser 


Thr 


lie 


Tyr 


Tyr 


Thr 


Leu 


Ala 


Gin 


Gin 


Gly 


Ala 


Asp 


Tyr 


Asp 










245 










250 










255 




Arg 


Thr 


Val 


Gin 


He 


Tyr 


Ser 


Ala 


Ala 


Arg 


Ser 


Gly 


Glu 


Gly Gly 


Trp 








260 




















270 






Ser 


Asn 


Gly 


Ser 


Leu 


Val 


Asp 


He 


Met 


Arg 


Asp 


Ser 


Leu Arg Met 


Ala 






275 










2B0 








285 








Aid 


His 


Pro 


Ser 


Het 


Ser 


Ala 


Ser 


Gly 


Asp 


Tyr 


Leu 


Tyr 


Phe 


Val 


Ser 




290 










295 










300 












lie 


Gly 


Gly 


Ser 


Tyr 


Gly 


Gly 


Lys 


Asp 


He 


Tyr 


Arg 


Val 


Lys 


Val 


305 










310 










315 










320 


Stir Asp 


Arg 


Ser 


Tyr 


Gly 


Ser 


Pro 


Glu 


Asn 


Leu 


Gly 


Pro 


Asp 


He 


Asn 










325 










330 










335 




Thr 


Pro 


Gly 


Asp 


Glu 


Met 


Phe 


Pro 


Phe 


He 


Asp 


Gly 


Asp 


Ser 


Thr 


Leu 








340 










345 










350 






Phe 


Phe 


Ala 


Ser 


Asp 


Gly 


His 


Ala 


Gly 


Leu 


Giy 


Gly 


Leu 


Asp 


He 


Phe 






355 










360 










365 








Lys 


Ala 


Thr 


Leu 


Asp 


Ser 


Thr 


Gly 


Gin 


Trp 


His 


Val 


Val 


Asn 


Met 


Gly 




37Q 










375 








380 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



387/490 



Gin Pro Val Asn Sei Ser Ala Asp Asp Phe Gly Leu Ala Val Glu Pro 
385 390 395 400 

Lys Gly Lys Asn Lys Glu Glu Ala Leu Pro Asp Asn Gly Val Lys Gly 

4 05 410 415 

Val Phe Cys Ser Asn Atg Gly Asp Ala Arg Gly Trp Pro His Leu Ph~ 

420 425 430 

Ills Phe Glu Lea Pro Ala lie Tyr Thr Glu lie Gin Gly Tyr Val MeL 

435 440 445 

Asp Arg Glu Glu Asn Pro Tie Ala Gly Ala Thr Val Arg He Val Gly 

450 455 460 

Glu Arg Gly Pro Val Gly Gin Gly Phe Val Thr Thr Arg Asp Asp Gly 
465 -HO 475 480 

Ser Tyr Lys MeL Ser Val Gin Gly Asp Thr Arq Tyr Val Het Leu Ala 

435 490 495 

Gly Ala Ser Gly Tyr Leu Asn Gin Tyr Vai Glu Leu Lys Tnr Asp Thr 

505 510 
Ala Lys Gin Ser Glu Thr Tyr Tyr Val Asp Phe Phe Leu Ala Ser Arg 

Sl^ 520 525 

Glu Lys Ala Glu Gly Leu Gin Asn He Phe Tyr Asp Phe Asp Lys Ala 

530 535 54 0 

Thr Leu Arg Pro Glu Ser MeL Lys Ser Leu Asp Glu Leu He Arg Tie 
545 550 555 560 

Leu Thr Asp Asn Pro Asp He Arg He Glu Leu Gly Ser His Ala Asp 

565 570 575 

Arg Lys Gly Pro Asp Al* Tyr Asn Leu Gly l.?u s-r Avp Arq Arg Ala 

590 5e5 590 

Lys Ser Val Val Asp Tyr Leu Thr Ser Ara Gly He Ala Ala Asp Arq 

595 600 605 

Leu Thr Trp Lys Gly Tyr Gly I.ys Ser Val Pro Lys Thr Val Thr Ala 

610 61S 52 0 

Lys He Ala Glu Arg His Asp Phe Leu Lys Glu Gly Asp Val Leu Thr 
625 630 635 640 

Glu Glu Phe Val Ala Pro Leu Thr Glu Glu Gin Gin Ser Val Cys Asp 

645 65n 655 

Gin Leu Asn Arg Arg Thr Glu Phe Ary Val He Glu Glu Glu Leu Arg 
660 665 670 

(2) 1HFORI1ATION FOR SEQ ID NO: 448 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(ixi) HYPOTHETICAL: YES 

(Vl> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix| FEATURE: 

(A) NAME/ KEY : mise_feature 

(B) LOCATION 1 708 



(xi» SEQUENCE DESCRIPTION: SEQ ID 110:44 8 



Uet 


Lys 


Lys 


Phe 


Phe 


Phe 


Ala 


Leu 


Leu 


Ser 


He 


Gly 


He Ser 


Ala 


Gin 


1 








5 










10 






15 




Ala 


Phe 


Ala 


Lys 


Thr 


Asp 


Asn 


Val 


Pro 


Thx 


Asp 


Ser 


Leu Arg Val 


His 








20 










25 








30 






Asn 


Leu 


Gin 


Thr 


Val 


Thr 


Val 


Tyr 


Ser 


Thr 


Arq 


Thr 


Ala Val 


Pro 


Leu 






35 










40 










4S 






Lys 


Lys 


He 


Pro 


Ala 


Lys 


Met 


Glu 


Leu 


He 


Ser 


Ser 


Arg Asn 


Tie 


Lys 




50 










Si> 










60 




Gin 


Ser 


Gly 


Phe 


Asn 


Asn 


Met 


Thr 


Asp 


He 


Leu 


Lys 


Thr Gin 


Ser 


Ser 


65 










70 










75 






80 


Leu 


Asp 


Val 


He 


Gin 


Tyr 


Pro 


Gly 


Phe 


Ser 


Ser 


Asn 


He Gly 


He 


Arg 










85 










90 








05 


Gly 


The 


Lys 


Pro 


Ser 


Gly 


Lys 


Tyr 


Val 


Thr 


Val 


Leu 


Val Asn 


Gly 


He 








100 










105 








110 




Pro 


Ala 


Gly 


Thr 


Asp 


Asn 


He 


Ser 


Thr 


Leu 


Asn 


Thr 


Ser Asn 


He 


Glu 






115 










120 










125 


Gin 


He 


Glu 


He 


Leu 


Lys 


Gly 


Pro 


Phe 


Ser 


Ser 


He 


Tyr Gly Thr 


Asn 




13C 










135 










140 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



388/490 



Ala 


Met 


Gly 




Val 


Val 


Asn 


He 


He 


Thr 


His 


Lys 


Ser 


Lys 


Asp 


Lys 


140 










150 










1SS 










160 


lie 


Hxs 


Gly 


Asn 


Vai 


Ser 


Leu 


Phe 


Gly Gly 


Ser 


Tyr 


Gin 


Thr 


llet 


Ala 










165 










170 








175 




Gly 


Ser 


Ph^ 


Asn 


Leu 


Gly Gly Arg 


Phe 


Glu 


A-p 


He 


Phe 


Ser 


Phe 


Asp 








180 










185 










190 






Leu 


Ser 


Leu 


Gly 


Leu 


Asp 


Lys 


Gin 


Asn 


Lys 


Asp 


Tyr 


Lye 


Thr 


Gly 


Ser 






195 










200 










205 








Asn 


A?n 


Phe 


Leu 


Ser 


Leu 


Ser 


Lys 


Leu 


Glu 


Glu 


Ala 


He 


Val 


Asp 


Vol 




210 










215 










220 










Asn 


Ala 


Thr 


Lys 


Asn 


Lys 


Lys 


tlet 


Lys 


Gly 


Scr 


Asp 


Tyr Thr 


Val 


Ala 


225 










230 










235 










240 


Thr 


G1 y 


Arg 


Leu 


Arg 


Phe 


Gly 


He 


Asp 


Phe 


Thr 


Pro 


Glu 


Trp 


Ser 


Leu 










245 










25C 










255 




Asn 


Lsu 


Tyr 


Gin 


Asn 


Val 


Phe 


Leu 


Gly 


Asp 


A1 a 


Tie 


Pro 


Val 


Gly 


ciy 








260 










265 










270 






Ser 


lie 


Trp 


Gly 


Val 


Tyr 


Gly 


Glu 


Ser 


Lys 


Lys 


Asn 


Leu 


Asn 


Arg 


Ser 
















280 










285 








Ser 


Tnr 


Ser 


Phe 


Glu 


Leu 


Leu 


Gly 


Lye 


His 


Gly 


Cys 


His 


Thr 


Leu 


Gin 




290 










295 










30Q 










Phe 


Ser 


Pro 


Tyr 


Phe 


Asn 


lie 


Glu 


Lys 


Ser 


Glu 


Asn 


Tyr Asn 


Asn 


Ala 


300 








310 








315 










320 


Asp 


Pre 


Thr 


Gly 


Phe 


He 


Asn 


Tyr 


Lys 


Ser 


Acp 


Tyr 


Tyr 


Thr 


Tyr 


Gly 










325 










330 










335 




Ala 


Leu 


Leu 


Gin 


7\cp 


Lys 


lie 


Ser 


Phe 


Gly 


Gly 


Gin 


Asn 


He 


Val 


Leu 








340 










345 










350 






civ 


Val 


Asp 


aer 


Arg 


Asn 


liet 


Thr 


Met 


Glu 


Ser 


Glu 


Arg 


Phe 


Glu 


Gin 






355 










360 










365 








Ala 


Gly 


Val 


A?n 


Thr 


Lys 


Pro 


Tyr 


Asn 


t?ro 


Gly 


Tyr 


Ala 


Thr 


Asn 


Asn 




310 










375 










300 










lie 


Gil- 


Leu 


Phe 


Gly 


Gin 


Ala 


Asn 


Phe 


Tyr 


L^u 


Leu 


A.*tn 


Asp 


Ala 


Leu 


385 










390 










395 










4 00 


Ser 


lie 


Ser 


Ala 


Gly Ala 


Arg 


Ala 


Asp 


Phe 


tlet 


Phe 


Phe 


Asp 


Leu 


Lys 










405 










410 










415 




Ala 


Asn 


Glu 


Tyr 


Leu 


Asn 


Asn 


Glu 


Ala 


Lys 


Gin 


Glu 


Thr 


Ills 


Asn 


Val 








420 










425 










430 






lie 


Asn 


Fro 


Asn 


Val 


Gly 


lie 


Lys 


Tyr 


Glu 


Phe 


Val 


Lys 


Gly 


Leu 


Thr 






4 35 










440 










445 








Ala 


His 


Gly 


Thr 


Phe 


Gly 


Ser 


Ala 


Phe 


Ser 


Ala 


Pro 


Asp 


Ala 


Phe 


Gin 














4 55 










4 60 










Lys 


Ala 


Gly 


Gin 


Tyr 


Val 


Gl/ 


Pro 


Phe 


Gly 


Thr 


Thr 


He 


Gly Asn 


Pro 


•165 










470 










475 










480 


Asp 


Leu 


Lys 


Pro 


Glu 


Lys 


Ser 


Met 


Thr 


Trp 


Asp 


Phe 


Gly 


11- 


Gly 


Tyr 










485 










490 










4 95 




Ser 


As'n 


Ala 


Arg 


Cys 


Gly 


He 


Gin 


Ala 


Asp 


Val 


Thr 


Leu 


Thr 


Tyr 


Phe 








SCO 










505 










£10 






His 


Th£ 


Afp 


Hie 


hye 


Asp 


Leu 


He 


Leu 


Ser 


!3*r 


Pro 


Asp 


Tyr 


Ala 


Asn 






515 








520 










525 








Af n 


11- 


Thr 


Thr 


Tyr 


lie 


Asn 


Ala 


Asp 


Lys 


Ala 


Arg 


llet 


Ser 


Gly 


Tic 




530 










535 










54 0 










Glu 


Ala 


Leu 


Leu 


Ser 


Tyr Asp 


Phe 


Gly 


Ser 


Leu 


Phe 


Ala 


Asn 


Lys 


Phe 


S4S 










550 




















560 


Ser 


Leu 


Arg 


Ala 


Phe 


Ala 


Am 


Ala 


Thr 


He 


list 


Leu 


Asn 


Ser 


Glu 


Met 










565 










57 0 










57 5 






Lys 


Ser 


Gin 


Thr 


Asp Ala 


Pro 


Trp 


Ser 


Glu 


net 


Tyr 


Tyr 


Val 


Arg 








560 










565 










590 






Lys 


Gin 


Asn 


lie 


Thr 


Phe 


Gly 


He 


Glu 


Tyr 


Arg 


Gly 


Lys 


Glu 


Gly 


Leu 






595 










600 










605 








Glu 


Val 


Met 


Leu 


Asn 


Gly Arg 


Phe 


Met 


Gly 


Axg 


Arg 


He 


Glu 


Gln 


Asn 




610 










615 










620 










Trp 


Tyr 


Ala 


Tyr 


Tyr 


Pro 


Glu 


Val 


Arg 


Pro 


Glu 


Leu 


Gin 


G3n 


Leu 


Leu 


625 










630 










635 










640 


Ala 


Al„ 


Glu 


Glu 


Pro 


Glu 


Leu 


Ala 


Ala 


Gin 


Gly 


Leu 


Leu 


Arg 


Hxs 


Pro 










645 










651" 










655 




Gin 


Ala 


Met 


Val 


Phe 


Asn 


Ala 


Ser 


Ala 


Tyr 


Tyr 


hx a 


(let 


Asn 


Lys 


Tyr 








660 










665 










670 






Leu 


Thr 


Phr 


Gly 


Val 


Asn 


Leu 


Asn 


Asn 


He 


Leu 


Asp 


Glu 


Leu 


Tyr 


Thr 






675 










680 










685 








Glu 


Lys 


Asp 


Gly 


Tyr 


His 


Met 


Pro 


Gly 


Arg 


Asn 


11 5. 


Mat 


Gly Lys 


Val 




6?0 










695 










700 










Met 


Va! 


A.?n 


Phe 


























705 
































(2) 


IHFORt 1ATI0H 


FOR 


SEQ 


ID HO: 449 

















SUBSTITUTE SHEET CRule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



389/ 490 



(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 462 amino acids 

(B) TYFE: amino acid 
[D) TOPOT.OGY: linear 

(ii> MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGI1IAL SOURCE: 

tA) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE; 

(A) NAME/ KEY : misc_feature 
( B } LOCATION 1. . .462 

(Xi) SEQUENCE DESCRIPTION: SEQ IDMO;449 

Met Asn Arg Phe Ser Asn His Trp Pro Cys He Leu Val Gly Phe Val 

15 10 15 

Leu Trp Phe Val S*r Ala Scr Arg Thr Val Ala Gin Asn Ala S~r Glu 

20 25 30 

Thr Thr Val Ser Tyr Asp Thr Asp Thr Ala Val Leu Ser Olu Ala Asp 

35 40 4 b 

Val Leu Arg lie Ala Leu Ser Glu Asn Ala Thr Val Lys Val Ala Ast- 

50 55 60 " 

Met Asp Val Arg Lys Gin Glu Tyr Ala Arg Arg Ala Ala Arg Ala Asp 
65 ™ 7b bo 

L - u Pro Lys Val Asp Lsu A^n Gly Val Tyr Ser His: Thr L-u L^ 

8i > 90 95 

Lys Gin Val Leu Tyr He Asp Met Pro Gly Phe Ser Ser Ser Glu Gly 

100 10 5 no 

He Glu net Gly Arg Thr His Asn Thr Gin Gly Gly Val Asn V«,l Ser 

115 3?0 12 S 

Met Pro Leu Val Ser Ala Gin Leu Trp Lys Ser He Ala Met Thr Glv 

130 135 i4 0 y . 

Glu Gin Leu Asp Leu Ala Leu Glu Lys Ala Arg Ser Ser Arq Il*» Afd 
145 150 155 * * 16 J 

Leu Val Ala Glu Val Lys Lys Ala Tyr Leu Ser Val Leu Leu Ala Glu 

165 170 n 5 

Asp Scr Tyr Gly Val Phe Lys Arg Ser Tyr Asp Asn Ala Leu Ala Asn 

180 185 X90 

Tyr Lys Asn He Ser Asp Lys Phe Asp Arg Gly Leu Val Ala Glu T'-'r 

195 200 205 

Asp Lys He Arg Ala Asn Val Gin Val Arg Asn He Glu Fro Asn Lou 

23 0 215 220 

Leu Gin Ala Gin Asn Ser Val Ala Leu Ala L~u T rp Gin Leu L— Val 
— S 230 235 " ->«o 

Leu Met Ser Met Glu Val Glu Thr Pro He Arg Leu Ser Gly S-i L-u 

245 250 2*5 

Ser Asp Tyr Lys Glu Gin Val Tyr Thr Gly Tyr Phe Ala Ala Asp Thr 

260 265 270 

Leu He Ser Asn Asn Ser Ser Leu Arg Gin Leu Asp Ho Gin Arq Ara 

275 2R 0 2B5 

Leu Ala Val Ser Ala Asp Lye Leu Asn Lys Tyr Ser Phe Leu Pro Thr 

290 295 300 

Leu Asn Leu Gly Gly Gin Tyr Thr Tyr Ser Leu Asn Ser Asn Asp II- 
305 310 315 32n 

Lys Phe Trp Gly Glu Gly Gin Arg Trp Thr Pro Phe Ser Thr II- Ser 

32b 330 335 

Leu Ser Lou Tyr He Fro He Phe Asn Gly Gly Lye Arg Leu Tyr Asn 

3* 10 345 350 

Val Lys Gin Ser Ala L«u Ser He Arg Gin He Asp Leu Gin Arg Axy 

355 360 3*5 

His He Glu Gin Ser He Arg Met Gly He Lys A?n Gin Asn Asp Arg 
370 375 * 80 * 

Leu Arg Thr Cys Met Gin Arg Phe Val Ala Ser Glu Glu Ala Val Arg 
305 390 3 o S . J0 ^ 

S-r A^a Glu Lys Gly Tyr Gin He Ala Glu Lys Arn-Tyr Gin Thr Glv 

405 4io - 4lFi 

Glu Gly Thr Uu Val Glu Leu Asn Asp Ala Asp Val Ala Leu Leu Gin 

■'20 425 430 

Ala Arg Leu Asn Tyr Asr, Gin Ala Ho Phe Asp Phe llet Thr Ala Lys 

435 440 445 

Ala Glu T.-u Asp Lys Met Asn Gly Met Gly He Pr~ Glu Gin 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 





WO 99/29870 



PCT/AU98/01023 



390/ 490 



450 



455 



460 



5 
10 
15 
20 
25 
30 
35 

* 40 

45 
50 
55 
60 
65 
70 
75 



(2> INFORMATION FOR SEQ ID NO: 450 

U) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 4 92 amino dcidr 
<B) TYPE: amino acid 
t D) TOPOLOGY: lin-ar 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

{Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

( lx ) FEATURE : 

(A) NAME/KEY: miscjeaturt 

(B) LOCATION 1...492 



txi) SEQUENCE DESCRIPTION: SEQ TO 110:450 



Met 


Trp 


Gly 


Asp 


Ser 


His 


Gly 


Val 


Ala 


Pro 


Asn 


Gin 


Va 1 Arg 


Arg 


Thr 


I 








5 










10 








lb 




Leu 


Val 


Lys 


Val 


Ala 


Leu 


Ser 


Glu 


Ser 


Leu 


Pro 


Tro 


Gly Ala 


Lys 


Gin 








20 










25 








30 






He 


Arg 


lie 


Gly 


Phe 


Ser 


Leu 


Pro 


Lys 


Glu 


Thi 


Glu 


Glu Lys 


Val 


Thr 






35 










40 










45 






Ala 


Leu 


Tyr 


Leu 


Leu 


Val 


Ser 


Asp 


Ser 


Leu 


Ala 


Val 


Arg A?p 


Leu 


Pro 




50 










55 










60 








Asp 


Tyr 


Lys 


Gly 


Arg 


Val 


Ser 


Tyr 


Asp 


Ser 


Phe 


Pro 


He Ser 


Lys 


Glu 


65 










70 










•75 








80 


Asp 


Arq 


Thr 


Thr 


Ala 


Leu 


Ser 


Ala 


Asp 


Ser 


Val 


Ala 


Gly Arg Arg 


Phe 










35 










90 








95 




Phe 


Tyr 


L-u 


Ala 


Ala 


Asp 


He 


Gly 


Pro 


Val 


Ala 


Ser 


Phe Ser 


Arg 


Ser 








100 










105 








110 






Asp 


Thr 


Leu 


Thr 


Al a 


Arg 


Val 


Glu 


Glu 


Val 


Ala 


Val 


Asp Gly Arg 


Pro 






115 










1 20 










125 






Leu 


Pro 


Leu 


Lys 


Glu 


Leu 


Ser 


Pro 


Ala 


Ser 


Arg 


Arg 


Leu Tyr 


Arg 


Gly 




130 










135 








140 








Tyr 


Glu 


Ala 


Leu 


Phe 


Val 


Pro 


Cly 


Asp 


Gly 


Gly 


Ser 


Arg Asn 


Tyr 


Arg 


145 










150 










155 








160 


He 


Pro 


Ala 


He 


Leu 


Lys 


The 


Ala 


Asn 


Gly 


Thr 


Leu 


He Ala 


Met 


Ala 










165 










17 0 








175 




Asp 


Arg 


Arg 


Lys 


Tyr 


Asn 


Gl n 


Thr 


Asp 


Leu 


Pro 


Glu 


Asp lie 


Asp 


He 








10O 










105 








190 






Val 


Het 


Arg 


Arg 


Ser 


Thr 


Asp 


Gly 


Gly 


Lys 


Set 


Trp 


Ser Asp 


Pro 


Arg 






19S 










200 










205 






He 


He 


Val 


Glti 


Gly 


Glu 


Gly 


Arg 


Asn 


His 


Gly 


Phe 


Gly Asp 


Val 


Ala 




210 










215 










220 








Leu 


Val 


Gin 


Thr 


Gin 


Ala 


Gly 


Lys 


Lsu 


Leu 


Met 


lie 


Phe Val 


Gly Gly 


225 










230 










235 








240 


Val 


Gly 


Leu 


Trp 


Gin 


Ser 


Thr 


Pro 


Asp 


Arg 


Pro 


Gin 


Arg Thr 


Tyr 


He 










245 










250 








255 




Ser 


Glu 


ser 


Arq 


Asp 


Glu 


Gly 


Leu 


Thr 


Trp 


Ser 


Pro 


Pro Arg 


Asp 


He 








260 










265 








270 






Thr 


His 


Phe 


He 


Phe 


Gly 


Lys 


Asp 


Cys 


Ala 


Asp 


Pro 


Gly Arg 


Ser 


Ary 






275 










280 










285 






Trp 


Leu 


Ala 


Ser 


Phe 


Cys 


Ala 


Ser 


Gly 


Gin 


Gly 


L~u 


Val Leu 


Pro 


Ser 




2 90 










295 










300 








Gly 


Arg 


He 


Thr 


Phe 


Val 


Ala 


Ala 


He 


Arg 


Glu 


Ser 


Gly Gin Glu 


Tyr 


305 










310 










315 








320 


Val 


Leu 


Asn 


Asn 


Tyr 


Val 


Leu 


Tyr 


Ser 


Asp 


Asp 


Glu 


Gly Asp Thr Trp 










325 










330 








3 35 




Gin 


Leu 


Ser 


Asp 


Cys 


Ala 


Tyr 


Arg 


Arg 


Gly 


Asp 


GlU 


Ala Lys 


Leu 


Ser 








340 










345 








350 






Leu 


Met 


Pro 


Asp 


Gly 


Arq 


Val 


Leu 


Met 


Ser 


He 


Axg 


Asn Gin 


Gly 


Arg 






355 










360 










365 






Gin 


Glu 


Scr 


Arg 


Gln 


Arg 


Phe 


Phe 


Ala 


Leu 


Ser 


Ser 


Asp Asp Gly 


Leu 




370 










375 










3S0 








Thr 


Trp 


GlU 


Arg 


Ala 


Lyt 


Gin 


Phe 


Glu 


Gly 


He 


His 


Asp Pro 


Gly 


Cys 


385 










390 










395 








4 00 


Asn 


Gly 


Ala 


Met 


Leu 


Gin 


Val 


Lys 


Arg 


Asn 


Gly 


Arg 


Asp Gin 


Val 


Leu 










405 










410 








415 




His 


Ser 


Lou 


Pro 


Leu 


Gly 


Pro 


Asp 


Gly 


Arg 


Arg 


Asp 


Gly Ala 


Val 


Tyr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



391/490 



42S 4JQ 
Leu Phe As-p His Val Ser Gly Arg Trp Ser Ala Pro Val Val Val Asn 

435 440 445 

Ser Gly Ser Ser Ala Tyr Ser Asp Met Thr Leu L<=u Alo Atp Gl v ' Thr 

450 455 460 

He GI3 Tyr Phe Val Glu Glu Gly Asp Glu II- Ser Leu Val Pn~ He 

465 470 475 480 

Arg Phe Val Leu Asp Asp Leu Phe Asp Val Arg Gin 
4 85 490 

(2) INFORMATION FOR SEO ID NO; 4 51 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonar gl nglvaMs 

( I>: ) FEATURE : 

(A> NAME/KEY: misc_feature 

(B) LOCATION 1. . .245 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 451 



Met 


Lys 


Lys 


Glu 


Lys 


Leu 


Trp 


He 


Ala 


Tie 


Val 


Ala 


Gly 


Leu 


Ala 


Fhe 


1 








5 










10 








15 




Val 


Lftu 


Gly 


Leu 


Tyr 


Ala 


Leu 


Gly Arg 


Ser 


Val 


Ala 


Gin 


Leu 


Arg 


Arg 


















25 










30 


Ser 


Gin 


Pro 


Ser 


Val 


Thr 


val 


Thr 


Gly 


Met 


Ala 


Glu 


Arg 


Asn 


Phe 


Lys 






35 










40 










45 






Ser 


Asp 
50 


Leu 


lie 


Val 


Trp 


Thr 
55 


Ala 


Ser 


Tyr 


Gin 


Leu 

60 


Gin 


Met 


Met 


Asp 


Leu 


Glu 


Ser 


Ala 


Tyr 


Lys 


Ala 


Leu 


Lys 


Glu 


Lys 


Gin 


He 


Leu 


Val 


Ala 


65 










70 










75 










80 


Asp 


Tyr 


Leu 


Lys 


Asn 


Lys 


Gin 


Leu 


Pro 


Asp 


Ser 


Ser 


Tyr 


He 


Phe 


Ser 










85 










90 








95 




Ser 


Val 


Ala 


He 


Ser 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Arg 


Gin 


Glu 






100 










105 








110 




Gin 


Asn 


Val 


Arg 


Thr 


Ph- 


Ala 


Gly Tyr 


Leu 


Leu 


Ser 


Gin 


Thr 


V«l 






115 










120 










125 






Thr 


Val 
130 


Thr 


Ser 


Gin 


Asp 


ils 
135 


Glu 


Hir 


Val 


Glu 


Lys 
140 


He 


Ser 


Arg 


Asp 


11^ 


Thr 


ciu 


Leu 


lie 


Asn 


Gin 


Gly 


Val 


Glu 


lis 


Thr 


Ser 


Asp Arg 


Pro 


145 


Tyr 








150 










155 










160 


Ala 


T>r 


Tyr 


Thr 


Lys 


Leu 


Asn 


Asp 


Leu 


Lys 


Val 


Glu 


Met 


Leu 


Arg 


A^n 


Ala 






165 










170 










175 


Ser 


Glu 


Asp 


Ala 


Phe 


Asn 


Arg Ala 


Ser 


Val 


He 


Ala 


Glu 


Gly 








180 










185 










1 on 




Ser 


Gly Ser 


Ser 


Val 


Gly 


Lys 


Met 


Leu 


Ser 


Ser 


Ser 


Met 


Gly Val 


Phe 






195 










200 










205 








Cln 


He 


Val 


Gly 


Leu 


Asn 


Ser 


Asn 


Glu 


Asp 


Tyr 


Ser 


Trp 


Cli- 


Gly 


Ser 


Phe 


210 










215 










220 








At n 


Thr 


Ser 


Ser 


T.ys 


Met 


Lys 


Thr 


Ala 


Ser 


He 


Thr 


val 


Lys 


Ala 


225 










230 










235 








240 


Ser 


Phe 


Ala 


Leu 


Lys 























245 



(2| IllFORt [ATION FOR SEQ ID NO: 4 52 

(it SEQUENCE CHARACTERISTICS : 

(A> LENGTH: 27 6 amino acids 
(31 TYFE: amino acid 
(D) TOPOLOGY: linear 

(It) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi( ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



392/ 490 



(A) ORGAN I SH: Porphyromonaff gmqivalis 

Ux) FEATURE: 

(A) MAI IE/ KEY r nU5c_feature 
( B) LOCATION 1 . . - ?7f> 



ixi ) SEQUENCE DESCRIPTION: SEQ ID NO: 452 



I let 


Lys 


Lyz 


Thr 


He 


Ala 


He 


He 


Ala 


Ser 


Ala 


Leu 


Leu 


Ala 


Leu 


Gl y 


1 








5 










10 










15 




Ala 


Val 


Gil- 


cys 


Lys 


Lys 


Asn 


Ala 


Asp 


Thr 


Thr 


Ala 


Val 


Ser 


Glu 


Lyp 








ZO 










2b 










30 






Atp 


Ser 


lie 


Ala 


Leu 


Scr 


Met 


Gly 


He 


Leu 


Tyr 


Gly 


Gin 


Asp 


Phe 


Ala 






35 










40 










45 








A5n 


Gin 


Phe 


Glu 


Ilet 


Ser 


Arg 


Leu 


Gin 


Gly 


Gin 


Pro 


He 


Asp 


Ser 


Val 




SO 










55 










60 










Aid 


Phe 


Leu 


Asp 


Gly 


Phe 


Lys 


Tyr 


Gly 


He 


Asp 


Thr 


Thr 


Arg 


Phe 


Ser 


65 










70 










75 










80 


Tyr 


Asn 


Lej 


Gly 


Ala 


He 


Tyr 


Ala 


Ser 


Asn 


lie 


Ala 


Arg 


Gin 


Leu 


Ala 










85 










90 










95 




His 


Asp 


Ser 


He 


A&p 


lie 


Arp 


Lys 


Phe 


Tyr 


Ala 


Ala 


list 


Arg 


Ala 


Ala 








100 










105 










] 10 






Leu 


Leu 


Lys 


Asp 


Thr 


Val 


Ser 


He 


Ala 


Het 


Lys 


Pro 


Ala 


Asp 


Ala 


Gin 






US 










120 










125 








Ala 


Phe 


net 


Gin 


Arg 


lie 


Gin 


Ala 


Lys 


Lys 


Gin 


Arg 


Glu 


Asn 


Asn 


Met 




130 




















140 










L/5 


Gin 


Phe 


Gly 


Gin 


Asn 


He 


Glu 


T.y* 


Gly 


Asn 


Glu 


Tyr 


He 


Asp 


Thr 


145 










150 










155 










160 


Phe 


Lyc 


Lye 


Glu 


Aso 


Gly 


Val 


Thr 


Val 


Thr 


Thr 


Thr 


Gly 


Leu 


Ala 


Tyr 










165 










170 










17 b 




Lys 


Thr 


Leu 


Gin 


Glu 


Gly 


Thr 


Gly 


Ala 


Thr 


Pro 


Ser 


Leu 


Ala 


Asp 


Thr 








100 










185 










190 






Val 


Arg 


Val 


Ly* 


Tyr 


Val 


Gly 


Thr 


Leu 


Val 


Asp 


Gly 


Lys 


Glu 


Phe 


Asp 






195 










200 










205 








Lys 


Asn 


Glu 


Glu 


Gly 


He 


Glu 


Phe 


Ala 


Val 


Thr 


Gly 


Val 


He 


Lys 


Cli- 




210 










215 










220 










Trp 


Thr 


Glu 


Hot 


L«?\i 


Gin 


Leu 


Met 


Lys 


Val 


Gly 


Gin 


Lys 


Val 


Arg 


val 


225 










230 










235 










240 


Val 


lie 


Pro 


Gin 


Glu 


Leu 


Ala 


Tyr 


Gly 


Glu 


Thr 


Gly 


Asn 


Tyr 


Thr 


He 










245 










250 








255 




Glu 


Pro 


Phe 


Ser 


Thr 


Lou 


Thr 


Phe 


Glu 


Het 


Glu 


Leu 


He 


Gly 


lis 


Lys 



260 265 270 



Pro Gly Lys Lys 
275 

t2) INFORMATION FOR SEQ ID 140:4 53 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 5 amino acids 
IB) TYPE: amino acid 
(D> TOPOLOGY: linear 

fill IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM : Porphyromonas gingivalxs 

I ix ) FEATURE : 

(A) NAME/KEY: miscfeature 
<B> LOCATION 1 . . .77S 

(Mil SEQUENCE DESCRIPTION: SEQ ID NO: 4 53 



net 


Lys 


Va) 


Leu 


Arg 


Gin 


Val 


fhe 


Leu 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Lou 


l 








5 










10 










15 




Thr 


Gly 


Ala 


Cys 


Ser 


Thr 


Thr 


Lys 


Asn 


Leu 


Pro 


Glu 


Gly 


Glu 


Gin 


Leu 








20 








25 










30 






Tyr 


He 


Gly 


J let 


Gly 


Lys 


Thr 


Gin 


lit 


Leu 


Arg 


Gin 


Asp 


Lys 


Ser 


HI? 






35 










40 










4S 








Ala 


Gly 


Gin 


Gin 


Ala 


Leu 


Thr 


Glu 


Val 


Glu 


Sec 


Thr 


Leu 


Lys 


Vai 


Thr 




50 










55 










60 








Pro 


A.-n 


Gly 


Ala 


He 


Phe 


Gly 


Ssr 


Ala 


ser 


Ala 


Ser 


Leu 


Pro 


Lys 


lie 



€5 70 - 75 80 
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Pro Phe Gly Leu Trp hru Tyr Asn Ser Phe Val Gly Asp Ser Thr Vol 

85 go o 5 

lie Ser Lys Trp He Phe Asp Lys Phe Ala Ala Lyr Pro Val Phe He 

100 105 no 

Ser Gin Val l.ya Ser Asp Ssr Arg Ala Lys Val Ala Thr Asn He L~u 

US 120 12S 

Arg Glu His Gly Tyr Phe Asp Ala Lye Val Lys Ssr Ser Val Thr Thr 

130 135 no 

Leu Lyr Lyr A^p Ser Leu Lys Ala Lys He Ser Tyr Thr Val Asp H-t 
H5 150 155 160 

/via Ser Pro Tyr His Tyr Asp Ser He He Pro Leu Pro He Ser Thr 

165 170 175 

Phe Pro Asp Ser He Leu Ala Tyr Arg Gin Thr Fro Ser Leu He Arg 

180 IBS 190 

Lys Gly Asp Gin Phe Asn L^u Ala Lya Lgu His Glu Glu Arg Gin Thr 

105 200 205 

lis Ser Ala Leu Leu Arg Asp Asn Gly Tyr Tyr Tyr Phe Arg Pro Gin 

210 215 220 

Asp He He Tyr Glu Ala AEp Thr Leu Leu Val Ara Gly Ala Val C**« 
225, 230 235 " 240 

Leu Arg Air, Lys Leu Ser Glu Asp Thr Pro Pro Gin Ala Met Arg Pro 

245 250 255 

Trp Arg He Gly Lys Aig Thr AJ a Vfti Leu Leu Gly Met Asn Gly Glu 

260 265 270 

Ser Pro Thr Asp Ser Leu Glu Val Glu Asp Met Lit- Val L~u T"r T"r 

275 2B0 28b 

Arg Lys I let Pro Val Arg Pro Ly3 He Leu Ala Lys Arq Phe Arg Ph~ 

290 29b 300 

Phe Ser Gly A-=n Leu Tyr Arg Gin Lys Asp A&p Glu Thr Thr Arg Lys 
305 310 315 3 2 A 

Ser Leu Ala Arg Leu Gly Ala Phe Ser Val He Asp Leu Asn Phe L-u 

325 330 335 

G3n Acg Asp Ser He Ser Gly Leu Leu Asp Val Arg Leu Leu Thr Thr 

340 345 350 

Leu Asp Lys Pro Trp Atp Ala Sor Leu Glu Thr Leu Fhe Thr S«*r L- «- 

355 360 365 

Ser Asn Asp Phe He Gly Pro Gly Leu Asr. Phe Ala Leu Ala Arq Ara 

370 375 330 

Asn Val Phe Gly Gly Gly Glu Asn Leu Ser Trp Asn He Gly Gly Ser 
385 390 3g5 4 0Q 

Tyr Glu Trp Glu Thr Gly Asn Arg Pro Glu Asn Ser Ser Asn Arg Leu 

405 410 415 

He Asp He Asn Ser Tyr Asn Met Asn Thr Ala Val Asn Leu Ser Phe 

420 425 430 

Pro Ser Tie Val Fhe Pro Gly Leu Leu Asp Lys Tyr Tyr Tyr Tyr Pro 

435 440 445 

Thr Thr Thr Thr Phe Gin Ala Ser Ala Thr Ala Leu Asn Arg Ala Kis 

450 455 4*0 

Tyr Phe S~r tl=t Tyr Ser Fhe Gly Phe S«*r Thr Thr Tyr Glu ?h=> Sin 
4*5 470 4/^ , ;30 

Pro Ser Lys Glu His Arg His Ala He Phe Pro Leu Lys Leu Asn Tyr 

485 490 495 

Asn Leu Leu Gly His Gin Thr Glu Thr Phe Gin Ala He Thr Ala Asn 

500 505 510 

Asn Pro Pro Leu Leu Leu Ser Leu Gin Ser Gin Phe Leu Ala Gin tl^t 

515 520 b 2 b 

Gly Tyr He Tyr Thr Phe Asn Lys Ser Val Ser Glu Lys Ser Pro Hi^ 

530 535 540 

Hitf Leu Trp Met Gin Phe Gly Leu Ser Glu Ala Gly Asn Leu Leu Asn 
545 550 555 56^ 

Leu He Tyr Leu Ala Ala Gly Lys Lys Tyr Ser Asp Thr Lys Asn Phe 

565 570 575 

Val Gli Val Pro Phe Ser Gin Phe lie Lys Ala Thr Gly Glu Leu Arq 

530 585 590 

Tyr Ser Tyr Thr He Asp Arg Asn Gin Ser Leu Ala Thr Arg Phe Gly 

595 600 605 

Thr Gly Val He Tyr Ser Tyr Gly Asn Met Arg Val Ala Pro Tyr Ser 

frlt> 615 620 

Glu Gin Phe Tyr Val Gly Gly Ala Asn Ser He Arg Ala Phe Thr Val 
625 63C 535 64 .^ 

Arg Ser He GJ y Pro Gly Arg Phe Asn Pro Asp Sei. Asrp Asn Gin Tyr 

645 6 50 655 

Ser Tyr Leu A£p Gin Val Gly Glu Fhe Lys Leu Glu Ala Asn Val Glu 

660 665 e/0 

Tyr Arg Gly vys Leu Phe Gly Atp Lou Hi* Ala Ala Val Phe Leu Asp 
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675 680 



690 V ^ Ar9 G1U AE P S = r Ser Ar <5 Pro Cly Gly 

Ala Leu Ser Glu V.1 Gly lor Val Ser Asn Ph. leu Asn Ser He Al. 

Leu Gly Thr Gly Val Gly Leu Arg Tyr Asp lit Ala Phe Leu Val Vol 

Arg Val Asp Val Gly Ph. Gly Le„ HI, Leu Pl o Tyr Arn Tnr Gly 1*. 

Lys Gly Tyr Tyr Asn lie Pro Arg III Lys Asp Ala lie Gly ?h = His 

760 7 65 

Leu Ala Val Gly Tyr Pro Phe 
I 1 * 775 

(2) INFORMATION FOR SEQ. ID NO: 45 1 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 77 4 amino acids 
<B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

UU MOLECULE TYPE: protein 

fill) HYPOTHETICAL; YES 

fvi) ORIGINAL SOURCE; 

(A) ORGANISM: Pcrphyroinonas gingivalis 
tix) FEATURE: 

(A) NAME/KEY: mi£c feature 

(D) LOCATION 1...77 4 

<xi) SEQUENCE DESCRIPTION: SEQ ID HO: 454 

Met Ser Ser His Ser Val Arg Tyr Leu II, Gl, He Ala Gly cys Leu 

Leu Leu Met Leu Ala Ser Ser Cys Ser Val Thr Arg Tyr Val Pro Asp 

Gly Ser Arq Leu Leu Asp Arg Val Thr He Ala Ser Glu ^nr Gly Ser 

II, Ala Leu Pro Glu Asp Tie Ar'g Asp Tyr Thr Leu Gin Gin Pro Asn 

60 

Tyr Arg Leu Phe Gly He t Thr Arg Trp Leu Leu Arg Val Tyr Ser Ser 

Ser Asn Pro Asn Ser Asn Ser Trp Trp Asn Arg Ser Leu Arg Lys Met 

85 gQ .i^ 

Gly Glu Pro Pro Val Leu He Asp Ser Val Leu Thr n*p -g Thr Ala 

100 10S 



Asn Arg Leu Ala Lys Ala M « Al, Q ly ^ Qly pha Lsu r; p Aia ^ 

Ala Arg Ala Val Val Asp Thr Gly Leu Tyr Ly., Lys Af* Ar. He Thr 

1 140 
Tyr Leu He Gin Pro Gly Ser Ar, Tyr Tyr He Arg Asn IIs t Ala Leu 

Asp Val Ly. Asn Pro Leu Leu Pro Pro Val Ala Leu Gly Asn Ser "2 

170 1 *7 t 

Pro ser Ala Tyr Ly S Val Gly He Ser Glu Gly Ser Pro Leu Ser Pro 

He V.1 Leu Asp Glu Glu Arg Lys a" He Ala Arg Hi, ,m Arg Asn 

200 70S 
Asn Gly Ph e Trp Lys Phe Ser Ala Glu Asp Val Tyr Tyr Glu Ala As P 

Thr Thr Val Ser Gly Gly S =r Gly Thr Lys Ser III Asp lej Lys Leu 

Val Val Asn Gly lie Gly Arg Tyr Pro Tyr Ar| He Gly Ar, Val Phe 

£<o 250 ->cc 

The His Ala Asp Tyr Asp Pxo Leu Glu Ser Asp Phe Arg Val Gin G!u 

Leu Pro Arg Tie Asp Ser lie Ser Arg Gly Asp Tyr- Thr Val Tyr Tyr 

280 205 

V fan ^ Al " 9 TyX IlG Ar ^ A ^ Leu Thr Ara Ser V.1 

" 295 300 

305' '' al GlY 5JS PhS Phe C ^ Acp Asp Val Gl, Arg Ser 

Tyr lie Lys Leu Acn Ala Leu Pro He Val Arg A.n Val «n He Arg 
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325 330 335 

Phe Val Glu His Asn Gly Lys Asp Glu lie Ala Leu Ala As.p Ser Ser 

340 345 350 

Arg Leu Val Asp Cyr Tyr He Leu Thr Val fro Ala Lys Ser Lys Ser 

355 360 365 

Phe Uiu Ala Glu Val Leu Gly Thr Asn Ser Ala Gly Asp Phe Gly Ala 

370 375 300 

Ala Leu Ser I.eti Gly Phe Thr Asp Arg Asn Leu Phe Arg Gly Ala Glu 
385 390 3?5 400 

Ilet Fhe Asn He Lys Leu Lys Gly Ala Tyr Glu Ala He Arg hys* Gly 

405 410 415 

Ser His Ser Phe Met Glu Tyr Gly Val Glu Ser Ser Leu Arg Phe Pro 

420 425 430 

Arg Leu Leu Phe Pro Phe He Ser Asp Glu Thr Arg Arg Arg Leu Arq 

435 440 445 

AJ a Ser Thr Glu Trp Lys He Gly Tyr Asn Tyr Gin Thr Arg Pro Glu 

450 455 4G0 

Phe Asp Arg Val He Leu Ser Ala Gin Leu Asn Tyr Ser Trp Gin Thr 
465 470 475 480 

Tyr Leu Ills Asn Axg Leu Arg His Thr He Arg Leu Leu Asp Val Acp 

4 85 4 90 4 35 

Tyr Leu His Leu Pro Tyr He Asp Pro Asp Phe Ala Gin Ser Leu Pro 

500 SOS S10 

Pro Thr Thr Ala Leu Tyr A^n Tyr Thr Glu Gin Phe Ho Lou Gly Ser 

515 520 b25 

Al*, Tyi He Leu Asn Tyr Thr Thr Ala Ser Ser Met Glu Arg Thr Val 

53'J 535 54 O 

S-r Asn Pro Phe Thr Ala Arg Phe Ser He Gin Thr Ala Gly Asn Leu 
545 550 Sbb 560 

Leu Gin Ala lit Ssr Tyr Leu Thr Asp Ser Pro Lys Asp Glu His Gly 

565 570 575 

Leu Tyr Lys llet The Gly Leu His Tyr Ala Gin Phe Val Lys Leu Asp 

580 585 SS0 

Leu Asp Leu Ala Lys Thr Val Leu Leu Glu Lys Asp Asn Thr Leu Ala 

555 600 605 

Leu Hie Leu Gly Phe Gly Leu Ala Phe Pro Tyr Gly Asn Ala Arg His 

610 615 620 

He Fro Phe Glu Leu Arg Tyr Phe Ala Gly Gly Ser Asn Ser Val Arg 
6 25 630 635 640 

Gly Trp Ser Val Arg Thr Leu Gly Pro Gly Ser Met Lys HeL Thr Pro 

645 650 655 

Asp Lys Thi Phe Phe Asp Gin Met Gly Asp He Arg Leu Asp Leu Asn 

660 665 670 

Val Glu Tyr Arg Thr Lys Leu Phe Trp Lys Phe Arg Ala Ala Ala Phe 

67 5 680 685 

Val Asp Ala Gl y Asn Val Trp Thr He Lys Glu Tyr Glu Asn Gin Glu 

«?0 695 700 

Asp Gly Leu Phe Arg Phe Asp Arg Phe Tyr Lys Glu He Ala Leu Ala 
"70S 710 715 720 

Tyr Gly Leu Gly Leu Arg T.«u Asp Phe Asp Tyr Phe Leu Val Arg Leu 

725 730 735 

Asp Ala Gly Leu Lys Ala Tyr Asp Pro Gin Gin Thr Gly Arg Tyr Lvs 

740 745 750 

Trp Alu He Thr Arg Pro Asn Leu Ser Ser Asn Fhe Ala Trp His II- 

755 760 765 

Ala Val Gly Tyr Pro Phe 
770 

(2) INFORMATION rOR SEQ ID MO: 4 55 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 867 amino acids 

(B) TYPE: amino acid 
<uj TOPOLOGY; linear 

(ii) MOLECULE TYFE: protein 

(iii) HYPOTHETICAL: YES 

( i) ORIGINAL SOURCE: 

<A) ORGANISM: Pcrphyromonas gingiva lis 

<i::) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1...067 
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ixi) SBQUEUCE DESCRIPTION: SEQ ID NO: 455 



Me L 


Arg 


Lye 


Arg 


T ] (5 


Leu 


Cln 


Leu 


Phe 


Leu 


Thr 


Ala 


Leu 


Leu 


Leu 


Ala 


1 


















I 0 










15 




Leu 


Gly 


i'er 


Ser 


Leu 


Ala 


He 


Ala 


Gin 


Thr 


Val 


Val 


Thr 


Gly 


Lys 


Val 








20 










25 










30 






He 


Arp 


Ser 


Gl u 


Thr 


Ser 


Glu 


Pro 


Leu 


Ile 


Gly 


Val 


Ser 


Va„ 


Ser 


Thr 






35 










40 










45 








idly 


'Jin 


Gly 


Ala 


Ser 


Leu 


Arg 


Gly 


Val 


Thr 


Thr 


Asp 


Met 


Arrp 


Gly 


Gly 




50 










55 










60 










Phe 


Arg 


Phe 


GT l) 


Val 


Pro 


Ala 


Lys 


Ser 


Val 


Leu 


Thr 


Phe 


Arg 


Cys 


Val 


65 










7 0 










75 








8 0 


Gly 


Tyr 


Ala 


Thr 


Val 


Thr 


Arg 


Ser 


He 


Gly 


Arg 


Gly 


Ser 


Gin 


Glu 


Asp 










85 










90 










95 




Leu 


Gl y 


Thr 


lis 


Leu 


Leu 


Asp 


Pro 


Gin 


Ala 


He 


Gly 


Leu 


Asp 


Glu 


He 








10O 










105 










110 






Gin 


Vai 


He 


Ala 


Ser 


Vol 


Val 


Pro 


Lys 


Asp 


Arg 


Het 


Thr 


Pro 


Val 


Pro 






115 










120 










125 








Va 1 


5-r 


Asn 


He 


Arg 


Val 


Ala 


Asp 


He 


Gin 


Ala 


Ala 


Ser 


Leu 


Asn 


Val 




130 










135 










140 










Glu 


Phe 


Pro 


Glu 


Leu 


Vol 


Lys 


Ser 


Thr 


Pro 


Ser 


Thr 


Tyr 


Thr 


Thr 


Lys 


145 










150 










155 










160 


Gl y 


S~r 


Gly 


Gly 


Phe 


Gly 


Asp 


Gly 


Arg 


Thr 


Asn 


Val 


Axg 


Gly 


Fh~ 


Asp 










) 65 










170 










175 




Tin 


Tyr 


Asn 


Phe 


Sly 


Val 


Leu 


Tie 


Asn 


Gly 


Val 


Pro 


Val 


Asn 


Gly 


Met 








lti'J 










185 










190 




Gl u 


a^p 


Gly 




Val 


Tyr 


Trp 


Ssr 


Asn 


Trp 


Ser 


Gly 


Leu 


Met 


Asn 


Gin 






195 










200 










205 








Ala 


Ser 


Thr 


lie 


Gin 


He 


Gin 


Arg 


Gl y 


Leu 


Cly 


Ala 


Ser 


Lys 


Leu 


Gly 




210 










215 










220 










He 


Ser 


Ser 


Val 


Gly 


Gly 


Thr 


Met 


Asn 


He 


Ile. 


Thr 


Lys 


Thr 


Thr 


Asp 












230 










235 










240 


Ala 


Asn 


Thr 


Gly 


Gly 


Ser 


Ala 


Tyr 


Val 


Gly 


rt-t 


Gly Asn 


Asp 


Gly 


Leu 










245 










250 










255 




His 


Lyr 


Glu 


Ser 


Phe 


Ser 


He 


Ser 


Thr 


Gly 


Met 


Asn 


Asp 


Gly Trp 


Als 








260 










265 










270 






He 


rhr 


lie 


Ala 


Gly 


Ser 


His 


Hat 


Thr 


Ciy 


Leu 


Gly 


Tyr 


Val 


Lys 


Gly 






275 










280 










285 








Leu 


Lys 


Gly 


Arg 


Ala 


Phe 


Ser 


Tyr 


Phe 


Phe 


Asn 


Val 


Ser 


Lye 


Lys 


Phe 




290 










295 










300 










Afn 


Glu 


Ary 


His 


Thr 


Leu 


Ser 


Leu 


Thr 


Gly 


Phe 


Gly Ala 


Pro 


Gin 


Trp 


305 










310 










315 










320 


HI.* 


A5n 


Gin 


Arg 


Ser 


Ser 


Lys 


Tyr 


Ser 


Val 


Ala 


Asp 


Tyr 


Asp 


Lys 


Tyr 










325 










330 










335 




Gly 


He 


Arg 


His 


Asn 


Gl n 


Ser 


Phe 


Gly 


Tyr 


Leu 


Arg 


Gly 


Glu 


Leu 


Thr 








34 0 










345 










3S0 






Pro 


Thr 


Ala 


Tyr 


Ala 


Tyr 


Asn 


The 


Tyr 


His 


Lys 


Pro 


Gin 


Phe 


Ser 


Leu 






355 










360 










355 








A«n 


Hi t* 


Phe 


Trp 


T 


Met 


Asp 




Asn 


The 


Ser 


L^u 


Tyr 


Thr 


Ala 






37 0 










375 










300 










Tyr 


Ala 


Ser 


Leu 


Ala 


Thr 


Gly 


Gly 


Gly 


Arg 


Arg 


Ala 


Tyr 


Gly Lys 


Asn 


385 










390 










3 95 










4 00 


Ser 


Lys 


Trp 


Val 


Leu 


He 


Asn 


Tyr 


Asn 


Thr 


Gly Gin 


Pre 


Tyr 


Glu 


Gin 










405 










410 










415 




Thr 


Lys 


Val 


Thr 


Pro 


Asp 


Gly 


Leu 


He 


Asp 


Tyr 


Asp 


Ala 


Val 


Leu 


Ala 








420 










425 










430 






Ala 




Ala 


Al a 


Ala 


Ser 


Asn 


Gly 


Ser 


Glu 


Ala 


He 


Phe 


Ala 


Leu 


Gly 






435 




















445 






Ser 


Asn 


Ser 


Mi^ 


Lys 


Trp 


Phe 


31/ 


Leu 


Leu 


Ser 


Ser 


Phe 


Lys 


Lys 


Lys 




450 










4 55 










4 60 










Leu 


Asn 


Ser 


Ser 


Leu 


Thr 


Leu 


Thr 


Ala 


Gly 


Tyr 


Asp 


Gly Arg 


Tyr 


Tyr 


4 65 










470 










475 










4 80 


Arg 


Gly 


Asp 


His 


Tyr 


Asp 


Lys 


He 


Thr 


Asp 


Leu 


Leu 


Gly 


01 y 


Ser 


Tyr 










485 










4 90 










4 95 




Tyr 


He 


Glu 


Asp 


Pro 


Lys 


Thr 


Lys 


Leu 


Ala 


Tyr 


His 


Ala 


Glu 


Gly 


Gin 








500 










sos 










510 






Gin 


Leu 


Lys 


Val 


Gly 


Asp 


He 


Val 


Am 


Arg 


Asp 


Tyr 


Thr 


Gly Glu 


He 






515 










520 










,525 








Het 


Trp 


His 


Cli- 


Leu 


Fhe 


Ala 


Gin 


Het 


Glu 


His 


Ser 


Ser 


Glu 


Trp 


Ho 




530 










535 










54 0 








Asp 


Ala 


Phe 


val 


Ser 


Gly 


Ser 


He 


Asn 


Tyr 


Glu 


Leu 


Tyr 


Arg 


Asn 


His 


S4 5 










550 










555 






S60 


Asn 


Tyr 


Giy 


Gly 


Ser 




Ser 


Thr 


Gly 


Tyr 


Leu 


Pro 


Gly Val 


Ser 


Pro 










56b 










570 










57 5 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



397/490 



Trp 


Lys 


Ser 


Phe 


Lou 


Pro 


Trp 


Ser 


Gly 


Lys 


/Mo 


Gly 


L^u 


Ser 


Tyr 


Lys 








5 _ 0 










585 










S9U 






Phe 


Ala 


Gin 


Gly 




Asn 


Val 


Phe 


Al 3 




Gly 


Gly 


Phe 


Phe 


Thr 


Arq 






5 95 










600 










605 








Ala 


Pro 


Leu 


Ph** 


v 


Asn 


He 


Tyr 


Ala 


Ala 


Gly 


Ala 


He 


lib 


Pro 


Asn 




610 










61 5 










62 0 










Asp 


Lys 


Ala 


As n 


Met 


Glu 


Lys 


Val 


Leu 


Thr 


Gly 


Glu 


Val 


Gly 


Tyr 


Gly 


625 










630 










635 










6-10 


Phe 


Thr 


Am 


H1 ^ 


Lys 


Asn 


Phe 


Glu 


Phe 


Asn 


He 


Asn 


Gly 


Tyr 


Tyr 


Th r 










645 










650 








655 




Lys 


Trp 


Met 


Asp 


Arg 


Val 


Thr 


Ser 


Lys 


Arg 


lie 


Gly 


Asn 


Glu 


Tyr 


Val 








660 










665 










670 






Tyr 


J.cu 


Asn 


Gly 


Val 


Asp 


Ala 


Val 


His 


Cys 


Gly 


Va 1 


Glu 


AT a 


Gl u 


Val 






67 5 










680 










685 








Ser 


Tyr 


Arg 


Pro 


lie 


Arg 


Gin 


Tie 


Asp 


Leu 


Arg 


Gly 


Met 


Phe 


Ser 


Leu 




690 










6 95 










700 










Gly Asp 


Trp 


Thr 


Trp 


Gin 


Asn 


Asn 


Val 


Ser 


Tyr 


Thr 


Scr 


Tyr 


Asp 


Glu 


705 










710 










715 










720 


Ala 


Gly 


Asn 


Glu 


Thr 


Gly Gin 


Asp 


He 


Thr 


Tyr 


He 


Lys 


Gly 


Leu 


His 










I2S 










730 










735 




Vol 


Gly 


A. e p 


Ala 


Ala 


Gin 


Ilet 


Thr 


Ala 


Ala 


Val 


Ser 


Ala 


Asp 


He 


Glu 








140 










74b 










750 






LeU 


Phe 


Lys 


Gly 


Phe 


Hie 


Val 


Jl ft 


Gly 


Lys 


Tyr 


Asn 


Phe 


Leu 


Giy 


Ly t 






755 










7 60 










765 








Asn 


Tyr 


Ala 


civ 


Phe 


Asn 


Pro 


Ala 


Thr 


Arg 


Aen 


Ala 


Gin 


Gin 


Tyr 


Glu 




770 










775 










780 










Ala 


Asp 


Gly 


Lys 


Glu 


He 


Val 


Glu 


Ser 


Trp 


Lys 


Leu 


Fro 


Asp 


Val 


Gly 


765 










7 90 










7 95 










800 


Leu 


Phe 


Asp 


Leu 


Ser 


Ala 


Ser 


Tyr Asn 


Phe 


Lye 


Leu 


Gly 


Ser 


Leu 


Ser 










805 










810 








815 




Thr 


Thr 


Phe 


Tyr 


Phe 


Asn 


Met 


Asp 


Asn 


Val 


Ala 


Asp 


Lys 


Arg 


Tyr 


Val 








820 










625 










830 






Ser 


Asp 


Ala 


Asp 


Asp 


Asn 


He 


He 


Gly 


Lye 


Lys 


His 


Asp 


GlU 


Ala 


Ser 






835 










840 










845 








Ala 


Leu 


Val 


Trp Tyr 


Gly 


Phe 


Gly Arc? 


Thr 


Trp 


Ser 


Thr 


Gly 


He 


Arg 




8S0 










855 










860 











Val Asn Phe 



BGb 

(21 INFORMATION FOR SE0 ID NO:456 

{it SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B ) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

till] HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyroroonas gingivalia 

|ix) FEATURE: 

(A) NAME /KEY: m±sc_f eature 
(Bl LOCATION 1...431 



<xt) SEQUENCE DESCRIPTION; SEQ ID HO: 456 



Met 


Lys 


Phe 


S«r 


He 


Arg 


Leu 


Phe 


Leu 


Cys 


He 


He 


Phe 


Leu 


Leu 


Ser 


1 








5 










10 










15 




Ala 


Phe 


He 


Leu 


Pro 


Ala 


Leu 


Gly 


Gin 


Lys 


Ser 


Lys 


Gin 


Val 


Gin 


Arg 








20 










25 








30 






Leu 


Glu 


Lys 


Gin 


Arg 


Lys 


Glu 


Ala 


Leu 


Lys 


Ala 


He 


Glu 


Lys 


Thr 


Asp 






35 










40 










45 






Arg 


Glu 


Leu 


Arg 


Asn 


Thr 


Lys 


Lys 


A&p 


Lyr 


Gin 


Asp 


Ly.<* 


Gin 


Lys 


His 




50 










55 










60 










Leu 


Asn 


Leu 


Leu 


Asn 


Lys 


Gin 


Val 


Ala 


Gin 


Arg 


Lys 


Gin 


Met 


Val 


Gin 


65 










70 










7S 










80 


Leu 


Leu 


Asp 


Asn 


Glu 


Val 


Ly& 


Glu 


Leu 


Gin 


Ser 


Asp 


He 


Asp 


Ser 


Met 










85 










90 










95 




The 


Gly 


Val 


Cys 


His 


Gin 


Leu 


Ser 


Val 


GlU 


GlU 


Lys 


Ala 


Aro 


Ser 


Asp 








100 










105 










110 






Glu 


Tyr 


Al* 


Gin 


Ala 


Leu 


Gin 


Ser 


Met 


Gin 


Lye 


Arg 


Lys 


Arg 


Ser 


Leu 






115 










120 










125 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



15 



25 



WO 99/29870 PCT/AU9 8/0 1023 

398/490 

Aep Arg lie Leu Phe He Sec Set Al« Lys Scr Phe Acp Glu Gly H-t 

130 135 140 

Acq Arg Met Arg Phe Lou Glu Gin Tyr Ala Ger Ala Tyr Lys Leu Ala 
145 ISO 15S leo 

O Ser Val Arg Leu Arg Asp Thr Ary Ser Lys Leu Glu Thr Glu Arg Ala 
1S5 17C 175 

Thr Val Glu Asp Ala Lys Lys Glu Lys Gly His Leu Leu Val He Arg 

180 185 190 

Glu Glu Glu Lys Lys Lys Leu Glu Gly Gin Gin Ala Clu Gin Arq Ara 
*0 195 200 205 

Gin Val Gin Ala Leu Gly Ala Lys Gin Lys Asp Leu Glu Ala Gin L~u 

210 2is 220 

Arg Lys Gin Lyt Lys Gin Ala Glu Ala L-u A«tn Arg Ly^ He Glu Lvs 
225 -30 235 2 40 

Gin He Ala Lys Glu He Glu Ala Al« Glu Arg Arg Ala Arg Glu Glu 

245 250 255 

Arg Glu Arg Leu Ala Arg Glu Ala Lys Ala Lys Gly Lys Pro Val Pro 

260 265 210 

Ala Glu Pro Glu Arg Lys Ala Glu Thr Lys Gly Gly Tyr Ala Met Acp 
4V 275 280 285 

Ala Ser Glu Ary Ala Leu Ser Gly Ser Phe Ala Gin Asn Lys Gly Arg 

290 295 300 

Leu Pro Gly Pre Val Arq Gly Arg Tyr Arg He Val Ser Atp Phe Gly 
305 310 315 J20 

Val His Gin His Ser Glu Leu Lys Lys Val Gin Val Asn Asn Gly Gl" 

325 330 335 

Tie A-p He Ala Val Ala Thr Gly Ser Asp Ala Thr Ser Val Phe Arp 

340 345 350 

Gly Val Val Ser Ser Val Phe Val Ho Pro Gly Tyr Asn Ser Ala Val 
dU 3 55 360 365 

Met Val Arg His Gly Asn Tyr He Thr Val Tyr Ala Asn Leu Ser Lys 

370 375 3G0 

Val Tyr Val Asn Ser Gly Thr Arg Val Lys Thr Gly Gin Ala L-u Glv 
QI - 395 390 395 400 

Arg Ala Tyr Thr Asp Pro Ser Asn Asn Gin Thr He He His Phe Glu 

405 410 415 

He Trp Lys Giu Arg Ser Lys Gin Asn Pro Arg Leu Trp Leu Arg 
420 425 430 

40 (2) Til FORMATION FOR SEQ ID NO: 4 57 

ti) SEQUENCE CHARACTERISTICS ; 

I A) LENGTH; 333 amino acids 
IB) TYPE: amino acid 
ID) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival! 3 

ILx) FEATURE: 

<A) NAME/KEY: misc foature 

(B) LOCATION 1. . . 333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 57 

Met Lys Lys Tyr Leu Leu Tyr Ala Ser Leu Leu Thr Ser Val Leu Leu 

15 10 15 

Phe Ser Cys Ser Lye Asn Asn Pro Asn Glu Pro Val Glu Asp Ara S~r 

20 25 30 * " 

He Glu He Ser He Arg Val Asp Asp Phe Thr Lys Thr Gly Glu Ala 
OO 35 40 45 

Val Arg Tyr Glu Arg Asn Gin Gly Ser Ala Ala Glu Arg Leu lie Thr 

50 55 60 

Am L-u Tyr Leu Leu Leu Phe Asp Gin Ser Gly Ala Am Pro Ala Lv? 
65 70 75 - 30 

Tyr Tyr He Thr Gly Asn Thr Ph- Thr Gly Gly Thr Trp L-u Pro Atp 

85 90 95 

Asp Met Lys Val Lys Leu Asp Met Thr Gin Ser Glu Ala Gly Glu Ara 
100 105 no 



45 



50 



55 



60 



70 



Lya* Val Tyr Val Val Ala Asn Val Asp Asn Air. Val Lys Thr Ala Leu 
/0 115 120 125 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



399/490 



Asp 


Ala 


Val 


Ala 


Asn 


Glu 


Ser 


Aep 


Leu 


Gin 


Thr 


Vol 


L/s 


Arg 


Thr 


Thr 




130 










135 










no 






Ala 


Met 


Pre 


Trp 


Ser 


Thr 


Asp 


He 


Ala 


Ser 


Pro 


Phe 


Leu 


Met 


Ser 


Gly 


14 5 










150 










155 










160 


Asn 


Lys 


Thr 


Has 


Asp 


Phe 


Leu 


Ala 


Asn 


Arg 


Leu 


Lqu 


Asp 


Asn 


Val 


Pro 










165 










1 to 










175 




Leu 


Val 


Arg 


Ala 


He 


Ala 


Lys 


Val 


Glu 


Leu 


Asn 


He 


Ser 


Leu 


Ser 


Glu 








180 










185 










190 






Lys 


Phe 


Gin 


He 


Val 


Pro 


He 


He 


Val 


Asn 


Gly 


Ser 


Leu 


Ser 


Glu 


Php- 






195 










200 










205 








Lys 


Phe 


Arg 


Tyr 


Val 


Asn 


Phe 


Asp 


Lys 


Glu 


Thr 


Tyr 


Val 


Val 


Lys 


Pro 




210 










215 










220 








Thr 


Thr 


Lys 


Pro 


Asp 


Asn 


Leu 


He 


Ser 


Ser 


A1 % 


A?n 


Gly 


Val 


Trp 


Pro 


22 5 


lie 








230 










235 






240 


Gin 


Thr 


Asp 


Trp 


Thr 


Val 


Trp 


Gly 


Ala 


Set 


Leu 


Asn 


Thr 


Ser 


Pro 










245 










250 










25S 




Ala 


Pro 


Asp 


Ala 


Gly Thr 


Gly Tyr 


Thr 


Leu 


Asp 


Ala 


Asn 


Gly 


Lys 


Val 








260 










265 










270 




Thr 


Ala 


Leu 


Arg 


He 


Val 


Thr 


Tyr 


Leu 


Asn 


Glu 


Arg Asp 


Sei. 


Lys 


Gly 






275 










2S0 










285 




Ala 


Thr 


Val 


Glu 


Val 


Ala 


Leu- 


Pro 


Arg 


Val 


Ar*p 


Asp Gly 


Thr 


Leu 


Pro 




2 90 










295 










300 










Pro 


Pro 


Glu 


Phe 


Gly 


Pro 


Glu 


Leu 


Tyr 


Arg 


Leu 


Pro 


Leu 


Pro 


Asp 


Lys 


305 










310 










315 








320 


lie 


Leu 


Arg 


Asn 


His 


Trp 


Tyr 


Lys 


Tyr 


Glu 


Val 


Glu 


13 e 









325 330 



(2) INFORMATION FOR SEQ ID NO: 4 58 

(i) SEQUENCE CHARACTERISTICS : 

(AJ LENGTH: 33 5 amino acids 
(BJ TYPE: amino acid 
(DJ TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona * gingiva lis 

(XX) FEATURE: 

(A) IIAME/KEY : misc f eature 
(R) LOCATION 1...315 



(xli SEQUENCE DESCRIPTION: SEQ ID !IO:458 



Met 


He 


Arg 


Thr 


He 


Leu 


Ser 


Arg 


Tyr 


Val 


Ser 


Ser 


Asn 


Phe 


Trp 


Ser 


1 








5 










10 










IS 




Arg 


Gly 


Ala 


Thr 

20 


Phe 


Phe 


Phe 


Thr 


Tl p 
25 


Phe 


Pro 


Ala 


Phe 


He 
30 


Leu 


Ala 


Ala 


Thr 


Ala 


Lfsu 


Pro 


Ala 


Cys 


Gly 


Gly 


Gly 


Thr 


Ala 


Ser 


Gly 


Ser 


Asp 






35 










40 










45 




Arg 


Thr 
SO 


Leu 


Ala 


Val 


Thr 


He 
5b 


Glu 


Pro 


Gln 


Lys 


Tyr 
60 


Ph<* 


He 


Glu 


Ser 


He 


Ala 


Asp 


Lys 


Ser 


Val 


Gin 


Val 


Val 


Ala 


Leu 


Val 


Pro 


Ala 


Gly 


Ser 


65 










70 










75 








80 


Asn 


Pro 


Glu 


Glu 


Tyr 


Asp 


Pro 


Ser 


Pro 


Thr 


Val 


Met 


Lys 


Arg 


Leu 


3er 










85 










90 






95 




Glu 


Ala 


Asp 


Ala 


Tyr 


Phe 


Tyr 


He 


Gly 


Gly 


Leu 


Gly 


Phe 


Glu 


Gin 


Arg 








100 










105 










110 




Asn 


Leu 


Ala 


Ala 


He 


Arg 


Asp 


Asn 


Ae?n 


Pro 


Lys 


Leu 


Pro 


Leu 


Phe 


Glu 






115 










120 








125 








Met 


sly 


Lys 


Ala 


Leu 


Ala 


Asp 


Ala 


Gly 


Ser 


Ala 


Asp 


Leu 


His 


Gly 


Ser 




130 










135 










l'io 








Cys 


Thr 


Asp 


His 


Ser 


His 


Thr 


Asp 


Leu 


His 


Ala 


His 


Asp 


Pro 


His 


Tyr 


145 










ISO 










155 








160 


Trp 


Ser 


ser 


Val 


Val 


Gly 


Ala 


Lys 


Ala 


Leu 


S~r 


Arg 


Ala 


Ala 


Tyr Asp 


Ala 








165 










170 










175 




Leu 


Vdl 


Glu 


Leu 


Tyr 


Pro 


Asn 


Glu 


Lys 


Asp 


Lys 


Trp 


Asp 


Lys 


Gly 








130 










105 










190 




His 


Asp 


Arg 


Leu 


Asn 


Gly 


Arg 


He 


Asp 


Ser 


Val 


Lys 


Arg 


Leu 


Val 


Asp 






195 










200 










205 






Thr 


llet 


Phe 


Ala 


Arn 


Gly 


Lys 


Ala 


Asp 


T.ys 


Ala 


Phe 


Val 


lie 


Tyr 


His 




210 










215 










220 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



400/490 



Pro 


Set 


Lsu 


Ser 


Phe 


Phe 


Ala 


Gin 


Glu 


Phe 


Gly 


Leu 


Arg 


Gin 


He 


Val 


225 










230 










235 










240 


Il- 


GlU 


Glu 


Asp 


Gly 


Lys 


Glu 


Pro 


Thr 


Ala 


Ala 


His 


Leu 


Arg 


Arg 


Val 










24S 










250 










255 




Ile 


Asp 


Gin 


Ala 


Arg 


Ala 


Asp 


Gly 


Val 


Arg 


lie 


Val 


Phe 


lie 


Gin 


Pro 








260 










265 










27 0 






Glu 


Phe 


Glu 


Thr 


Arg 


Gin 


Ala 


Glu 


Asp 


lie 


Ala 


Arg 


Glu 


He 


Gly Ala 






275 










280 










285 








Arg 


Pro 


Vai 


Arg 


lie 


Acn 


Pro 


I.eu 


Arg 


Ser 


s°r 


Trp 


Glu 


Glu 


Glu 


He 




2 90 










2 55 










300 










Leu 


His 


lie 


Ala 


Arg 


Ala 


Leu 


Ala 


His 


Glu 


Arg 













305 3\0 315 



{2 J IMrORlIATIGH tOH SEQ ID NO: 459 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE1IGTH: 348 amino acids 
IB) TYPE; amino acid 
<D> TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

illi) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE; 

(A) ORGAl-flSJI: Porphyromor.ar gingiva lis 

(ix) FEATURE: 

(A) NAME/ KEY: mircjeature 
( B } LOCATIOU L. . .848 





Ui) 


i SEQURT1CE DESCRIPTION: SEQ ID HO: 4b 9 










Met 


lie 


•Jly 


Lys 


Lys 


He 


Phe 


Phe 


He 


Leu 


Leu 


AI a 


Leu 


He 


Ala 


Phe 


1 








5 










10 










15 




Ser 


Gly 


Leu 


Asn 


A3 a 


Ala 


Thr 


Asp 


Thr 


Glu 


Phe 


Lys 


Tyr 


Pro 


Thr 


Asp 








20 










25 










30 






Ala 


Asn 


ile 


He 


Gly 


His 


Val 


Lye 


Asp 


Ser 


Lys 


Thr 


Gly Glu 


His 


Leu 






35 










40 










45 








Val 


Gly 


He 


Thx 


lie 


Ala 


He 


Lys 


Gly 


Thr 


Thr 


Phe 


Gly Thr 


Ser 


Thr 




50 










55 










60 










Asp 


Ala 


Thr 


Gly 


His 


Tyr 


Tyr 


Leu 


Arg 


Asn 


Leu 


Arg 


Pro 


Gly 


Glu 


He 


65 










70 










75 










89 


Thr 


Lea 


Ile 


Met 


Arg 


Gly 


Met 


Gly 


Tyr 


Lys 


Ser 


Gin 


Glu 


Arg 


Val 


Val 










85 










90 










95 




Arg 


Val 


Glu 


Lys 


Asp 


Lys 


Thr 


He 


Glu 


Val 


Asn 


Phe 


Glu 


Ala 


Glu 


Glu 








100 










105 










110 






Asp 


Ala 


He 


Asn 


Leu 


Asp 


Glu 


Val 


Val 


He 


scr 


Ala 


Asn 


Arg 


Glu 


L-u 






115 










120 










125 








Thr 


Leu 


Arg 


Arg 


Leu 


Ala 


Pro 


Thr 


Leu 


Val 


Asn 


Val 


Leu 


Asn 


Glu 


Lys 




130 










135 










14 0 










Val 


Phe 


Ser 


Gin 


Val 


Asn 


Ala 


Ser 


Asn 


Leu 


Ala 


Gin 


Gly 


Leu 


Ser 


Phe 


14S 










150 










1S5 










160 


Gin 


Pro 


Gly 


Val 


Arg 


Val 


Glu 


Asn 


Asn 


Cys 


Gin 


Asn 


Cys 


Gly 


Phe 


Asn 










165 










170 










175 




Gin 


Val 


Arg 


He 


Asn 


Gly 


Leu 


Asp 


Gly 


Arg 


Tyr 


Ala 


Gin 


He 


Leu 


lie 








180 










185 










190 






Asp 


Ser 


Arg 


Pro 


He 


Met 


Ser 


Ala 


Leu 


Ala 


Gly Val 


Tyr 


Gly 


Leu 


Glu 






195 










200 










205 








Gin 


He 


Pro 


Ala 


Asn 


Met 


He 


Glu 


Arg 


Val 


Glu 


Val 


Val 


Arg 


Gly 


Gly 




210 










215 










220 




Gly 


Ser 


Ala 


Leu 


Tyr 


Gly 


Ser 


Ser 


Ala 


He 


Ala 


Gly 


Val 


Val 


Asn 


He 


225 










230 










235 








240 


He 


Thr 


Lys 


Glu 


Pro 


Ser 


His 


Asn 


Scl- 


Phe 


Thr 


Phe 


Asn 


Glu 


Ser 


Leu 










245 










250 










255 




Ser 


Phe 


Thr 


Gly 


Phe 


Ser 


Lys 


Leu 


As p 


Asn 


Asn 


Thr 


Asn 


Phe 


Asn 


Ala 








260 










265 










270 






Ser 


He 


Val 


Ser 


Asp 


Asp 


Asn 


Arg 


Ala 


Gly 


Ala 


Met 


Val 


Phe 


Gly 


Gin 






275 










280 










2 85 








Ala 


Arg 


Tyc 


Arg 


Asn 


His 


Trp 


Asp 


Ala 


Asn 


Asn 


Asp 


Gly 


Tyr 


S*r 


Glu 




290 










295 










3Q0 










Leu 


Gly 


Lys 


He 


Asp 


Ala 


Arg 


Ser 


Leu 


Gly 


Ala 


His 


Ser 


Tyr 


Leu 


Arg 


305 










310 










315 










320 


Lsu 


Scr 




Tyr 


Ser 


Lys 


Leu 


Thr 


Gly 


Glu 


Phe 


Hl£ 


Thr 


He 


Sor 


Clu 










3? 5 










330 










335 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



401/490 

Phe Arq Arg Gly Gly Asp Ary He Asp Lau Pro Pro His Val Val Gly 

340 345 350 

Val Ala Glu Gin Thr Asp Hie Ser Val Phe Ssr Gly Asn Leu Ly- T-r 

355 360 365 

Asp Leu Phe Ser Ser Asn Tyr Lys His His Phe Gin Ala Tyr Thr Ser 

370 375 3 8 o 

Gly Gin He Val Asn Arg Lys Ser Tyr Tyr Gly Gly He Gly Glu 11° 
385 390 395 4Q0 

Asp Val Asn Gly His Pro Gly Gly Thr Glu Cly Tyr Pro He Pro Gin 

4 05 -110 415 

Asp Gin Tyr Gly Asn Asn Tyr Gly Val Thr Lys Gly Lys Thr Tyr lie*- 

'120 4 25 430 

Gly Gly lis Gin Tyr Ser Tyr Asp Leu Asp Lys Ph~ Leu Leu Met Pro 

435 44 0 445 

Sfer Gin Leu Leu Phe Gly Ala Glu Tyr Thr Arg Asp Glu Leu Asn Arp 

450 455 460 

Val llet Pro lie Leu Ser Tr^ Gin Thr Gly Glu Asp Ala Asn Glv A-n 
465 470 475 48C 

Thr He Pro Leu Tyr Pro Glu Leu Asp Gin Asn He Asn Asn Tyr S-r 

48b 490 495 

Leu Phe Gly Gin Asn Glu Trp Lys Asn Asp Arg Trp Ser He Leu Val 

500 505 • 510 

Gly Ala Arg Leu Asp I.ys His Ser Glu Val Lys Asp Met He Leu Ser 

515 520 525 

Pro Arq Thr Thr Leu Arg Phe Asn Val Acn Pro A/*p He Asn L~u Ara 

530 53S £40 " 

Ala Thr Tyr Ala Lys Gly Phe Arg Ala Pro Gin Val Phe Asp Glu Asp 
545 550 555 56" 

Leu His Val Gly Val Val Gly Gly Glu Ala Gin Lys Val Fhe Asn Asp 

565 57 0 57 5 

Pro Asn Leu Lys Pro Glu He Ser His Ala Phe Ser Leu Scr Ala Asp 

580 5B5 590 

Met Tyr His Arg Phe Gly Asn Val Gin Thr Asn Phe Leu Val Glu Gl- 

525 600 6Q5 

Phe Tyr Thr Arg Leu Leu Asp Val Phe Thr Asn Glu Glu Gin Pro Asp 

610 615 620 

Gin His Asp Gly He Lys Arg Tyr Thr Arg He Asn Gly Ser Glv Al« 
62S 630 635 Y 640 

Lys Val Phe Gly Leu Asn Leu Glu Gly Lys Val Ala Tyr Lys Ser Ph- 

645 650 655 

Gin Leu Gin Ala Gly Leu Thr Leu Ala Ser Asn Lys Tyr Asp Glu Ala 

*60 665 670 

Gin Glu Trp Gly Leu Asn Thr Val Lys Asp Thr Asn Gly Ala Phe Val 

675 680 685 

Thr Glu Ala Asn Ala Asn Gly Gin Gin Glu Tyr Lys Asn Glu Ser M-t 

690 695 700 

Thr Asp Thr Gin He Thr Arg Thx Pro Ser Val Tyr Gly T"r Ph- Thr 
70S 7io 715 ' 720 

Leu Ala Tyr Asn Pro Ala Hir Ser Trp Asn He Ala Leu Thr Gly Ala 

72 5 730 ,35 

Tyr Thr Gly Gin Met Tyr Val Pro His Ala He Glu Tyr Gly Val Lv- 

740 745 750 

Ser Ala Glu Leu Asp He Met Gin Asn Asn Pro Glu He Thr Asp Glu 

755 760 765 

Thr Gly I.ys Ala Pro Arg He Asp Glu Leu Lys Lys Thr Pro Ala Phe 

77 0 775 780 

Phe Asp Leu Gly Leu Lys Val Gly Tyr Asp Phe His Val Phe Gin Ala 
785 7 90 795 aoo 

Thr Glu Val Gin Leu Tyr Val Gly Met Asn Asn He Phe Asn Ser Ph- 

S( >5 flio 815 

Gin Lys Asp Phe Asp Arg Gly Ala Ala Arg Asp Ser Gly Ty- He Tvr 
820 



830 



Gly Pro Thr Gin Pro Arg Thr Gly Tyr Hftt Cly Leu Val Val Lys Ph- 
835 840 845 

(2) INFORMATION FOR SEQ ID HO: 460 

(J) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 
<B> TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



402/490 



PCT/AU98/01023 



Ui) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qingivalis 

iix) FEATURE: 

(A) NAT IE/KEY: mircjeaturs 

(B) LOCATION 1 . . - 202 



C'.i) SEQUENCE DESCRIPTION : SEQ ID HO: 4 60 



Ilet 


Thr 


Val 


Lys 


Arg 


Ala 


Val 


Arg 


He 


Ala 


Leu 


Leu 


Thr 


Leu 


He 


Gly 


1 








S 








10 










15 




lie 


L? u 


Phe 


Ser 
20 


Ser 


Pro 


Ser 


Leu 


Val 

25 


Arg 


Ala 


Gin 


Ser 


Leu 
30 


Phe 


Ser 


Thr 


Glu 


His 


Val 


Leu 


Gin 


Leu 


Tyr 


Asn 


Lys 


He 


Leu 


Tyr 


Gly Glu 


Ser 






35 










40 


















Ala 


Ala 


A.^p 


Thr 


Val 


Ala 


Glu 


Lys 


Thr 


Ala 


Gly 


Glu 


Ser 


Ala 


Phe 


Pro 




50 










55 


















Phe 


He 


Asp 


Lys 


Leu 


He 


Acn 


Leu 


Gly 


Arq 


Thr 


Phe 


Leu 


Gly 


Lys 


Pro 


65 










70 










75 










80 


Tyr 


Arg 


Tyr 


Axg 


Gly 


Pro 


Ser 


Pro 


Trp 


Pro 


Met 


Aerp 


Cys 


S-r 


Gly Tyr 










85 










90 










95 




Val 


Ser 


Tyr 


Leu 


Tyr 


Ser 


Lys 


Phe 


Asp 


He 


Lys 


Leu 


Pro 


Arg 


Gly Ala 








100 










105 










110 






A]* 


Air, 


Gin 
115 


Ser 


Gin 


Tyr 


Thr 


Asn 
120 


Pro 


He 


Glu 


Arg 


Glu 

125 


Asp 


Val 


Arg 


Pro 


Gly 
130 


Asp 


Leu 


Leu 


Phe 


Phe 
135 


Lys 


Gly 


Arg 


Asn 


Ala 
140 


Arg 


Ser 


Asn 


Arg 


11^ 


Gly 


His 


Val 


Ala 


Leu 


Val 


Val 


Ser 


Val 


Asp 


Glu 


Asp Asp 


He 


Thr 


145 










150 










155 










160 


Hst 


Met 


His 


Ser 


Arg 
165 


Asn 


Ser 


Axg 


Gly 


He 
170 


Val 


He 


Glu 


Lys 


Leu 
175 


Asn 


Arc, 


Ser 


Ala 


Tyr 


Phe 


Ser 


Arg 


Arg 


Leo 


Val 


Ser 


Tyr 


Gly Arg Val 


Pro 








100 










185 










190 






Gly 


Ala 


Lyo 
195 


Arg 


Val 


He 


Pro 


Arg 
200 


Lys 


Ser 















(2) INFORMATION FOR SEQ ID NO: 4 61 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

ill) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YF.S 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalie 

(ix> FEATURE; 

(A) NAME /KEY: miscjeature 

(B) LOCATION 1. . .455 



{Xi} SEQUENCE DESCRIPTION: SEQ ID NO: 461 



Wet 


Lys 


Arg 


Thr 


He 


Leu 


Leu 


Thr 


Ala 


Leu 


Thr 


Val 


Lsu Ser Ser 


Leu 


1 








5 










10 






15 




Ser 


Leu 


Leu 


Arg 


Ala 


Gin 


Asn 


Glu 


Ser 


Glu 


Ala 


Set 


Thr Aen Pro 


Het 








20 










25 








30 




Ser 


Gly 


Leu 


Ser 


Leu 


Glu 


Asp 


Cys 


Tie 


Arg 


He 


Ala 


Lys Glu Arg 


Asn 






35 










40 










45 




Leu 


Asn 


Leu 


Arg 


Arg 


Gin 


Glu 


He 


Glu 


Gin 


Glu 


Am 


Arg He He 


Se- 




50 










55 










60 




Leu 


Asp Ala 


Ala 


Axg 


Hie 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asn Ala Gly 


Ile 


65 










70 










75 






00 


Gly 


Ills 


Asn 


Tyr 


Ser 


Phe 


Gly 


Arg 


Ser 




hep 


hyc 


Thr Gly Val 


Thr 










85 










50 






95 




Val 


Asp 


Arg 


Ser 


Ser 


Het 


Ae n 


Thr 


Asn 


Leu 


Ser 


He 


Gly Ala Ser 


Val 








100 










105 








110 




Glu 


Val 


Ph- 


Ser 


Gly Thr Arg 


Arg 


Leu 


His 


Asp 


Leu 


Lys Gin Gin 


Lys 






ils 










120 










125 


Tyr 


Asn 


Val 


GJu 


Asp 


Gly 


He 


Ala 


Arg 


Lsu 


Gin 


I.y? 


Ala Arg Glu 


Asp 



130 135 140 
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Leu Ser Leu Gin He Ala Ala Leu Tyr He Asn Leu Leu Phe Arg Gin 
145 ISO 155 160 

Glu Met Thr Arg Thr Ala Glu Thr Gin Leu Ala Leu He Arg Glu Gin 

165 170 ns 

Arq Asn Arg Thr Ala Glu Het Val Arg Val Gly Lys Trp Ala Glu Gly 

180 iGS 190 

Lys Leu Leu Asp He Asn Ala Gin Met Ala Lys Asp Glu Gin Leu Leu 

195 200 205 

Val Gin Tyr Arq Ser Glu Glu Glu Leu Ala Arg Leu Asp Leu Gly Gin 

210 215 220 

Ala Leu Glu Leu Glu His Pro Glu Ser He Ala Val Lys Ala Pro Asp 
225 230 C35 2 4 0 

Thr Asp Val Leu Val Ala Glu Arg Leu Gly Ser Leu Leu Ala Pro Glu 

245 250 255 

Glu He Tyr Arg Thr Ala Leu Gly Leu Lys Pro Ala Leu His Ser Ser 

260 265 270 

Glu Leu Gin He Ala Ser Ala Arg Glu Gly Leu Ala Ser Ala Arg A3 a 

275 200 285 

Ala Tyr Phe Pro Thr Leu Ser Leu Ser Ala Gly Tyr Ser Asn Gly T"r 

29C 295 300 

Phe Arg Asp Leu Gly Lys Glu Tyr Ala Ala lie Asn Pro Ser Ph= s-r 
305 310 31b " 320 

Glu Gin Trp Lys Asn Asn Gly Ser Tyr Ser II- Gly Leu Ser Leu Asn 

325 330 335 

He Pro He Phe Ser Ala Met Gin Thr Gin Ao-p Ary Val Arg Ser Ser 

340 345 350 

Arg Leu Gin Ho Arg Ser Ser Glu Leu Arg Leu Val Glu Glu Lys Ly* 

355 360 365 

Ala Leu Tyr Lys Glu He Arg Gin Ala Tyr Ser Asn Ala Val Ala Ala 

370 375 380 

Asp Lys Ala He Ala Ala Ala Glu Asn Ser Lys Ala Ala Thr Leu Lys 
3B5 390 395 4u0 

Ala Tyr Giu Tyr Ala Arg Asp Ser Phe Glu Ala Gly Arg Leu S-r Ala 

405 410 4 15 

Tyr Glu Tyr Ala Glu Ala Lys Thr Lys Tyr Ala Leu Ser Gin Val Glu 

420 425 430 

Glu Leu Arg Ala Lys Tyr Asp Phe He Tyr Lys Ala Lys Val Leu Asp 

435 440 445 

Phe Tyr Gin Gly Lys Asp Phe 
4S0 455 

{2} INFORMATION FOR SEQ ID NO; 462 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 
in) TYFE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

[vil ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 
(BJ LOCATION 1 . . .444 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 62 

Met Arg Phe Gin His Tyr Leu Tie Cys Thr Ala Ala Val Ala Ala Leu 

15 10 15 

Ala Ala As>\\ Pro Leu Thr Gly Gin Ser Asn Met Thr Leu Glu Glu cv* 

20 25 30 

He Asp Tyr Ala Arg Arg His Ser Ser Ala Val Ala Leu Ser Ala Ala 

35 40 45 

Glu Leu Glu Gin Ser Lys Ala Asp Tyr Leu Gin Ala Val Gly Asn Phe 

50 55 ecL 

Leu Pro Arg Val Ser Ala Gly Thr Gly Ala Ser Tro Asn Phe Gly Arq 

65 fo 75 eo 

Gly Leu Asp Ala Glu Thr Asn Thr Tyr Thr Asp He Asn Ser Ph- Asn 

85 90 q 5 

Asn Ser Tyr Ser He His Ala Thr Met Thr Leu Phe Asp Gly L-u Gin 

loo los no 
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Ser 


V d l 


Tj r 


Arq 


Leu 


Arg 


Met 


Ala 


His 


Ala 


Arg 


Arg 


Glu 


Ala 


Ser 


Arg 






11!, 










120 










125 








Let! 


S-r 


Val 


Arg 


Glu 


Gin 


Gin 


Glu 


Leu 


Ala 


Ala 


Leu 


Gly 


Thr 


Thr 


Glu 




130 










135 










140 










Ala 


Tyr 


Tyr 


Asp 


Lou 


Val 


Tyr 


Ala 


Arg 


Gin 


MeL 


Gin 


Glu 


Leu 


Ala 


Het 


145 










150 










15b 










160 


OJ n 


I.y* 


Tyr 


Glu 


Glu 


Ser 


Ser 


Arg 


Leu 


His 


Arg 


Gin 


Thr 


Ala 


Arg 


Met 










165 










170 










175 




Glu 


Glu 


Leu 


Gly net 


Lys 


Ser 


Arg 


Pro 


Asp 


Val 


LtrU 


Glu 


Met 


Gin 


Ser 


















185 










190 






Arg 


Met 


Ala 


Gly Asp 


Arg 


Leu 


Ala 


Leu 


Thr 


Gin 


Ala 


Asp 


Asn 


Gin 


Cys 






195 










200 










205 








lie 


lie 


Ale 


Leu 


He 


Arg 


Leu 


Lys 


Glu 


Lys 


Itet 


Asn 


Phe 


Pro 


He 


Asp 




210 










215 










220 










Asp 


Glu 




Val 


Val 


Asp Asp Met 


Pro 


Ala 


Asp 


Ser 


Leu 


Ser 


Ala 


Asp 


22b 










230 










235 










240 


Met 


Ala 


Glu 




Asp 


Ser 


Ser 


Ala 


Gly 


Val 


Phe 


Ala 


Arg 


Ala 


Ala 


His 










245 










250 










255 




Hi* 


His 


Pro 


Val 


Leu 


Leu 


Arg 


Ala 


Lys 


Leu 


Asp 


Glu 


Gin 


Ala 


Ala 


Thr 








250 










265 










270 






Asp 


Arg 


Leu 


Arg 


Ala 


Ala 


Arg 


Gly 


Ala 


Phe 


Leu 


Pro 


Scr 


Val 


Ser 


Val 






270 










230 










2B5 








S~r 


Gly 


Gly 


Trp 


Asn 


Thr 


Gly 


Phe 


Ser 


Arq 


Phe 


Leu 


Asr. 


Gly 


Ser 


Asp 




2?0 




















300 










Tyr 


Thr 


Fro 


Phe 


Ser 


Glu 


Gin 


Phe 


Arg 


Asn 


Ara 


Arg 


Gly 


Glu 


Tyr 


Val 












310 










315 










320 


Ser 


Leu 


Arn 


Leu 


Sex 


He 


?ro 


He 


Phe 


Ser 


Gly 


Phe 


Ser 


Leu 


Val 


Ser 










325 










330 










335 




His 


Leu 


Arg 


Gin 


Ala 


Arq 


Ala 


Glu 


Arg 


Arg 


Ala 


Ala 


Tie 


Val 


Arg 


Arg 








340 




















350 






Cly Glu 


Ala 


Glu 


Arg 


Arg 


Leu 


Tyr 


Ser 


Glu 


He 


Ala 


Gin 


Ala 


Met 


Ala 






355 










360 










365 








Asp 


Ary 


Asp 


Ala 


Ala 


Leu 


Ala 


Ser 


Tyr Arg 


Gin 


Ala 


Lys 


Glu 


His 


Thr 




37 0 










375 










380 










Asp 


Ala 


list 


Gin 


Thr 


Ala 


Tyr 


Glu 


Ala 


Val 


Leu 


Gin 


Arg 


Tyr 


Glu 


Glu 


3B5 










390 










395 








400 


Gly 


Leu 


Asn 


Thr 


Ala 


He 


Asp 


Leu 


Thr 


Thr 


Gin 


Ala 


Asn 


Arg 


Leu 


Leu 










4 05 










410 










415 




A.^p 


Ala 


Arq 


Val 


Gin 


Arg 


Leu 


Arg 


Ala 


Ala 


Ilet 


Thr 


Tyr 


Arg 


Leu 


Lys 








420 










425 










430 






Cys 


Lys 


Leu 


lie 


Ala 


Tyr 


Tyx 


Gly 


Cys 


Leu 


Ser 


Asp 














435 










440 



















12) IK FORMATION FOR SEO ID HO: 4 63 

(i) SEQUEUCE CHARACTERISTICS; 

(A) LEKGTH: 940 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY : linear 

till MOLECULE TYPE: protein 

(ili) HYPOTHETICAL: YES 

Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

fix) FEATURE: 

(A J IJAI IE /KEY : tniscjeature 
tB) LOCATlOtl 1. . .94 0 

Ui) SEQUENCE DESCRIPTION: SEQ ID HO: 4 63 



Met 


Asn 


Lys 


Phe 


Tyr 


Lys 


Ser 


Leu 


Leu 


Gin 


Ser 


Giy 


Leu 


Ala 


Ala 


Phe 


1 








5 










10 










15 




Val 


Scr 


Met 


Ala 


Thr 


Ala 


Leu 


Thr 


Ala 


Ser 


Ala 


Gin 


He 


Ser 


Phe 


Gly 








20 










25 










30 




Gly 


Glu 


Pre 

35 


Leu 


Ser 


Phe 


Scr 


Ser 
40 


Arg 


Ser 


Ala 


Gly 


Thr 


His 


Ser 


Phe 


Asp 


Asp 


Ala 


Met 


Thr 


He 


Arg 


Leu 


Thr 


Pro 


Asp 


Phe 


Asn 


Pro 


Glu 


Asp 




50 










55 










60 








Leu 


He 


Ala 


Gin 


Ser 


Arg 


Trp 


Gin 


Ser 


Gin Arg 


Asp 


Gly 


Arg 


Pro 


Val 


65 










70 










75 










80 


Axg 


lie 


Gly 


Gin 


Val 

85 


He 


Pro 


Val 


Asp 


Val 
90 


Asp 


Phe 


Ala 


Ser 


Lys 
95 


Ala 
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Ser 


His 


He 


Ser 


Ser 


He 


Gly 


Asp 


Val 


Asp 


Val 


Tyr 


Arg 


Leu 


Gin 


Phe 








100 










105 










110 






Lys 


Leu 


Glu 


Gly 


Ala 


Lys 


Ala 


lie 


Thr 


Leu 


Tyr 


Tyr 


Asp 


Ala 


Phe 


Asn 




lib 










120 










125 








Tl €> 


Pro 


Glu 


Gly 


Gly 


Arg 


Leu 


Tyr 


Tie 


Tyr 


Thr 


Pro 


Asp 


His 


Glu 


He 




130 










135 










140 










Val 


Leu 


Gly 


Ala 


Tyr 


Thr 


Asn 


Ala 


Thr 


His 


Arg 


Arg 


Asn 


Gly Ala 


Fhe 


145 










150 










155 










160 


Ala 


Thr 


Cl u 


Pro 


Val 


Pro 


Gly 


Ser 


Glu 


Leu 


He 


Met 


Asp 


Tyr 


GlU 


Val 










165 










170 










175 




Ser 


Arg 


Gly 


Gly 


Thr 


Leu 


Pro 


Asp 


He 


Lys 


He 


Ser 


Gly 


Ala 


Gly 


Tyr 


















185 










190 






lis 


Phe 


Arp 


Lys 


Val 


Gly 


Gly 


Acq 


Pro 


Val 


Thr 


Asp 


Asn 


His 


Tyr 


Gly 






195 










200 










205 








lie 


Gly 


Glu 


Asp 


Asp 


Ser 


Asp 


Ser 


Asp 


Cys 


Glu 


He 


Asn 


He 


Asn 


Cys 




210 










215 










220 










Pro 


Glu 


Gly 


Ala 


Asp 


Trp 


Gin 


Ala 


Glu 


Lys 


7\3r\ 


Gly 


Val 


val 


Gin 


Ilet 


221. 










230 










235 










24 0 


lie 


Net 


Val 


Lys 


Gly 


Gin 


Tyr 


He 


Ser 


Met 


Cys 


Ser 


Gly Asn 


Leu 


Leu 










245 










250 










255 




Asn 


Asn 


Thr 


Lys 


Gly 


Asp 


Phe 


Thr 


Pro 


I.CI1 


He 


He 


Ser 


Ala 


Gly 


Hi£ 








260 










265 










270 




Cys 


Ala 


Ser 


He 


Thr 


Thr 


Asn 


Ph- 


Gly 


Val 


Thr 


Gin 


Ser 


Glu 


Leu 


Asp 






2' ft. 










280 










285 








Lys? 


Trp 


He 


Phe 


Thr 


Phe 


His 


Tyr 


Glu 


Lys 


Arg 


Gly 


Cys 


Ser 


Asn 


Gly 




290 










295 










300 










Thr 


Leu 


Al a 


Tie 


Phe 


Arg 


Gly 


Asn 


Ser 


Ho 


He 




Ala 


Ser 


Met 


Lys 


30i» 










310 










315 










320 


Ala 


Phe 


Leu 


Pro 


He 


Lys 


Gly 


Lya 


Ser 


Asp 


Gly 


Leu 


Leu 


Leu 


Gin 


Lou 










325 










330 










335 




Asn 


Asp 


Glu 


Val 


Pro 


Leu 


Arg 


Tyr 


Arg 


Val 


Tyr 


Tyr 


Asn 


Gly Trp 


Asp 








34 O 










345 










350 






Ser 


Thr 


Pro 


Asp 


He 


Pro 


Ser 


Scr 


Gly 


Ala 


Gly 


He 


His 


His 


Pro 


Ala 






355 










360 










365 








Gly 


Asp 


Ala 


Het 


Lys 


He 


Ser 


He 


Leu 


Lys 


Lys 


Thr 


Pro 


Ala 


Leu 


Asn 




370 










375 










380 










Thr Trp 


He 


Ser 


Ser 


Ser 


Gly 


Scr 


Gly Gly Thr 


Asp 


Asp 


His 


Phe 


Tyr 


385 










390 










395 










400 


Phe 


Lys 


Tyr 


Asp 


Gin 


Gl y Gly 


Thr 


Glu 


Gly 


Gly 


Ser 


Ser 


Gly 


Ser 


Ser 










4 05 










410 










415 




Leu 


Phe 


Asn 


Gin 


Asn 


Lys 


Hit 


Val 


Val 


Gly 


Thr 


Leu 


Thr 


Gly Gly Ala 








4 2 0 










425 










430 






Gly Asn 


Cys 


Gly Gly 


Thr 


GlU 


Phe 


Tyr 


Gly 


Arg 


Leu 


Arn 


Ser 


His 


Trp 






435 










440 










445 








Asn 


GlU 


Tyr 


Ala 


Ser 


Asp Gly 


Asn 


Thr 


ser 


Aro 


Het 


Asp 


He 


Tyr 


Leu 




450 




















460 










Asp 


Pro 


Gin 


Atn 


A^n 


Gly 


Gin 


Thr 


Thr 


He 


Leu 


Asn 


Gly Thr 


Tyr Arg 


465 










470 










475 












Aqp 


Gly 


Tyr 


Lys 


Pro 


Leu 


Pro 


Ser 


Val 


Pro 


Arg 


Leu 


Leu 


Leu 


Gin 


Ser 










485 










400 










4 95 




Thr 


•31 y 


Asp 


Gin 


Val 


Glu 


Leu 


Asn 


Trp Thr 


AJa 


Val 


Pro 


Ala 


Asp 


Gin 








500 










505 










510 






Tyr 


Pro 


Ser 


Ser 


Tyr 


Gin 


Val 


Glu 


Tyr 


His 


He 


Phe 


Arg Asn Gly 


Lys 






51S 










520 










525 








Glu 


lie 


Ala 


Thr 


Thr 


Lys 


Glu 


Leu 


Sar 


Tyr 


Ser 


Asp 


Ala 


He Asp 


Glu 




530 










535 










540 










Ser 


He 


He 


Gly 


Ser 


Gly 


He 


He 


Arg 


Tyr Glu 


Val 


Ser 


Ala 


Arg 


Phe 


545 










550 










555 










560 


He 


T/r 


Pro 


Ser 


Pro 


Leu 


Asp 


Gly Val 


Glu 


Ser 


Tyr 


Lys 


Asp 


Thr 


Asp 










565 










570 










57 5 




Lys 


Thr 


Ser 


Ala 


Asp 


Leu 


Ala 


He 


Gly Asp He 


Gin 


Thr 


Lye 


Leu 


Lys 








580 










585 










590 






Pro 


Asp Val 


Thr 


Pro 


Leu 


Pro 


Gly Gly 


Gly Val 


Ser 


Leu 


Ser Trp 


Lyr 






595 










600 










605 








Val 


Pro 


Phe 


Leu 


Sor 


Gin 


Leu 


Val 


Ser 


Arg 


Phe 


Gly 


Glu 


Ser 


Pro 


Asn 




610 










615 










620 










Pro 


Val 


Phe 


Lys 


Thr 


Phe 


Glu 


Val 


Pro 


Tyr 


Val 


Ser 


Ala 


Ala 


Ala 


Ala 


625 










630 










635 










540 


Gin 


Thr 


Pro 


Asn 


Pro 


Pro 


Val 


Gly Val 


Val 


He 


Ala 


Asp Lys 


Phe 


Met 










645 










650 










655 




Ala 


Gly Thr 


Tyr 


Pro 


Glu 


L^s 


Ala 


Ala 


Xle 


Ala 


Ala 


Val 


Tyr 


Val 


Met 








660 










665 










67 0 






Pro 


Ser 


Ala 


Pro 


A?p 


Ser 


Thr 


Phe 


Hie 


Leu 


Phe 


Leu 


Lys 


Ser 


Asn 


Thr 






615 










600 










6H5 








Asn 


Arg 


Arg 


Leu 


Gin 


Lys 


Val 


Thr 


Thr 


Pro 


Ser 


Asp 


Trp 


Gin 


Ala 


Gly 
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690 










fc9S 










700 










Thr 


Trp 


Leu 


Arg 


He 


Asn 


Leu 


Asp 


Lys 


Pro 


Phe 


Pro 


Val 


Asn 


Acn 


Asp 


70S 










710 










715 










720 


His 


Met 


Leu 


Phe 


Ala 
725 


Gly 


He 


Arg 


Het 


Pro 
730 


Asn 


Lys 


Tyr 


Lys 


Leu 
735 


Asn 


Arg 


Ala 


He 


Arg 
7*10 


Tyr 


Val 


Arg 


Asn 


Pro 
745 


Asp 


Asn 


Leu 


Phe 


Ser 
750 


lie 


Thr 


Gly 


Lys 


Lys 


He 


Ser 


Tyr Asn 


Asn 


Gly 


Val 


Ser 


Phe 


Glu Gly Tyr 


Gly 






755 










760 










765 








He 


Pro 
770 


Spr 


Leu 


Leu 


Gly 


Tyr 
775 


Met 


Ala 


He 


Lys 


Tyr 
700 


Leu 


Val 


Val 


Asn 


Thr 


Asp 


Ala 


Pro 


Lys 


He 


Asp 


Met 


Ser 


I.eu 


V*l 


Gin 


Glu 


Pro 


Tyr 


Ala 


7BS 










790 










795 










800 


Lj s 


Gly 


Thr 


Asn 


Val 
805 


Ala 


Pro 


Phe 


Pro 


Glu 
810 


Leu 


Val 


siy 


He 


Tyr 
815 


Val 


Tyr 


Lys 


Asn 


Gly 

820 


Thr 


Phe 


lis 


Gly 


Thr 

825 


Gin 


Asp 


Pro 


Ser 


Val 
830 


Thr 


Thr 


Tyr 


Ser 


Val 


Ser 


Asp 


Gly Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 


Glu 


He 


Lys 


Leu 






835 










840 










845 








Val 


Tyr 


Lys 


Gly 


Ser 


Gly 


He 


Ser 


Asn 


Gly Val 


Ala 


Gin 


lie 


Glu 


Asn 




850 










05b 










860 










Asn 


Asn 


Ala 


Val 


Val 


Ala 


Tyr 


fro 


Ser 


Val 


Val 


Thr 


Asp Arg 


Phe 


Ser 


865 










870 










875 










880 


He 


Lys 


Acn 


Ala 


Hie 
865 


Met 


Val 


HI.*? 


Ala 


Ala 
890 


Ala 


Leu 


Tyr 


Ser 


Leu 

9 95 


Asp 


Gly 


Lys 


Gin 


Val 


Arg 


Ser 


Trp 


Asn 


Acn 


Leu 


Arg 


Asn 


Gly Val 


Thr 


Phe 








300 










9 Ob 










910 






Ser 


Val 


Gin 
915 


Gly 


Leu 


Thr 


Ala 


Gly 
920 


The 


Tyr 


Met 


Leu 


Val 
925 


Met 


Gin 


Thr 


Ala 


Asn 
930 


Gly 


Pro 


Val 


Ser 


Gin 
935 


Lys 


He 


Val 


Lys 


Gin 
940 











(2] INFORMATION FOR SEQ ID NO: 464 

ti) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 67 0 amino actd? 
(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(11) nOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalls 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1 . . .67 0 



(xl> SEQUENCE DESCR1 PTION : SEQ ID 110:464 



Met 


Lys 


Tyr 


Leu 


He 


Arg 


Lgu 


Phe 


Leu 


Ser 


Leu 


Met 


Leu 


Leu 


Ser 


Leu 


1 








5 










10 










15 




Trp 


Thr 


Gly 


Cys 


Thr 


His 


Glu 


Glu 


Leu 


Ser 


He 


Cys 


Asp Gly 


Glu 


Asn 








20 










25 










30 






Thr 


Leu 


Val 


Leu 


Arg 


Val 


Glu 


Thr 


Gly 


Lys 


Ala 


Pro 


Asn 


Ala 


Arg Ala 






35 










40 










45 








Thr 


Glu 


Pro 


Gly 


Gin 


Gly 


He 


Tyr 


Asn 


Glu 


Asn 


Lys 


Val 


Gly Ser He 




50 










55 










60 










Ser 


Val 


Leu 


Phe 


Tyr 


Leu 


Glu 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Val 


Lys 


Ser 


65 










70 










75 










00 


Thr 


Arp 


Tyr 


Gin 


He 
85 


His 


Glu 


Gly 


Ala 


Tyr 
90 


lie 


71- 


Pro 


Val 


Lys 

9S 


GlU 


Gin 


Met 


Arg 


Pro 

100 


Leu 


Phe 


Asn 


Gly 


Asn 
105 


Asn 


Asn 


Phe 


Ser 


He 
110 


Tyr 


Val 


Val 


Ala 


Asn 
115 


Leu 


Asp 


Phe 


Asn 


Ala 
120 


Pre 


Ala 


Thr 


Glu 


Ala 
125 


Ala 


Leu 


Sec 


Gin 


Phe 


Val 


Val 


Glu 


Lys 


Ser 


He 


Glu 


Vai 




Ser- 


Thr 


Thr 


Ala 


Pro 




130 








135 










140 










Ala 


Asp 


Phe 


Val 


Met 


Leu 


Ala 


His 


Gly 


Asn 


Lys 


Gin 


He 


Asn 


Met 


Ali 


145 










150 








155 










160 


Thr 


Thr 


Glu 


Gly 


Ly3 


Leu 


Leu 


Gly 


Asp 


Tyr 


L/s 


Leu 


Lys Arg 


Val 


Ala 










165 










170 










175 




Ala 


Lys 


He 


Arg 


Met 


He 


Lys 


Pro 


Thr 


He 


Asn 


Val 


Gin 


Gly 


Tyr 


Glu 
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10 



20 
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190 185 190 

Val Val Cly Asn lie Gin Ala Lys Phe Arg Asn Sec Val Thr Lys Gly 

*-5 200 20S 

Phe Leu Thr Thr Glu Ala Gin Glu Tie Pro Ala Ala Ala Ser Tyr Lvs 

210 215 220 

Thr Ser Glu Tyr Leu Asp lie Ala Glu Ser Ala Pro Ala Asn Ser Il~ 
225 230 235 240 

His- Phe Tyr Ser Tyr Tyr Asn Lys Trp Thr Leu Ser Thr Pro Glu Ly* 

245 250 255 

Arg Pro Glu Phe Phe lie Met Val Lys Phe Lys Lys Thr Gly Gin Pro 

260 265 270 

Asp Asn Thr Ala Lys Pro Tyr Tyr Tyr Arg Val Pro Leu Glu Ser Gin 

275 280 285 

Asp Asn Gin Val Lys Ser Asn Val Leu Tyr Asn Leu Asn Val Lys He 
15 290 295 300 

Glu He Leu Gly Ser Leu Gin Glu Pro Glu Ala Val Ser Val Asn Gly 
305 310 3l5 32 £ 

Thr Leu Ala He Glu Glu Trp He Leu His Gin Asp Ala Phe Asn Leu 

325 330 335 

Pro Ala Thr Asn Tyr Leu Tie Val Glu Gin His Glu He Phe Met Asn 

340 345 350 

Asn Val Asn Thr Tyr Ser Val Lys Tyr Gin Thr Ser Gin Lys Pro He 

355 360 355 

S^r He Ser He Gin Ser Val Thr Phe Ser Tyr Val Ser Ser Asp Gly 
^ 370 375 3B0 

Thr Gin His Asn Asp Leu Val Ala Ser Ser Ser Asp Gin Tyr Pro Thr 
385 390 395 .100 

He Thr S^r Asp Asn Thr S~r He He He Tnr Ser Lys Ile Pro Vai 

on 405 410 415 

dU A3n Val Lys Lys He Val Phe Glu Val Thr Asn Gly Val Ala 

420 425 430 

Gly Leu Lys Glu Thr Val Thr Val Leu Gin Tyr Pro Ala Gin Phe He 

435 4 4 o 445 

Val Asn Thr Leu Gly Thr Ala Ser Ala Trp Arg Pro Asp Gly Ser Leu 
d ° 450 455 450 

Ala Pro Gly Leu Asn Asn Lys Ala He Tyr His Val Val Val Leu Val 
465 470 475 4Q0 

Pro Pro Glu Asn Leu phe Glu Asp Gly Thr Gin Thr II- He Gly Tyr 

405 490 495 

Pro Pro Thr Glu Thr He Ser Phe His Lys Lys Glu Asn Asn Thr Tyr 

500 SOS 510 

Pro He Val Trp Ser Asp Thr Asn Thr Thr Lys Gin Asp Leu Glu Thr 

515 520 525 

Ser Arg I let He Ser Pro Ser Phe Glu Leu Ala Ser Gin Leu Gly Ala 
4t) 5 30 535 54Q 

Thr L-u Pro Het Pro Tyr Leu Glu Tyr Trp Pro Gly Thr Scr Tyr L-u 
545 550 55^ 56Q 

Leu Asp Tyr Ser Gly Asn Tyr Asn Asn Lys Arg Tyr Ala Leu Phe Asn 

£- n 565 570 575 

OU Cy.- Ala Phe Tyr Trp Glu Lys Arg Lys Val Asn Asn Glu Glu He Lys 
580 58S 590 

Phe Asp Asp Trp Arg Leu Pro Thr Glu Ala Glu He Lys Leu He Asp 

695 600 60S 

Lys Leu Gin His Asn Glu Gin Ser Ala Val Gin Ala He Met Thr Glv 
t>3 610 615 620 

Asn Tyr Tyr Trp Asp Ser Tyr Ser Ala Asn Gly Ser Tyr Lye Met Gin 
625 630 635 64Q 

Gly Gly Gly Gly Gin Gly Asn Ser Ser Lys Ala Tyr Val Arg Cys Val 

645 650 655 

Arg Asp Val Lys Lys Pro He Arg Asp Lys Lys Ser Gly Lys 
660 6Gb 670 

[2> INFORMATION FOR SEQ ID NO: 4 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill} HYPOTHETICAL: YES 

(VI) ORIGINAL SOURCE: 
75 (A) ORGANISM: Porphyromonas gingivalrs 



40 



60 



65 



70 
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lix) FEATURE ; 

(A) NAME/ KEY: miscjeature 
(B» LOCATION 1 - . .1282 



I xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 65 



Met 


Arg 


Lye 


T 1 C: 


Leu 


Ser 


Phe 


Leu 


Met 


Jlet 


Cys 


Ser 


Leu 


Has 


Leu 


Gly 


1 








5 










10 










15 




Leu 


Gin 


Ser 


Gin 


Thr 


Trp 


His 


Gly 


Asp 


Pro 


Asp 


Scr 


Val 


Ala 


Ala 


Leu 








20 










25 










30 






Pro 


Ser 


He 


Gly 


T 1 c- 


Gin 


Glu 


Ser 


Ser 


cys 


The 


Arg 


He 


Thr 


Phe 


Glu 






Jb 










40 










45 








Val 


Val 


Phe 


Pro 


Giy 


Fhe 


Tyr 


Ser 


Val 


Glu 


Lys 


Arg 


Glu 


Gly Asn 


Gin 




50 










55 










60 










Val 


Phe 


GJn 


Arg 


He 


Ser 


Met 


Pro 


Gly 


Cys 


Gly 


Ser 


Phe 


Gly Asn 


Leu 


65 










70 










75 










80 


Gly 


Glu 


Ala 


Glu 


Leu 


Pro 


Val 


Leu 


Lys 


I.ys 


Met 


He 


Ala 


Val 


Pro 


Glu 










85 










90 










PS 




Phe 


Ser 


Thr 


Ala 


Asn 


Val 


Ala 


Val 


Lys 


Il« 


Lye 


Glu 


Thr 


Glu 


Thr 


Phe 


















105 










110 






A^p 


At n 


Tyr 


Acn 


He 


Tyr 


Pro 


Asn 


Pro 


Thr 


Tyr 


Val 


Vai 


Glu 


Glu 


Lsu 






115 










120 










125 








Dro 


Glu 


G1-, 


Gly 


Thr 


Tyr 


Leu 


Val 


Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn 


Asp 




130 










135 










140 








Tyr 


Tyr 


Ser 


Gin 


As n 


Val 


Ser 


Leu 


Pro 


S~r 


Thr 


His 


Tyr 


Val 


Tyr 


S~r 


145 










lbu 










155 












Gin 


Asp 


Gly 


Tyr 


Phe 


Arg 


Ser 


Gin 


Arg 


Phe 


He 


Glu 


Val 


Thr 


Leu 


Tyr 










165 










170 










175 




Pro 


Phe 


Arg 


Tyr 


Asn 


Pro 


Val 


Arg 


Gin 


Glu 


He 


Leu 


Phe 


Ala 


Lys 


Lys 








100 










185 










190 




Ilo 


Glu 


Val 


Thr 


He 


Thr 


Phe 


Asp 


Asn 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 






195 










200 










205 






Asn 


Thr 


Gly 


He 


Phe 


Asn 


Lys 


Val 


Ala 


s-r 


Ser 


Ala 


Phe 


He 


Asn 


Tyr 




210 










215 










220 










Glu 


Ala 


Asp 


Gly 


Lys 


Ser 


Ala 


He 


Glu 


Asn 


Asp 


Met 


Val 


Phe 


Ser 


Arg 


225 










230 










Z'Si> 










240 


Gly 


Thr 


Thr 


Thr 


Tyx 


He 


S=r 


Gly 


Asn 


Val 


Ala 


Ser 


Asn 


Leu 


Pro 


Gin 










245 










250 










255 






Cys 


Asp 


Tyr 


Leu 


Val 


He 


Tyr 


Atfp 


Asp 


Met 


Phe 


Asn 


Val 


Asn 


Gin 








260 










265 










270 






Gin 


Pro 


His 


Asp 


Glu 


He 


Lys 


Arg 


Leu 


Cys 


Glu 


His 


Arg 


Ala 


Phe 


Tyr 






275 










280 










285 






A en 


Gly 


Phe 


Asp 


Val 


Ala 


Ala 


Val 


Ser 


He 


Lys 


Asp 


Val 


Leu 


Asn 


s»r 




2 90 










2 95 










300 










Ph- 




S-r 


Asn 


Ala 


hr 




Tyr 


He 


Asn 


Glu 


Thr 


Lys 


Leu 


Lys 


Asn 


305 










310 










315 








320 


Phe 


He 


Arq 


Ser 


Val 


Tyr 


Asn 


Gin 


Ser 


Asn 


Ala 


Lys 


Arg 


Thr 


Leu 


Asp 










325 










330 










335 




Gly 


Lys 


Leu 


Gly Tyr 


Val 


Leu 


Lent 


He 


Gly 


Lys 


Pro 


Leu 


Ser 


Lys 


Tyr 








34 0 










34 5 










350 






Leu 


Ala 


Asp 


Thr 


Asp 


Asn 


Thr 


Lys 


Val 


Pro 


Thr 


Ser 


Phe 


He 


His 


Asn 






355 










360 










365 








Val 


Ser 


Leu 


He 


Pro 


S-r 


Ills 


Pro 


Thr 


Phe 


Gly 


Ser 


Ilfc 


Cys 


Ala 


Ser 




37 0 










375 










380 








Asp 


Tyr 


Phe 


Phe 


Ser 


Cys 


Val 


Ser 


Pro 


Lfiu 


Asp 


Thr 


Val 


Gly Asp 


Leu 


385 










390 










395 










400 


Phe 


He 


Gly 


Arg 


The 


Ser 


Val 


Thr 


Asn 


Ala 


His 


Glu 


Leu 


Kis 


Asn 


T.eu 










405 










410 










415 




He 


Glu 


Lys 


Thr 


He 


Asn 


Lys 


Glu 


HQ 


ser 


Tyr 


Asn 


Pro 


He 


Ala 


His 








420 










425 










430 






Lys 


Asn 


Ho 


T.pu 


Tyr 


Ala 


Glu 


Gly 


Lys 


Gly 


Cys 


Arfp 


Ala 


Pro 


He 


Le'i 






435 










440 










445 








Arg 


Leu 


Phe 


Leu 


Lys 


Glu 


He 


Ala 


Ser 


Gly 


Tyr 


Thr 


Val 


Asn 


Her 


He 




A SO 










4 55 










460 










Leu 


Lys 


Ser 


Acn 


Gin 


Val 


Ser 


Ala 


He 


Asp 


Ser 


He 


Phe 


Asp 


Cys 


Leu 


465 










470 










475 








480 


Arm 


Asn 


Gly 


Ser 


His 


His 


Phe 


Tyr 


Phe 


Asn 


Thr 


His 


Gly 


Met 


Pro 


Thr 










405 










4 90 










495 




Val 


Trp 


Gly 


He 


Gly 


Gin 


Gly 


Leu 


Asp 


Val 


Asn 


Thr 


Leu 


Thr 


Ala 


Arg 








soo 










505 










510 






Leu 


Asn 


Asn 


The 


Ser 


Ser 


Gin 


Gly 


Leu 


Cys 


Thr 


Ser 


Leu 


Ser 


Cys 


Ser 






515 










520 










525 






Ser 


Ala 


Val 


Ala 


Asp 


Ser 


Thr 


He 


Arg 


Ser 


Leu 


Gly 


Glu 


Val 


Leu 


Thr 




530 










535 










540 
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Thr 


Tyr 


Al a 


Pro 


Asn 


Lys 


Gly 


Phe 


Ser 


Ala 


Phe 


Leu 


Gly 


Gly 


Ser 


Arg 


545 










S50 










555 










560 


Ala 


Thr 


Gin 


Tyr 


Ala 


Val 


Tyr 


Leu 


Glu 


Gly 


Pro 


Cys 


Pro 


Pro 


Ser 


Glu 










565 










57 0 








57 5 




Phe 


Tyr 


Glu 


Tyr 


Leu 


Pro 


Tyr 


Ser 


Leu 


Tyr 


His 


Asn 


Leu 


Ser 


Thr 


Val 








580 










585 










590 






Val 


Gly 


Glu 


llet 


Leu 


Ltu 


Ser 


Ser 


He 


He 


Asn 


Thr 


Asn 


Ser 


Val 


Asp 






595 










600 










605 






Thr 


Tyr 


Ser 


Lys 


Pho 


Asn 


Phe 


Asn 


Leu 


Leu 


Gly 


Asp 


Pro 


Ala 


Leu 


Asn 




610 










615 










620 










lie 


M«t 


Ala 


His 


Gly 


Met 


Glu 


Val 


Ser 


Asn 


Cys 


Ho 


Thr 


Leu 


Pro 


Asn 


625 










630 










635 










640 


Am 


Thr 


11* 


He 


Ser 


Ser 


Pro 


lie 


Thr 


He 


Lys 


Asn 


Gly 


Gly 


Cys 


Leu 




















650 










65b 




Lys 


He 


Pro 


Glu 


Lys 


Gly 


Val 


Leu 


His 


Phe 


Thr 


Asn 


Asn 


Gly 


Ser 


He 








660 










665 










67 0 






Gin 


VM 


Met 


Ser 


Cly 


Gly 


Thr 


Leu 


Glu 


lie 


Gly 


Asn 


Gin 


Aid 


Lys 


lie 






67 5 










680 










685 








Ser 


Gly 


Glu 


Thr 


Gly 


Ala 


Asn 


Pro 


Thr 


Phe 


He 


Thr 


Val 


Tyr 


Gly 


Asp 




690 










695 










7 00 








Gly 


Leu 


Ala 


He 


Asn 


Lys 


Gin 


Val 


Glu 


He 


Asp 


Asn 


Ho 


Asp 


Arg 


T.eu 


705 










710 










715 










720 


A?!i 


Leu 


Phe 


Sei 


Thr 


Kir 


Ser 


Val 


1 let 


Pro 


Lys 


Phe 


His 


Phe 


Asp 


Sex 










725 










730 










735 




Val 


T.'T 


Ph~ 


A."n 


Ser 


Ala 


Pro 


Leu 


Tyr 


Thr 


Thr 


Asn 


Cys 


He 


Val 


Glu 








7 JO 










"745 










750 






He 


Ser 


Asn 


C/s 


Glu 


Phe 


Thr 


A3n 


Arg 


Ser 


Asp 


He 


He 


Ser 


Lys 


Asn 






755 










760 










765 






Cys 


Asp 


1.61 1 


Ser 


Val 


Glu 


Asn 


Ser 


Itet 


Phs 


Ser 


Ser 


Ser 


Gly 


He 


Thr 




no 










775 










780 








Val 


Phe 


Lys 


Pro 




Ala 


Thr 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Ser 


Thr 


Lys 


785 










790 










7 95 










800 


Ala 


Lyr 


He 


Thr 


Asp 


Asn 


Thr 


Phe 


Phe 


Ala 


Thr 


Gly 


Asn 


Phe 


Ala 


Tyr 










805 










010 










815 


His 


He 


Thr 


Asn 


Thr 


Pro 


Gly 


Leu 


Thr 


Ala 


Thr 


Ser 


Asn 


Ala 


Ala 


He 








82C 










825 










830 






Lys 


Leu 


Asp 


Asn 


He 


Pro 


Glu 


Tyr 


Tyr 


He 


Ser 


Gly 


Asn 


Lys 


He 


Val 






835 










840 










845 








Asn 


Cys 


Asp 


Glu 


Ala 


Leu 


Val 


Leu 


Asn 


Asn 


Ser 


Gly 


Asn 


Arg 


Thr 


Asn 




850 










855 










860 










Arg 


Leu 


His 


Asn 


He 


Thr 


Arg 


Asn 


Val 


He 


Lys 


Asn 


Cys 


Arg 


He 


Gly 


865 










870 










875 










880 


Ser 


Thr 


Leu 


Tyr 


Asn 


S^r 


Tyr 


Gly 


He 


Tyr 


Asn 


Arg 


Asn 


Lys 


He 


Ser 










885 










890 










895 




A/*n 


Asn 


His 


lie 


Gly 


Val 


Acq 


Leu 


Lsu 


Asn 


Asn 


Ser 


Cys 


Phe 


Tyr 


Phe 








900 










90S 










910 






Asp 


n 


Ala 


Pro 


Val 


IlG 


Asn 


Glu 


Glu 


Asp 


Lys 


Gin 


Thr 


Phe 


He 


Ser 






915 










9?0 










92 5 








Asn 


Arq 


Thr 


Trp 


Gin 


Leu 


Tyr 


Ser 


Ssr 


Asn 


Gly 


Thr 


Phe 


Pro 


Leu 


Asn 




930 










935 










940 










Phe 


His 


Tyr 


Am 


Scr 


Leu 


Gin 


Gly 


Gly 


Asp 


Thr 


Asp 


Thr 


Trp 


lit; 


Tyr 


945 










950 










955 










960 


Asn 


Asp 


Thr 


Tyr 


Thr 


Asn 


Arg 


Tyr 


Ho 


Asp 


Val 


Ser 


Asn 


Asn 


His 


Trp 










965 










970 










975 


Gly 


Ac'n 


Asn 


Asp 


Leu 


Php 


Asp 


Pro 


A^n 


G±n 


Val 


Phe 


Asn 


Thr 


Pro 


Asp 








980 










985 










990 




Leu 


Ph- 


He 


Trp 


He 


Pro 


Phe 


Trp 


Asp 


Gly 


Leu 


Pro 


Asn Gly 


Arg 


Ser 






995 










1000 








1005 






Gly 


Asn 


Ser 


Ser 


Al« 


Glu 


Ala 


Val 


Glu 


Phe 


Gin 


Thr 


Ala 


Leu 


Asp 


Cys 




1010 








1015 








1020 




He 


Gly 


Asn 


S-?r 


Asp 


Tyr 


Leu 


Ser 


Ala 


Lys 


Val 


Ala 


Leu 


Lys 


He*: 


Met 


1 0?5 








1030 








1035 








1 


Val 


Glu 


Thr 


Tyr 


Pro 


Glu 


Ser 


Asp 


Phe 


Ala 


He 


Ala 


Ala 


Leu 


Lys 


Glu 










1045 








1050 








1055 


Leu 


Phe 


Arg 


He 


Glu 


Lys 


Htt 


Ser 


Gly 


Asn 


Asp 


Tyr 


Glu 


Gly 


Leu 


Lys 








1060 








1065 








1070 




Asp 


Tyr 


Phe 


Arg 


Ser 


Asn 


Pro 


Thr 


He 


He 


Ser 


Ser 


Gin 


Asn 


Leu 


Phe 



1075 1080 1085 



pro Thr Ala Asp Phe Leu Ser Ala Arg Cys Asp Ila.Val Cys Glu Asn 

1090 1095 1100 

Tyr Gin Ser Ala He Asp Trp Tyr Glu Asn Arg Leu Asn Ser Glu Tie 
1105 1110 1H5 1120 

Ser Tyr Gin Asp Sc-r Val Phe Ala Val He Asp Leu Gly Asp He Tyr 

1125 1130 1135 

Trp Asn Met Gin Leu Asp Ser Leu Arq Gly Thr Gly He Asp Leu Asn 
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1140 1145 11SO 

Tie Leu Ser Cys Glu Gin Arg Lys Ser Leu Glu Ssr His Gin Asn Val 

1155 1160 1165 

Lys Asn Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly The Leu Leu 

1170 1175 1180 

Pro Pro Leu Glu Cys Asn Lys Sec Ser Leu Asp Lys Ser Lys lie lie 
1185 1190 1195 1200 

Ser He Ser Pro Asn Pro Ala Lys Ala Val Val Thr He lie Tyr Tyr 

1205 1210 1215 

Thr Asp Asn Pro Sec C} s Ser Val Tie Lys lie Tyr Gly He Asn Gly 

1220 1225 1230 

Ala Ser Ala Asp He Thr Gly Leu Pro Lys His Leu Ser Glu Gly Tyr 

1235 1240 1245 

Tyr Ser lie Gin Phe Asn Thr Ser Asm Phe Asp Pro Gly Phe Tyr Leu 

1250 1255 126Q 

Val Thr Leu Asn Val Asp Gin Lys He He Aep Thr Glu Lys Leu Arg 
1265 1270 1275 1280 

He Lys 



(2 J INFORMATION FOR SEQ ID 1IO: 466 

O) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1274 amino acids 
IB) TYPE: amino a^id 
(V) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(la.il HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAN I S1 1: Porphyroraonas gingivalis 

<Jx) FEATURE : 

(A) NAME/KEY: misc_feature 
(B> LOCATION 1...1274 



<xi] SEQUENCE DESCRIPTION: SEQ ID NO: 4 66 



Het 


Met 


Cys 


Ser 


Leu 


His 


Leu 


Gly 


Leu 


Gin 


Sec 


Gin 


Thr 


Trp 


His 


Gly 


1 








5 










10 










15 




Asp 


Pro 


Asp 


Ser 


Val 


Ala 


Ala 


Leu 


Fro 


Ser 


He 


Gly 


Tie 


Gin 


Glu 


Ser 








20 










25 










30 






Ser 


Cys 


Thr 


Arg 


He 


Thr 


Phe 


Glu 


Val 


Val 


Phe 


Pro 


Gly 


Phe 


Tyr 


Ser 






35 










40 










45 








Val 


Glu 


Lys 


Arg 


Glu 


Gly 


Asn 


Gin 


Val 


Phe 


Gin 


Arg 


He 


Ser 


Het 


Pro 




50 










55 










60 










Gly 


Cys 


Gly 


Ser 


Phe 


Gly 


Asn 


Leu 


Gly 


Glu 


Ala 


Glu 


Leu 


Pro 


Val 


Leu 


65 










70 








~* 5 










80 


Lys 


Lys 


Het 


He 


Ala 


Val 


Pro 


Glu 


Phe 


Ser 


Thr 


Ala 


Asn 


Val 


Ala 


Val 










85 










90 










95 




Lys 


He 


Lys 


Glu 


Thr 


Glu 


Thr 


Phe 


Asp 


Asn 


Tyr 


Asn 


He 


Tyr 


Pro 


Asn 








100 










105 










110 






Pro 


Thr 


Tyr 


Val 


Val 


Glu 


Glu 


Leu 


Pro 


Glu 


Gly 


Gly 


Thr 


Tyr 


Leu 


Val 






115 










120 










125 








Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn 


Asp 


Tyr 


Tyr 


Ser 


Gin 


Asn 


Val 


Ser 


Leu 




130 










135 










110 










Pro 


Ser 


Thr 


His 


Tyr 


Val 


Tyr 


ser 


Gin 


Asp 


Gly 


Tyr 


Phe 


Arg 


Ser 


Gin 


145 










150 










155 










160 


Ary 


Phe 


He 


Glu 


V^l 


Thr 


Leu 


Tyr 


Pro 


Phe 


Arg 


Tyr 


Asn 


Pro 


Val 


Arg 










165 










170 










175 




Gin 


Glu 


He 


Leu 


Phe 


Ala 


Lys 


Lys 


He 


Glu 


Val 


Thr 


lie 


Thr 


Phe 


Asp 








180 










185 










190 






Asn 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 


Asn 


Thr 


Gly 


He 


Phe 


Asn 


Lys 


Val 






195 










200 










205 








Ala 


Ser 


Ser 


Ala 


Phe 


He 


Arn 


Tyr 


Glu 


Ala 


Asp 


Gly 


Lys 


Ser 


Ala 


He 




210 










215 










220 










Glu 


Aen 


Asp 


Uet 


Val 


Phe 


Sex- 


Arg 


Gly 


Thr 


Thr 


Thr. 


Tyr 


He 


Ser 


Gly 


22b 










230 










235 








240 


Asn 


Val 


Ala 


Ser 


Asn 


Leu 


Pro 


Gin 


Asn 


Cys 


A.rp 


Tyr 


Leu 


Val 


He 


Tyr 




















250 










255 


Asp 


Asp 


Het 


Phe 


Asn 


Val 


Asn 


Gin 


Gin 


Pro 


Hi- 


Asp 


Glu 


He 


Lys 


Arg 








260 










265 










270 






Leu 


Cys 


Glu 


His 


Arg 


Ala 


Phe 


Tyr 


Asn 


Gly 


the 


Asp 


Val 


Ala 


Ala 


Val 
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275 










280 










285 








Ser 


He 


Lys 


Asp 


Val 


Leu 


Asn 


Ser 


Phe 


Pro 


Ser 


Asn 


Ala 


Thr 


Ser 


Tyr 




290 










295 










300 










He 


Asn 


Glu 


Thr 


Lys 


Leu 


Lys 


Asn 


Phe 


He 


Acg 


Ser 


Val 


Tyr 


Asn 


Gin 


305 










310 










315 










320 


Ser 


Asn 


Ala 


Lys 


Arg 


Thr 


Leu 


Asp 


Gly 


Lys 


Leu 


Gly 


Tyr 


Val 


Leu 


Leu 










325 










330 










335 




He 


Gly 


Lys 


Pro 


Leu 


Ser 


Lys 


Tyr 


Leu 


Ala 


Asp 


Thr 


Asp 


Asn 


Thr 


Lys 








340 










345 










350 






Val 


Pro 


Thr 


Set 


Phe 


He 


His 


Asn 


Val 


Ser 


Leu 


He 


Pro 


Ser 


His 


Pro 






355 










360 










365 








Thr 


Phe 


Gly 


Ser 


He 


cys 


Ala 


Ser 


Asp 


Tyr 


Phe 


Phe 


Ser 


Cys 


Val 


Ser 




370 










37 5 










3B0 










Pro 


Leu 


Asp 


Thr 


Val 


Gly 


Asp 


Leu 


Phe 


He 


Gly 


Arg 


Phe 


Ser 


Val 


Thr 


365 










390 










395 










400 


Asn 


Ala 


His 


Glu 


Leu 


His 


Asn 


Leu 


He 


Glu 


Lys 


Thr 


He 


Asn 


Lys 


Clu 










405 










410 








115 




He 


Ser 


Tyr 


Asn 


Pro 


He 


Ala 


His 


Lys 


Asn 


He 


Leu 


Tyr 


Ala 


Glu 


Gly 

v 








420 










425 










430 




Ly* 


Gly 


Cys 


Asp 


Ala 


Pro 


He 


Leu 


Arg 


Leu 


Phe 


Leu 


Lys 


Glu 


11 a 


Ala 






435 










440 










445 








Ser 


Gly 


Tyr 


Thr 


Val 


Asn 


Ser 


He 


Leu 


Lys 


Ser 


Asn 


Gin 


Val 


Ser 


Ala 




4 50 










4 55 










460 










Tie 


ASp 


Ser 


lie 


Phe 


Asp 


Cys 


Leu 


Asn 


Asn 


Gly 


Ser 


His 


His 


Phe 


Tyr 


465 










470 










475 












Phe 


Aim 


Thr 


His 


Gly 


tlet 


Pre 


Thr 


Val 


Trp 


Gly 


He 


Gly 


Gin 


Gly 


Leu 










485 










4 90 








495 




Asp 


Val 


Asn 


Thr 


Leu 


Thr 


Ala 


Arg 


Leu 


A^sn 


Asn 


Thr 


Ser 


Ser 


Gin 


Gly 








500 










505 










510 




Leu 


Cys 


The 


Ser 


Leu 


Ser 


Cys 


Ser 


Ser 


Ala 


Val 


Ala 


Asp 


Ser 


Thr 


He 






515 










520 










525 








Arg 


Ser 


Leu 


Gly 


Glu 


Val 


Leu 


Thr 


Thr 


Tyr 


Ala 


Pro 


Asn 


Lys 


Gly 


Phe 




530 










535 










54 0 








Ser 


Ala 


Phe 


Leu 


G3y 


Gly 


Ser 


Arq 


Ala 


Thr 


Gin 


Tyr 


Ala 


Val 


Tyr 


Leu 


545 










550 










555 








560 


Glu 


Gly 


Pro 


Cys 


Pro 


Pro 


Ser 


Glu 


Phe 


Tyr 


Glu 


Tyr 


Leu 


Pro 


Tyr 


Ser 










565 










570 










575 




Leu 


Tyr 


His 


Asn 


Leu 


Ser 


Thr 


Val 


Val 


Gly 


Glu 


Met 


Leu 


Leu 


Ser 


Ser 








580 










585 










590 






He 


He 


Asn 


Thr 


Asn 


Ser 


Val 


Asp 


Thr 


Tyr 


Ser 


Lys 


Phe 


Asn 


Phe 


Asn 






595 










600 










605 








Leu 


Leu 


Gl y 


Asp 


Pro 


Ala 


Leu 


Asn 


He 


Met 


Ala 


His 


Gly 


Met 


Glu 


Val 




610 










615 










620 








Ser 


Asn 


Cys 


He 


Thr 


Leu 


Pro 


Asn 


Asn 


Thr 


He 


He 


Ser 


Ser 


Pro 


He 


625 










630 










635 










640 


Thr 


He 


Lys 


Am 


Gly 


Gly 


Cys 


Leu 


Lys 


He 


Pro 


Glu 


Lys 


Gly 


Val 


Leu 










6-15 










650 










655 




His 


Phe 


Thr 


Asn 


Asn 


Gly 


Ser 


He 


Gin 


Val 


Met 


Ser 


Gly 


Gly 


Thr 


Leu 








660 










665 








670 






Glu 


He 


Gly 


Asn 


Gin 


Ala 


Lys 


He 


Ser 


Gly 


Glu 


Thr 


Gly 


Ala 


Asn 


Pro 






675 










660 










685 








Thr 


Phe 


He 


Thr 


Val 


Tyr 


Gly 


Asp 


Gly 


Leu 


Ala 


He 


Asn 


Lys 


Gin 


Val 




690 










695 










700 








Glu 


He 


Asp 


Asn 


He 


Asp 


Arq 


Leu 


Asn 


Leu 


Phe 


Ser 


Thr 


His 


Ser 


Val 


705 










710 










715 










720 


Het 


Pro 


Lys 


Phe 


His 


Phe 


Asp 


Ser 


Val 


Lys 


Phe 


Asn 


Ser 


Ala 


Pro 


Leu 










725 










730 










735 




Tyr 


Thr 


Thr 


Asn 


Cys 


He 


Val 


Glu 


Il« 


Sex 


Asn 


Cys 


Glu 


Phe 


Thr 


Asn 


















745 










750 






Arg 


Ser 


Asp 


He 


He 


Ser 


Lys 


Asn 


Cys 


Asp 


Leu 


Ser 


Val 


Glu 


Asn 


Set 






•755 










760 










765 








ltel 


Phe 


Ser 


Ser 


Ser 


Gly 


He 


Thr 


Val 


Phe 


Lys 


Pro 


Met 


Ala 


Thr 


sor 




770 










775 










780 










Ser 


He 


Thr 


Gly 


Leu 


Ser 


Thr 


Lys 


Ala 


Lys 


He 


Thr 


Asp 


Asn 


Thr 


Phe 


785 










7 90 










795 








800 


Phe 


Ala 


Thr 


Gly 


Asn 


Phe 


Ala 


Tyr 


His 


He 


Thr 


Asn 


Thr 


Pro 


Gly 


Leu 










fl 05 










810 










83 5 




Thr 


Ala 


Thr 


Ser 


Asn 


Ala 


Ala 


He 


Lys 


Leu 


Asp 


Asn 


He 


Pro 


Glu 


Tyr 








82C 










825 










830 




Tyr 


He 


Ser 


Gly 


Asn 


Lys 


lie 


Val 


Asn 


Cys 


Asp 


Glu 


Ala 


Leu 


Val 


Leu 






635 










840 










845 








Asn 


Asn 


Ser 


Gly 


Asn 


Arg 


Thr 


Asn 


Arg 


Leu 


His 


Asn 


Tie 


Thr 


Arg 


Asn 




850 










855 










860 








Val 


He 


Lys 


Am 


Cys 


Arg 


He 


Gly 


Ser 


Thr 


Leu 


Tyr 


Asn 


Ser 


Tyr 


Gly 


865 










870 










875 








880 
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lie 


Tyr 


Asn 


Arg 


Asn 


Lys 


lie 


Ser 


Aen 


Asn 


His 


He 


Gly 


Val 


Arg 


Leu 










885 










8 90 










895 




L-u 


Asn 


Asn 


Ser 


Cys 


Phe 


Tyr 


Phe 


Asp 


Asn 


Ala 


Pro 


Val 


He 


Asn 


Glu 








900 










905 










910 






Glu 


Asp 


Lys 


Gin 


Thr 


Phe 


He 


Ser 


Asn 


Arg 


Thr 


Trp 


Gin 


Leu 


Tyr 


Ssr 






915 










920 










925 






Ser 


Asn 


Gly 


Thr 


Phe 


Pro 


Leu 


Asn 


Phe 


His 


Tyr 


Asn 


Ser 


Leu 


Gin 


Gly 




930 










935 










94 0 










Gly 


Asp 


Thr 


Asp 


Thr 


Trp 


He 


Tyr 


Aen 


Aep 


Thr 


Tyr 


Thr 


Asn 


Arg 


Tyr 


945 










950 










955 










960 


Tie 


Asp 


Vr, 1 


Ser 


Asn 


Asn 


His 


Trp 


Gly 


Asn 


Asn 


Asp 


Leu 


Phe 


Asp 


Pro 










965 










970 










97 5 




Asn 


Gin 


Val 


Pht 


Asn 


Thr 


Pro 


Asp 


Leu 


Phe 


He 


Trp 


He 


Pro 


Phe 


Trp 








900 










985 










990 






Asp 


Gly 


Leu 


Pro 


Asn 


Gly Arg 


Ser 


Gly 


Asn 


Ser 


Set 


Ala 


Glu 


Ala 


Val 






995 










1000 








1005 






GlU 


Phe 


Gin 


Thr 


Ala 


Leu 


Asp 


Cys 


Ho 


Gly Asn 


Ser 


Asp 


Tyr 


Leu 


Ser 




J 010 








1015 








1020 








Ala 


Lys 


Val 


Ala 


Leu 


Lys 


Met 


!let 


Val 


Glu 


Thr 


Tyr 


Pro 


Glu 


Ser 


Asp 


1025 








1030 








1035 








11 


Phe 


Ala 


He 


Ala 


Ala 


Leu 


Lys 


Glu 


Leu 


Phe 


Arg 


He 


Glu 


Lys 


Met 


Ser 










logs 








1050 








1055 


Gly 


Asn 


Asp 


Tyr 


Glu 


Gly 


Leu 


Lys 


Asp 


Tyr 


Phe 


Arg 


Ser 


Asn 


Pro 


The 








1050 








1065 








1070 




lie 


11* 


Ser 


Ser 


Gin 


Asn 


Leu 


Phe 


Pro 


Thr 


Ala 


Asp 


Phe 


Leu 


ser 


Ala 






1075 








ioao 








1085 






Arg 


C/S 


Asp 


lie 


Val 


Cys 


Glu 


Asn 


Tyr 


Gin 


Ser 


Ala 


He 


Asp 


Trp 


Tyr 




1090 








1095 








1100 








G1 \i 


Asn 


Arg 


Leu 


Asn 


Ser 


Glu 


He 


Ser 


Tyr 


Gin 


Asp 


Ser 


Val 


Phe 


Ala 


1105 








1110 








111b 








i : 


Val 


lie 


Asp 


Leu 


Gly 


Asp 


He 


Tyr 


Trp Asn 


Met 


Gin 


Leu 


Asp 


Ser 


Leu 










1125 








1130 








1135 


Arg Gly 


Thr 


Gly 


He 


Asp 


Leu 


Asn 


He 


Leu 


Ser 


Cys 


Glu 


Glr. 


Arg 


Lys 



1140 1145 1150 

Ser Leu Glu Ser His Gin Asn Val Lys Asn Tyr Leu Leu Ser Thr Leu 

1155 1160 1165 

Pro Glu Ser Thr Gly Thr Leu Leu Pro Pro Leu Glu Cys Asn Lys Ser 

1170 1175 1180 

Ser Leu Asp Lys Ser Lys lie He Ser He Ser Pro Asn Pro Ala Lys 
11B5 1190 1195 1200 

Ala Val Val Thr He He Tyr Tyr Thr Asp Asn Pro Ser Cys Ser Val 

1205 1210 1215 

He Lys He Tyr Gly He Asn Gly Ala Ser Ala Asp He Thr Gly Leu 

1220 1225 1230 

Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser He Gin Phe Asn Thr Ser 

1235 1240 3245 

Asn Phe Asp Pro Gly Phe Tyr Leu Val Thr Ltu Asn Val Asp Gin Lys 

1250 1255 1260 

He He Asp Thr Glu Lys Leu Arg He Lys 
17 65 127 0 



(2) INFORMATION FOR SEQ ID NO: 467 

H) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 amino acids 
(BJ TYPF.: amino acid 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

(till HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival ie 

(is J FEATURE : 

(A| NAME /KEY : mise feature 

(B) LOCATION 1...925 

IxiJ SEQUENCE DESCRIPTION: SEQ ID NO: 4 67 

Met Ala He Met Het Lys Ser He Val Phe Arg Ala Phe Leu Thr He 

1 5 10 ]5 

Leu Leu Ser Trp Ala Ala He Thr Asn Pro Thr Ala Gin Glu He Ser 
20 25 30 
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Gly Met Asn Ala Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr He 

35 jo , 1£ 

Leu Tyr Glu Ser Phe Glu Asn Gly Pro Val Pro Asn Gly Trp Leu Glu 

50 55 60 

He Asp Ala Asp Ala Asp Gly Ala Thr Trp Gly Scr Pro Ser GJ y Ser 
65 70 75 80 

Phe Ser Val Pro Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His Il~ 

85 90 95 

Arg Ser Gly He Ser Thr Ala Gly Asn Tyr Leu He Thr Pro Asn He 

100 105 no 

Glu Gly Ala Lye Arg Val Lys Tyr Trp Val Cys Asn Gin Tyr Ser Tht 

Ho 120 125 

Asn Pro blu His Tyr Ala Val Met V«l Ser Thr Thr Gly Thr Ala II- 

130 135 no 

Glu Asp Fhe Val Leu Leu Phe Asp Asp Ser He Thr <ily Lys Pro Thr 
150 155 leo 

rro Leu Val Trp Arg Arg Arg He Val Asp Leu Pro Glu Gly Thr Lys 

165 170 175 

Tyr He Ala Trp Arg His Tyr Lys Val Thr A-p Ser His Thr Glu Ph« 

180 185 190 

Leu Lys- Leu Asp Asp Val Thr Val Tyr Arg Ser II- Git. Gly Pro Glu 

1?5 200 205 

Pro Ala Thr Asp Phe Thr Val He Asn He Gly Gin Asn Val Glv Arg 

210 215 220 

Leu Thr Trp Asn Tyr Pro Glu Asp Tyr Glr. Pro Glu Gly Lys Gly Asn 
225 230 235 240 

Glu Glu Leu Gin Lou Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr L«u 

2<1 5 250 255 

Leu Ala Gin He Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr 

260 265 270 

Tyr Ser Leu Arg Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala 

275 280 205 

Val Tyr As P Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His 

290 255 300 

Tyr Ala Thr Asp Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val 
305 310 315 320 

Pro Asn Gly Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe S=r Trp 

325 330 335 

Gly HJs Tyr Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gl- 

340 345 350 

His Cys Ser Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr 

355 360 365 

Pro Asp Asn Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val 

370 375 380 

Lys Tyr Trp Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu Hi- Tvr 
385 390 395 ~ 4uC 

Ala Vai U-t Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val 11= 

405 410 415 

Leu Phe Glu Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu 

«20 425 430 

Arg Thr He Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Aru His 

435 440 445 

Tyr Asn Cys Thr Asp He Tyr Phe Leu Lys Leu Asp Asp He Thr Vai 

450 455 460 

Phe Gly Thr Pre Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val 
465 470 47S 480 

Ser Leu He Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn 

485 490 495 

Gly Tyr (31 1: Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala 

500 505 510 

Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp 

515 520 525 

Pro Thr Val Leu Glu Tyr He Asp Giu Thr Tyr Ser Ser Arq Asp Asp 

530 535 540 

Gin Val Glu Val Clu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He 
545 550 555 56Q 

Glu Ser Gin Ser Val Cys Asp Lys Leu He Tyr Asp Ser Gin Ser Asp 

565 570 575 

He He Leu Tyr Glu Gly Phe Glu Ala Gly Ser lie Pro Giu Gly Trp 

580 585 590 

Leu Leu He Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro 

5?S 600 605 

Trp Thr Met Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser 

610 615 6^0 

Tyr Leu Pro tlet He Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr 
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625 










630 










635 










640 


Pro 


Arg 


Leu 


Gl u 


Gly Ala 


Lvs 


Leu 


Val 


- 


Tyr 


l rp 


Val 


Ser 


Ala 


Gin 










645 










650 










655 




Asp 


Ala 


Val 


Ty r 
660 


Ser 


Ala 


Glu 


His 


Tyr 
665 


Al a 


Val 




Val 


Ser 
670 


Thr 




Gly Thr 


Al a 


Val 


Glu 


Asp 


Phe 


Va 1 




Leu 


Phe 


Glu 




Thr 


Met 


Thr 






675 










680 










685 










Lys 


Ala 


Asn 


Gly Ala 


i rp 


Tyr 


Glu 


Arg 


Thx 


11° 


-i nr 


Leu 


Pro 


Al a 




690 










695 










700 










Gly 


Thr 




Tyr 


He 


Ala 


Trp 


Arg 


His 


Tyr 


Ar P 


Cys 


Thr 


Asp 


Met 




705 










7lO 










715 










720 


Phs 


Leu 


Leu 


Leu 


Asp 
725 


Asp 


He 


Thr 


Val 


1 1 r 
7 30 


Arg 


Ser 


Thr 


Glu 


Thr 

735 


Val 


Pro 


Glu 


Pro 


Val 


Thr 


Asp 


Phe 


Val 


Va] 


Ser 




He 


Glu 


Asn 


Asn 










740 










745 










750 




Gly 


Arg 


Leu 


Lys 


Trp 


Ann 


Tyr 


Pro 


Asn 


Gly 


Tyr 


Glu 


Pro 


Asp Lys 


Thr 






755 










760 










765 








Asp 


Asp 
770 


Lys 


Lys 


Pro 


Leu 


Gin 
775 


Leu 


Thr 


Gly 


Tyr 


Asn 
780 


He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 


Leu 


Val 


His 


He 


Gin 


Asp 


Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


785 










7 90 










795 










800 


Asp 


Glu 


Thr 


Tyr 


Ser 


Ser 


Arg 


Asp 


Gly 


Gin 


v=U 


Glu 


Met 


Glu Tyr 


Cys 










805 










310 










815 




Val 


Thr 


Ala 


Val 


Tyr 


Asn 


Asp 


Asn 


He 


Glu 


Ssr 


Gin 


Ser 


Val 


Cys 


Asp 








820 










625 










830 


Lys 


Leu 


Asn 
835 


Tyr 


Thr 


He 


Thr 


Ser 
8 4 0 


I.eu 


Asp 


Asn 


He 


Gin 
845 


ser 


Asp 


Thr 


Set* 


Leu 
850 


Lye 


lie 


Tyr 


Pro 


Asn 
855 


Pro 


Ala 


Ser 


Tyr 


Val 
860 


Val 


Arg 


lie 


Glu 


Gly 


Leu 


Ser 


Arg 


Ser 


Lys 


Ser 


Thr 


He 


Glu 


Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


865 










870 










875 










880 


lie 


Cys 


Tie 


Leu 


Arg 
885 


Glu 


Glu 


Thr 


His 


Ser 
990 


Glu 


Lys 


Thr 


Glu 


He 
895 


Asp 


Val 


Ser Arg 


Leu 


Asn 


Asp 


Gly 


Val 


Tyr 


Leu 


He 


Lys Val 


Val 


Gly 


Gly 








900 










905 










910 








Lys 


Thr 
915 


Thr 


Thr 


Glu 


Lys 


Val 
920 


GlU 


He 


Lys 


Arg 


Pro 
925 









(2> INFORMATION FOR SEQ ID MO: 468 

(i> SEQUENCE CHARACTERISTICS: 

(Ai LENGTH : 922 amino acids 
(B> TYPE: amino acid 
ID) TOPOLOGY: linear 

(lit I tOLECULE TYPE: protein 

U U ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival ts 

(ixl FEATURE: 

(AJ NAME/KEY: mlsc_feature 
IB) LOCATION 1. . .922 

{Xi> SEQUENCE. DESCRIPTION: SEQ ID NO: 468 



Met 


Met 


Lys 


Ser 


lie 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Sex 


1 








5 










10 










15 




Trp 


Ala 


Ala 


11- 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly Met 


Asn 








20 










25 










30 






Ala 


Ser 


Cys 
3b 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 














40 










45 








Ser 


Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


lie 


Asp 


Ala 




50 










55 








60 










Asp 


Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 


65 










70 










75 










80 


Pro 


Tyr 


Gly 


His 


Atn 
8S 


Gly 


Leu 


Cys 


Thr 


Tyr 
90 


Ser 


His 


lie 


Arg 


Ser 


Gly 


He 


Ser 


Thr 


Ala 
100 


Gly 


Asn 


Tyr 


Leu 


He 
105 


The 


Pro 


Aen 


He 


Glu 
110 


Gly 


Ala 


Lys 


Arg 


Val 
US 


Lye 


Tyr 


Trp 


Val 


Cys 
120 


Asn 


Gin 


Tyr 


Ser 


Thr 
125 


Asn 


Pro 


Glu 


His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 
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130 135 HO 

Val Leu Leu Phe Asp Asp Ser lie Thr Gly Lys Pro The Pro Leu Val 
1*5 150 155 160 

Trp Arg Arg Arg He Val Asp Leu Pro Glu Gly Thr Lys Tyr He Ala 

165 170 175 

Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu 

180 185 190 

Asp Asp Val Thr Val Tyr Arg Ser He Glu Gly Pro Glu Pro Ala Thr 

195 200 205 

Asp Phe Thr Val He Asn He Gly Gin Asn Val Gly Arg Leu Thr Trp 

210 215 220 " 

Asn Tyr Pro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn Glu Glu Leu 
225 230 235 240 

Gin Leu Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr Leu Leu Ala Gin 

245 250 255 

He Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu 

260 265 ?70 

Arg Asp Asn Pro Lsu Gin Val Glu Tyr Cyt Val Thr Ala Val Tyr Asp 

275 200 265 

Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr 

290 295 300 

Asp Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly 
305 310 315 320 

Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Tro Gly His Tyr 

325 33C " 335 

Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys S-r 

3-10 3-15 350 

Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr Pro Asd Asn 

355 360 365 

Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp 

37 0 375 380 

Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val Met 
395 390 395 400 

Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He L-u Phe Glu 

405 410 435 

Glu Thr (let Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr He 

420 425 430 

Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His Tyr Asn Cys 

435 440 445 

Thr Asp He Tyr Phe Leu Lys Leu Asp Asp He Thr Val Phe Gly Thr 

450 455 460 

Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val Ser Leu He 
465 470 475 480 

Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu 

485 490 495 

Pro Asp Lys Thx Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn 

S00 505 510 

He Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp Pro Thr Val 

515 520 525 

Leu Glu Tyr He Asp Glu Thr Tyr Ser Ser Ara Asp Asp Gin Val Glu 

530 535 * 540 

Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He Glu Ser Gin 
545 550 555 560 

Ser Val Cys Asp Lys Leu Ho Tyr Asp Ser Gin Ser Asp He He Leu 

565 57 0 575 

Tyr Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He 

580 505 590 

Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr M-t 

5*5 600 605 

Tyr Gly His Asp Ser Glu Lys Cys H e Ala Ser Pro Sor Tyr Leu Pro 

610 615 620 

H*t He Gly Val Leu The Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu 
6 - b 630 535 640 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 

645 650 655 

Tyr Ssr Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 

€60 665 670 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala 

675 680 --685 

Asn Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Vs 

690 695 700 

Tyr He Ala Trp Arg His Tyr Asp Cys Thr Asp Het Phe Phe Leu Leu 
705 710 715 720 

Leu Asp Asp He Thr Val Tyr Arg Ssr Thr Glu Thr Val Pro Glu Pro 
72 5 730 735 
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Val 


Thr 


Asp 


Phe 


Val 


Val 


Ser Leu 


He 


Glu 


Asn 


Asn 


Lys 


Gly 


Arg 


Lsu 








740 








745 










750 






Lys 


Trp 


Asn 


Tyr 


Pro 


Asn 


Gly Tyr 


Glu 


Pro 


Asp 


Lys 


Thr Asp 


Asp 


Lys 






755 








7 60 










765 








Lys 


Pto 


Leu 


Gin 


Leu 


Thr 


Gly Tyr 


Asn 


He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 




770 










775 








780 










Leu 


Val 


His 


11 e 


Gin 


Asp 


Pro Thr 


Val 


Leu 


Glu 


Tyr 


He 


Asp 


Glu 


Thr 












7 90 








7 95 








800 


Tyr 


Sec 


Sec 


Arg 


Asp 


Gly 


Gin Val 


Glu 


Met 


Glu 


Tyr 


Cys 


Val 


Thr 


Ale 










005 








810 










B15 




V«l 


Tjr 


Acn 


Asp 


Asn 


He 


Glu Ser 


Gin 


Ser 


Val 


Cys 


Aep 


Lys 


Leu 


Asn 








820 








825 










830 






Tyr 


Thr 


He 


Thr 


Ser 


Leu 


Asp hsrt 


Tlo 


Gin 


Ser 


Asp 


Thr 


Ser 


Leu 


Lys 






835 








340 










845 








He 


" iL 


Pro 


Asn 


Pro 


Ala 


Ser Tyr 


Val 


Val 


Arg 


He 


Glu 


Gly 


J.PU 


Ser 




850 










855 








860 








Arg 


Ser 


Lys 


Ser 


Thr 


He 


Glu Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


He 


Cys 


He 


865 










870 








875 










880 


Lfeu 


Acy 


Glu 


GlU 


Thr 


His 


Ser Glu 


Lys 


Thr 


Glu 


lis 


Asrp Vol 


Ser 


Ard 










885 








890 










895 






Afn 


Asp 


Gly 


Val 


Tyr 


Leu He 


Lyr 


Val 


Val 


Gly 


Gly 


Asn 


Lys 


Thr 








900 








90S 










910 






Thr 


Thr 


Glu 


Lys 


Val 


Glu 


He Lys 


Arg 


Pro 















915 920 

(2) IIIFORHATION FOR SEQ ID NO: 469 

11) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 921 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) IIOLECULE TYPE: protein 

(ill J HYPOTHETICAL: YES 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalxs 

t ix ) FEATURE : 

(Al I IAMB/ KEY: miscjeaturt 
(D) LOCATION 1 . . . 921 



(Ki) SEQUENCE DESCRIPTION: SEQ ID MO: 4 69 



Her 


Lyr 


Spr 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


Tie 


Leu 


Leu 


Ser 


Trp 


1 








5 










10 










15 




Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


lie 


Ser 


Gly 


Met 


Asn 


Al a 








20 










25 










30 






Ser 


Cys 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


lie 


Leu 


Tyr 


Glu 


Ser 






35 










40 










45 








Phe 


Glu 


Asn 


G3 y 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


He 


Asp 


Ala 


Asp 




50 










55 








60 








Ala 


Asp 


Gly Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 


Pro 


65 










70 










75 










80 


Tyr 


Gly His Asn 


Gly 


Leu 


Cys 


Thr 


Tyr 


Ser 


His 


He 


Arg 


Ser 


Gly lie 










85 










90 










95 




Ser 


Thr 


Ala 


Gly Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Asn 


He 


Glu 


Gly Ala 


Lys 








100 










105 










110 




Arg 


VuL 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu 


His 






115 










120 










125 








Tyr 


Ala 


Val 


Net 


Val 


Sor 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 


Val 




130 










135 










140 








Leu 


Leu 


Phe 


Asp Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val 


Trp 


145 










150 










155 










160 


Arg 


Arg 


Arg 


He 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


He 


Ala 


Trp 










165 










170 










175 


Arg 


His 


Tyr 


Lys 


Val 


Thr 


A&p 


Ser 


Hlr 


Thr 


Glu 


Phe 


Leu 


Lys 


Leu 


Asp 








180 










185 










190 




Asp 


Val 


Thr 


Val 


Tyr 


Arg 


Ser 


He 


Glu 


Gly 


Pro 


Glu 


Pro 


Ala 


Thr 


A*.p 






195 










200 










205 








Phe 


Thr 


Val 


He 


Asn 


He 


Gly 


Gin 


Asn 


Val 


Gly 


Arg 


Leu Thr Trp 


Aen 




210 










215 










220 










Tyr 


Pro 


Glu 


Asp 


Tyr 


Gin 


Pro 


Glu 


Gly 


Lye 


Gly 


A3n 


Glu 


Glu 


Leu 


Gin 


225 










230 










235 










2-10 
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Cly 


Tyr 


Asn 


lie 


Tyr 


Ala 


Asn 


Gl y 


Thr 


Leu 


Leu 


Ala 


Gin 


He 










245 










2 50 










255 




Lys 


Asp 


Val 




He 


Leu 


Glu 


Tyr 


Val 


Asp 


Ser 


Thr 


Tvr 


Ser 




rtrg 








260 










265 










270 






Asd 

p 


Asn 


Pro 




Gin 


Val 


Glu 




Cvs 


Val 


Thr 


Alu 


Val 




ASp 


Gl u 






275 










280 










285 








Set 


lie 


Glu 


Ser 


Ser 


Thr 


val 


Cys 


Gly 


Thr 


Leu 


His 


Tyr 


Ala 


Thr 






Z90 










295 










300 








Al a 


T 1 r- 


Leu 


Tyr 


Glu 


Asn 


Phe 


GlU 


Asn 


Gly 


Pro 


Vcl 


Pro 




Gl v 


i rp 


305 










31 0 










315 








320 


Leu 


Val 


He 


Asp 


Ala 


Asp 


Gly 


Asp 


Gty 


Phe 


Ser 


Trp 


Gly 


His 


Tvr 












32S 










33C 










335 




Asn 


At .t 


Tyr 


Asp 


Ala 


Phe 


Pro 


Gly 


His 


Asn 


Gly 


Glv 

VIA j, 


His 


Cys 


Scr 


Leu 








340 










345 










350 






Ser 


Ala 


Ser 


Tyr 


Val 


Pro 


Glv 


I le 




Pro 


Val 


Thr 


Pro 


Asp 


Asn 








355 










360 










365 








T.'mj 


ITr 


Thr 


Pro 


Lys 


Val 


Glu 


Glv 


Ala 


L 




Val 


Lys 


1 i r 


_ 

1 rp 


Val 




37q 










375 










380 










Ser 




Gin 


Asp 


Ala 


Asn 


i t p 


Ala 


Ala 


Gl u 


His 


*T**r 

1 j r 


Al a 


Val 




Al a 


305 










390 










395 










4 00 


Ser 


Thr 


Thr 




Thr 


Ala. 


Val 






Phe 


Val 


I lo 




he 


Glu 


Glu 










4 05 










410 














Thr 


e ~ 


The 


Ala 


Lys 


Pro 


Th r 




Ala 


Trp 


ryr 


GlU 


Arg 


Thr 


He 


Asn 








4 20 










425 










4 30 






T Oil 


Pre 


Glu 


Gly 


Thr 


L. j ~- 


i i r 


He 


Ala 


l rp 


Arg 




Tyr 


Asn 


Cys 


Thr 






435 










440 










4 4 5 








P 


lie 


T'T 


Phe 


Leu 


Lys 


Leu 




A— p 


He 


Thr 


Val 


Ph** 






Pro 




4 SO 










-155 










460 










Ala 


Ser 


Glu 


Fro 


Glu 


Pro 


Val 


Thr 


ASp 


Phe 


Vnl 


Val 


Ser 


I «*U 


He 




4 65 










470 










47 5 












As n 


Asm 


L**s 




Arg 


Leu 


T v«* 


_ 

l rp 


Asn 


_ 

ryr 


Pro 


Asn 


Gly 


Tyr 


Glu 


p 

Pro 










485 










4 90 








495 




sp 




Thr 


n— p 


Asp 


T "« 


Af p 


Pro 


L^u 


Gin 


L*5U 


Ala 


Gly 


Tyr 


Asn 


He 








500 










505 










510 






iyr 


Ala 


Asn 






Leu 


Leu 


val 


Hxs 


11^ 


in 


Asp 


Pro 


Thr 


Val 


,plJ 






515 










520 










525 








Glu 


i > r 


L le 


f\Sp 


Glu 


Thr 


* J r 


Ser 


Ser 


rg 


so 


Asp 


Gin 


Val 


Glu 


3 




53 0 










535 










540 










Glu 


r 


Cys 


Va 1 


Thr 


Ala 


Val 


l \ r 


Asn 


Asp 


Asn 


lie 


Glu 


Ser 


Gin 


Ser 


545 










550 










555 










560 


Val 


Cys 


Asp 


Lys 


Leu 


lie 


Tyr 


Asp 


Ser 


Gin 


Ser 


Asp 


lie 


He 


Leu 


Tyr 










565 










570 










57 5 


Glu 


Gly 


Phe 


Glu 


Ala 


Gly 


Ser 


He 


Pro 


Glu 


Gly 


Trp 


Leu 


Leu 


He 


Asp 








590 










58S 










590 






Ala 


Asp 


Gly 


Asp 


Asn 


Val 


Asn 


Trp 


Asp 


Tvr 


T 

1 jf r 


Pre 


Trp 


Thr 


Met 








5 95 










600 










605 








Gly 


Hli 


Asp 


Ser 


Glu 


Lys 


Cys 


He 


Ala 


Ser 


Pro 


Ser 


Tyr 


Leu 


Pro 


Met 




610 










615 










620 










lie 


Gly 


Val 


Leu 


Thr 


Pro 


Asp 


Asn 


Tyr 


Leu 


Val 


Thr 


Pro 


Arg 


Leu 


Glu 


625 










630 










635 










640 


Gly 


Ala 


Lye 


Leu 


val 


Lys 


Tyr 


Trp 


Val 


Ser 


Ala 


Gin Asp 


Ala 


Val 


Tyr 










645 










650 










655 


Ser 


Ala 


Glu 


His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly Thr Ala 


Val 








660 










66S 










67 0 






Glu 


Asp 


Phe 


Val 


Leu 


Leu 


Phe 


Glu 


Glu 


Thr 


Ilet 


Thr 


Ala 


Lys 


Ala 


Asn 






675 










680 










685 








Gly 


Ala 


Trp 


Tyr 


Glu 


Arg 


Thr 


He 


Thr 


Leu 


Fro 


Ala 


Gly Thr 


Lys 


Tyr 




690 










695 










700 










Tlo 


Ala 


Trp 


Arq 


His 


Tyr 


Asp 


Cys 


Thi 


Asp 


Het 


Phe 


Phe 


T.eu 


Leu 


Leu 


705 










710 










715 










720 


Asp 


Asp 


He 


Thr 


Val 


Tyr 


Arg 


Ser 


Thr 


Glu 


Thr 


Val 


Pro 


Glu 


Pro 


Val 










725 










730 










735 




Thr 


Asp 


Phe 


Val 


Val 


Ser 


Leu 


He 


Glu 


Asn 


Asn 


Lys 


Gly Arg 


Leu 


Lys 








740 










745 








750 






Trp 


Asn 


Tyr 


Pro 


Arn 


Gly 


Tyr 


Glu 


Pro 


Asp 


Lys 


Thr 


Asp Asp 


Lys 


Lys 






755 










760 










7 65 






Tro 


Leu 


Gin 


Leu 


Thr 


Gly 


Tyr 


Asn 


He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 


Leu 




770 










775 








7 80 










Val 


His 


He 


Gin 


Asp 


Pro 


Thr 


Val 


L« J 


Glu 


Tyr 


He- Aap 


Glu 


Thr 


Tyr 


7 05 










790 










795 










BOO 


Ser 


S~r 


Arg 


Asp 


Gly 


Gin 


Val 


Glu 


Met 


Glu 


Tyr 


Cy? 


Val 


Thr 


Ala 


Val 










805 










810 








B15 




Tyr 


A^n 


Asp 


Asn 


He 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Asp 


Lys 


Leu 


Asn 


Tyr 








870 










82o 








830 






Thr 


He 


Thr 


Ser 


Lou 


Asp 


Asn 


He 


Gin 


Ser 


Asp 


Thr 


Ser 


Leu 


Lys 


He 
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835 










840 










845 








Tyr 


Pro 
850 


Asn 


Pro 


Ala 




Tyr 
855 


Val 


Val 


Arg 


He 


Glu 
860 


Gly 


Leu 


Sex 


Arg 


iJer 


Lys 


Ser 


■fhr 


lie 


Glu 


Leu 


Tyr 


Asn 


Ma 




ray 


Tl£ 


Cys 


He 


Leu 


865 










670 










875 










880 


Aig 


Glu 


Glu 


Thr 


His 
885 


Ser 


Glu 


Lys 


Thr 


Glu 
890 


He 


A^p 


Val 


Ser 


Arg 
895 


Leu 


Asn 


r\c r 


Giy 


Val 

900 


Tyr 


Leu 


He 


Lys 


Val 
905 


Val 


Gly 


Gly 




Lys 
910 


Thr 


Thr 


Thr 


Glu 


Lys 
915 


Vol 


Glu 


He 


Lys- 


Arg 
920 


Pro 

















(2) INFORMATION FOR SEQ ID HO: 470 

(i) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 593 anu.no acids 
fB) TYPft: amino acid 
{ D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

I A) ORGANISM: Porphyromoiwsr gingival le 

(lxl JfEATURE : 

(A) NAME/ KEY: misc__f eature 
(D) LOCATION 1...593 





(xi] 


1 SEQUENCE DESCRIPTION: SEQ ID MO: 470 










Mot 


Asn 


Se r 


He 


Met 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


AJs 


1 








5 










10 










15 




Leu 


Ser 


Val 


Ser 


Ser 


Val 


Cyc 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Tnr 








20 










25 










30 






Lys 


Arg 


Pro 


Asp 


Thr 


Leu 


Arg 


Arg 


Glu 


Leu 


Tnr 


He 


Val 


Asn Asp 


Gin 






35 










40 










45 








Thr 


Val 


Glu 


Met 


Glu 


His 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 




50 










55 










60 










He 


Glu 


Pro 


Arg 


Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


65 










70 










75 










80 


Fhe 


Gly 


Phe 


Val 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 










85 










90 










95 




Am 


He 


Leu 


Pro 


Thr 


Glu 


Gly 


His 


Met 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 








100 










105 








110 






He 


Gly 


He 


Gly 


His 


Thr 


Leu 


A.«tn 


Gin 


Arg 


Met 


Asp 


Ala 


Gly 


Tyr 


Arg 






115 










170 










125 








Leu 


He 


Asp 


Ala 


Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


T.eu 


Phe 


Leu 


Ser 


Tyr 


Arg 




130 










135 










140 










Gly 


Met 


Lys 


Ser 


Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp Arg 


Lyr 


145 










150 










155 










160 


Arp Arg 


Arg 


Met 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg Arg 


Pro 


Ser 










165 










170 










175 




Phe 


Val 


Leu 


Ala 


Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


Asn 


Asn 








180 










185 










190 






Tyr 


Gly 


Arg 


Gly 


Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 






195 










200 










205 








Thr 


Pro 


Val 


Thr 


Pro 


Gin 


Met 


Asp 


Asn 


Gly 


Thr 


His 


At- n 


Val 


Arg 


Val 




210 










215 










220 










Tyr 


Leu 


Gly 


Ala 


Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


225 










230 










235 










240 


Phe 


Phe 


Arg 


Ser 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Met 


hyp 


Ala 


Leu 




















250 










255 




Thr 


Glu 


His 


Thr 


Pro 


Glu 


Leu 


Asn 


Val 


Thr 


I let 


Ser 


Asn 


Glu 


Leu 


Ser 








260 










265 










2"C 






Asp 


Asp 


lit* 


Lys 


Leu 


Gly 


Val 


Glu 


V«l 


Arg 


Thr 


Gly 


Gly 


La:: 


Fhe 


Fh- 






275 










280 










285 








Ala 


hys 


Asn 


Ser 


Glu 


Met 


He 


Gin 


Thr 


Gly 


Val 


Leu 




Glu 


Thr 


Afp 




290 










295 










30O 










Arg 


Af n 


Leu 


Tyr 


Tyr 


Val 


Glu 


Gly 


Ala 


Pro 


Thr 


lis 


Gly 


Phe 


Val 


Gly 


305 










310 










315 










320 


Asp 


Ser 


Asp 


As i» 


Met 


Gin 


Trp 


Asn 


He 


Gin 


Ala 


Gly 


Val 


Gly 


He 


Ser 










325 










330 










335 




Ser 


His 


Phe 


Gly 


Ala 


Lys 


Gly 


Arg 


Leu 


Phe 


Phe 


Trp 


Pro 


Lys 


I.SU 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCI7AU98/01023 



419/490 

340 345 350 



Ala 


Ser 




Ser 


lie 


Phe 


Pro 




Trp 


Arg 


Val 


Tyr Ala 


Lys Ala 


Phe 






355 










360 








365 






Gly Gly 


Va 1 


lie 


rg 


Asn 


ni *- 


Leu 


Ala 


Asp 


Val 


Met Gin 


Glu Glu 


Met 




37 0 










37 5 










380 






Pro 


Tyr 


Leu 


Met. 




Ash 


Thr 


He 


Val 


Leu 


Pro 


Ser Arg 


Asn Ala 


Leu 


385 










390 










395 






400 


Thr 


Ala 




Leu 




Val 




Gly 


Asn 


lie 


Ala 


Asp Val 


Val Arg 


Ilet 




















410 






415 




Glu 


Val 


Tyt 


Gly 


Asp 


Phe 


Scr 


Lys 


Leu 


Thr 


Gly Val Pro 


Phe Tyr 


Thr 








420 










425 








430 




Pro 


Thr 


Leu 


Pro 


Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr Gin 


Tyr Asn 


Val 






435 










440 








445 




Ser 


Phe 


Leu 


Pro 


He 


Tyr 


Ala 


Asp 


Gly 


Ser 


Arg 


Trp Arg Ala Gly Gly 




450 










455 










460 






Lys 


I.cu 


Glu 


Tyr 


Ser 


Tyr 


Arg 


Asp 


Met 


Leu Arg 


Phe Leu 


Val Asp 


Ala 












470 










475 






4 80 


Ser 


Tyr 


Gly 


Ly* 


Trp 


Asn 


Lsu 


A?p 


Gly Gly 


Leu 


Val Ala 


Ser Met 


Gin 










485 










4 90 






4 95 




Pro 


Asp 


Leu 


lie 


Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His Pro 


He Ala 


Pro 








500 










605 






510 




Leu 


Asp 


Val 


Arg 


Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly Arg 


Tyr Arg Tyr 






bib 










520 








525 






Ser 


Fhe 


Gly 


Ser 


Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


He Gly Asn Val 


His 




530 










535 








540 






Leu 


Leu 


Ser 


Ala 


Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys Asn 


Lea Scr 


Lev. 


545 










550 










555 






560 


Tyr 


Leu 


Lyr 


lie 


Asp 


Asn 


Met 


Leu 


Ala 


Glu 


Thr 


Thr Glu 


Leu lie 


Giy 










S65 










570 






57 5 


Tyr 


Tyr 


Pro 


I let 


Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly Phe 


Ssr Trp 


Thr 








580 










585 








590 




Ph- 





























(2) 1 11 FORI 1AT I ON FOR SEQ ID MG:47l 

(i) SEQUF.MCE CHARACTERISTICS: 

(A) LENGTH: 589 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(viJ ORIGINAL SOURCE: 

(Al ORGAN I St 1: Porphyromonas gingival* s 

FEATURE : 

(A) HA! IE/ KEY: misc_feature 

(B) LOCATION 1...589 



(>:i) SEQUENCE DESCRIPTION: SEQ ID MO: 471 



Met 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Mat 


Ala 


Leu 


Ser Val 


Ser 


1 








5 










10 








15 




Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 


Lys 


Arg Pro 


Asp 








20 










25 










30 


Thr 


Leu 


Arg 


Arg 


Glu 


Leu 


Thr 


lie 


Val 


Asn 


Asp 


Gin 


Thr 


Val Glu 


Met 






35 










40 










4i> 






Glu 


Hi- 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu Pro 


Arg 




50 










55 










€0 






Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


Phe 


Gly Phe 


Val 


65 










70 










75 






80 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 


Asn 


He Leu 


PlL> 










85 










90 








95 




Thr 


Glu 


Gly 


His 


Met 


Lyn 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 


He 


Gly Il- 


Gly 








100 










105 










HO 


His 


Thr 


Leu 


Asn 


Gin 


Arg 


Met 


Asp 


Ala 


Gly 


Tyr 


Arg. 


Leu 


He Aap Ala 






115 










120 










125 






Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arg 


Gly Met Lys 


Ser 




130 










135 










140 








Ala 


Phe 


Asn 


Thr 


Gly 


Arp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp Arg Arg Met 


145 










150 










155 








160 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val Leu 


Ala 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU9S701023 



420/ 490 











1 65 










1 70 










175 




Thr 


Gly 


L-u 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


A^n 


Asn 


Tyr 


Gly 


Arg 


Gly 








1B0 










185 










1 90 






Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 


Thr 






1 95 










200 










205 








Pro 


Gin 


Met 


A<F P 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu 


Gly 


Ala 




210 










215 










220 










Lys 


Asn 


Asp 


Val 


Tl e 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


Phe 


Phe 


Arg 


Ser 


225 










230 










235 










240 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Het 


Lye 


Ala 


L-3U 


Thr 


Glu 


Hi? 


Thr 










245 










250 










255 




Pro 


Glu 


Leu 


Asn 


Val 


Thr 


Met 


Ser 


Asn 


Glu 


Leu 


Ser 


Asp 


Asp 


He 


Lys 








260 










265 










270 






Leu 


Gly 


Val 


Glu 


Val 


Arg 


Thr 


Gly 


Gly 


Lsu 


Phe 


Phe 


Ala 


I.y3 


Asn 


Ser 






275 










280 










285 








Glu 


Met 


T! ~ 


Gin 


Thr 


Gly 


Val 


Leu 


Ser 


Glu 


Thr 


Asp 


Arq 


Asn 


Leu 


Tyr 




290 










235 










300 










Tyr 


Val 


Glu 


Gly 


Ala 


Pro 


Thr 


He 


Cli- 


Phe 


Val 


Gly A.sp 


Ser 


Asp 


Asn 


305 










310 










315 










32 0 


Met 


Gin 


Ti'p 


A^n 


lie 


Gin 


Ala 


Gly 


val 


Gly 


He 


Ser 


Ser 


His 


Phe 


Gly 










325 










330 










335 




Ala 


j>ys 


Gly 


Arg 


Leu 


Phe 


Phe 


Trp 


Pro 


Lys 


Leu 


Asp 


Ala 


Ser 


Leu 


Ser 








340 










345 










350 






Tie 


Phe 


Pro 


Ser 


Trp 


Arg 


Val 


Tyr 


Ala 




Ala 


Phe 


Gly 


Gly 


Val 


He 






355 










36C 










365 








Acg 


At* u 


Gl> 


Leu 


Ala 


A£p 


Val 


Met. 


Gin 


GlU 


Giu 


Ilet 


Pro 


Tyr 


Leu 


Met 




370 










37 5 










380 










Pro 


Am 


Thr 


He 


Val 


Leu 


Pro 


Ser 


Arg 


Asn 


Ai a 


Leu 


Thr 


Ala 


Gin 


Leu 


385 










390 










395 










400 


Gly 


Val 


Ly^ 


Gly 


Asm 


He 


Ala 


Asp 


Val 


Val 


Arg 


Met 


Glu 


Val 


Tyr 


Gly 










4 05 










410 










415 




Asp 


Phe 


Ser 


Lys 


Leu 


Thr 


Gly 


Val 


Pro 


Phe 


Tyr 


Thr 


Pro 


Thr 


Leu 


Pro 








420 










425 










430 






Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr 


Gin 


Tyr 


Am 


Val 


Ser 


Phe 


Leu 


Pro 






435 










440 










445 








He 


Tyr 


Ala 


Asp 


Gly 


S-r 


Arg 


Trp 


Arg 


Ala 


Sly 


Gly 


Lys 


Leu 


Glu 


Tyr 




450 










455 










460 










Ser 


Tyr 


Arg 


Asp 


Met 


Leu 


Arg 


Phe 


T.ou 


Vnl 


Arp 


Ala 


Ser 


Tyr 


Gly 


Lys 


465 










470 










475 










4B0 


Trp 


Asn 


Leu 


Asp 


Gly 


Gly 


Leu 


Val 


Ala 


Ser 


Met 


Gin 


Pro 


Asp 


Leu 


He 










485 










490 










495 




Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His 


Pro 


He 


Ala 


Pro 


Leu 


Asp 


Val 


Arg 








500 










505 










510 






Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly 


Arg 


Tyr 


Aru 


Tyr 


Ser 


Phe 


Gly 


Ser 






515 










520 










525 








Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


He 


Gly 


Asn 


Yal 


His 


Leu 


Leu 


Ser 


Ala 




530 










535 










640 










hs:p 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys 


Asn 


Leu 


Ser 


Leu 


Tyr 


Leu 


Lys 


He 


545 










550 










555 










550 


Asp 


Asn 


HeL 


Leu 


Ala 


Glu 


Thr 


Thr 


Glu 


Leu 


He 


Gly 


Tyr 




Pro 


Met 










565 










570 








575 




Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly 


Phe 


Scr 


Trp 


Thr 


Phe 














S90 










585 















(2) INFORMATION FOR. SF.Q ID HO: 47 2 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 6 amino acids 
\B) TYPE: amino acid 
(D) TOPOLOGY; linear 

Ui) MOLECULE TYPE: protein 

tlii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qinalvalis 

tix> FEATURE: 

(A) HAHE7 KEY : mlsc feature 
( D ) LOCATION 1 . - . 3 4 5 

<xi) SEQUENCE DESCRIPTION: SEQ ID WO: 472 

Met Met Glu Lys Cys He Phe Ala Hi.? Tyr Pro His Asn Leu Val Phe 



SUBSTITUTE SHEET (Rale 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



421/ 490 



1 




























IS 




Met 


Ile 


Arq 


Lys 


His 


Phe 


Gly 


Ily 


lie 


eu 




Phe 


Leu 


Scr 


Le\J 


al 








20 










25 










30 






Phe 


Ser 


Ala 


Gly 


Alo 


Gin 


Gin 


Gl u 


Ly.<? 


In 




Fhe 


His 


Phe 


Leu 


Asn 






35 










40 










45 








Leu 


Pro 


Ala 


r 


Ala 


Gin 


Ala 


Leu 


Ala 


Al a 




Gly Lys 


Al* 


lie 


Thr 




SO 










55 










60 










lis 


Val 


Afp 




Asn 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 


65 










70 










75 










80 


Gly 


'»yr 


Glu 


Ser 


Gly Gly 


Arg 


Ala 


Phe 


Leu 


Ser 


Tyr 


Leu 


Tyr 


Tyr 


Met 










85 










90 










95 




Ser 


Gly 


Ser 


Hi f 


Ilet 


Gly 


Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val 


sly 


Glu 








100 










105 










110 






Arg 


Gly Met 


Trp 


Gly Val 


Gly Met 


Arg 


Phe 


l,eu 


Asn Tyr 


Gly 


Ser 


Met 






115 










120 










125 








Gin 


Gly 


Tyc 


Asp 


Gin 


Asn 


Ala 


lie 


Ala 


Thr 


Gly 


Ser 


Phe 


Ser 


Ala 


Ser 




130 










135 










140 










Asp 


He 


Ala 


Val 


Gin 


Gly 


Phe 


Tyr 


Ser 


HI 3 


Glu 


Leu 


Ser 


Asn 


His 


Ph^ 


1-15 










150 










1.55 










ISO 


Arg 


Gly 


Gly 


Val 


Ser 


Leu 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


lie 


Glu 


Thr 


Tyr 










165 










170 










175 




Sec 


Sec 


Ph* 


Gl* 


Leu 


Gly 


Val 


Asp Val 


Gly 


lie 


Ser 


Tyr 


Tyr 


Asp 


Asp 








190 










185 










190 






Asp 


Lys 


my 


Tyr 


Ser 


Ala 


ser 


Ala 


Leu 


Phe 




Asn 


Val 


Gly Ala 


Gin 




195 










200 










205 








Leu 


Lysr 


Gly 


Tyx 


Asn 


Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Asp 


Trp 


Asp 


The 


'Jin 




21 0 










215 










220 










Leu 


Glx 


Phe 


Ser 


Arg 


Ser 


Phe. 


lie 


Asn 


Ala 


Pre 


Phe 


Arg 


Leu 


His 


lis 


225 










230 










235 










240 


Thr 


Leu 


Phe 


Asn 


Leu 


Asn 


Pro 


His 


Tyr 


Phe 


T.y^ 


ACJ 


Leu 


Val 


Pro 


Arq 










245 










250 








255 




Asp 


Leu 


Ser 


Lys 


Met 


Gin 


Lys 


Phe 


Leu 


Arg 


His 


Phe 


Ser 


He 


Gly Ala 








260 










265 










270 






Glu 


Phe 


Thr 


Pro 


Ser 


Glu 


Axg 


Phe 


Trp 


val 


Gly Leu 


Gly 


Tyr 


Thr 


Pro 






275 










280 










285 








Gin 


He 


Ala 


Gin 


Asp 


Phe 


Glu 


Val 


Glu 


Gly 


Gly Asn 


Lys 


Trp 


Gly Gly 




2 90 










295 










300 










Leu 


Set 


Ala 


Gly Val 


Gly 


Phe 


Thr 


Ser Gly 


Val 


Val 


Arg 


Val 


Gl y 


Val 


30S 










310 










315 










320 


Ser 


Al a 


Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


net 


Cys 


Ser 


Val 










325 










330 








335 




Gly 


lie 


Acq 


Leu 


Asp Asp 


Lys 


Ser 


lie 


Phe 















340 345 



(2) IH FORMAT TOM FOR SEQ ID MO: 47 3 

(i) SEQUENCE CHARACTERISTICS: 

tAJ LENGTH-. 34 5 amino acids 
(B> TYPE: amino acid 
{ D) TOPOLOGY; linear 

(ii) MOLECULE TYPE; protein 

(ill) HYPOTHETICAL s YES 

|vi) ORIGINAL SOURCE: 

(A) ORGAN I SH : Porphyromonas yinuivalis 

ilr.) FEATURE: 

(Al HA! IE/KEY: miscf eature 
(D) LOCATION 1 . . .34 5 



<xi) SEQUENCE DESCRIPTION: SEQ in HO: 473 



lleL 


Glu 


Lys 




Hi 


Fhe 


Ala 


His 


Tyr 


Pro 


His 


Asn 


Leu 


Val 


Fhe 




1 








5 










10 










15 




He 


Arg 


Lys 


His 


Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Ser 


Leu 


Val 


Phe 








20 










25 










30 






Ser 


Al a 


Gly 


Ala 


Gin 


Gin 


Glu 


Lys 


Gin 


Val 


Phe 


His- 


Phe 


Leu 


Asn 


t.eu 






35 










40 










45 








Pro 


Aia 


Thi 


Ala 


Gin 


Ala 


Leu 


Ala 


Ala 


Gly 


'Sly 


Lys 


Ala 


He 


Tnt 


lis 




50 










55 








60 










Val 


Asp Asp 


Asn 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 


Gly 


65 










JO 










75 










30 


Tyr 


Glu 


Ser 


Gly 


Gly 


Arq 


Ala 


Ph« 


Leu 




Tyr 


Leu 


Tyr 


Tyr 


Met 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



25 



40 



50 



55 



WO 99/29870 

422/490 



05 9Q 



Gly Ser His Hot r,ly Asn Ala Cys Tyr Ser Ser Val Gly Glu Axg 

100 105 no 

5 Gly Met Trp Gly V.l Gly „. t Arg Phe Leu Tyr GJy Sffr „ et ^ 

Gly Tyr Asp Gln Asn Ala lie «! Thr 6ly Ser phs ^ ftla Ser Agp 
II* Al* V.l Gin Gly Phe Tyr Ser 111, Glu Leu S« Asn Hi, Phe Arg 
10 Gly slr Val Ser Leu Lys A1 , Leu Tyz , „, Glu Thr Tyr *™ 

Ser Phe Giy L*u Gly Vol Asp Val Gly lie Ser Tyr Tyr Asp £p Asp 

185 X90 
Lys Gly Tyr Ser Ala Ser Ala Leu Phe Ly S Asn Val Gly Ala Gin Leu 

200 205 
Ly- Sly Tyr Acn Glu G lu Arg Glu Pro Leu Asp Trp Asp Phe Gin Leu 

Gly Phe ser Arg Ser Phe lie A,„ Ala Pro Phe Arg Leu His He Thr 

20 Leu Phe Asn Leu Asn Pro HI- Tyr Phe Lys Arg Leu Val Pro Arg Asp 

Leu Ser Lys Met Gin Lys Ph . Leu Arg His fhe s=r lle Gly Glu 

Phe Thr Pro Ser Glu Arg Phe Trp v" Gly Leu Giy Tyr Thr Pro Gin 

280 285 

lie Ala Gin Asp Phe Glu Val Glu Gly Gly Asn Lys Trp Gly Gly L.u 

^ 95 300 

3os Ma Gly val Gly ™; Thr Ssr G1 >- vai x* 1 Arg Val Gly Val S " 

30 Ala Ala Thr Tyr Hi, Pro Ala Ala Leu Ser Pne Met Cys S.r Val cly 

He Arg Leu Aep Asp Lys Ser lie Ph* 
340 345 

35 (2J IH FORMATION FOR SF.Q ID NOM74 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



3 3C 335 



Ui) MOLECULE TYPE: protein 

45 (ill) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(Ik) FEATURE: 

{A} NAME /KEY : mis c_ feature 
IB) LOCATION 1. . . 330 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 

Met He Arg Lys His Phe Gly He He Leu Gly Phe Leu Ser Leu Val 



Fh. Ser Ala Gly Ala Gin Gin Glu Ly, Gin Val Pne His Phe Leu A« 
60 Aia Gln M * JJ U Ala C,y Gly Ly, II, Thr 

He Val Asp Asp Asn Pro Gly Leu Ala Phe Glu Asn Pro Ala Leu L«u 
Gly Tyr Glu Ser Gly Gly Arg Ala Phe Leu Scr Tyr Leu Tyr Tyr net 

65 Ssr oly Set ias Gly Asn Ala cys ^ ^ a s ^ s ^ ^ ^ 80^ 

Arg Gly ,„ t jrp Gly Val Gly Met Arg Phe L eu Asn Tyr Gly Ser Met 

70 ^ **" A ^ Thf G1 " S *'~ S« Ala ser 

Asp lie Ala Val Gln Gly Tyr Ser H1 , fllu ^ u ^ ^ ^ 

Arg Gly Gly Val Ser Leu Lys Ala Leu Tyr Ser III He Glu Thr Tyr 
75 ,„ ser Phe Gly Leu Gly Val Asp Val Gly HI Ssr Tyr Tyr Affp £° 



1 



PCT/AU98/01023 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



423/490 











165 










170 










175 




Asp 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ser 


Ala 


Leu 


Phe 


Lys 


Asn 


Val 


Gly Ala 


Gin 








180 










185 










190 






Leu 


Lys 


Gly 


Tyr 


Asm 


Glu 


Glu 


Arg 


Glu 


Pro 


Lsu 


Asp 


Trp 


Asp 


Phe 


Gin 






195 










200 










205 






l.~u 


Gly 


Phe 


Ser 


Arg 


Ser 


Phe 


He 


Asn 


Ala 


Pro 


Phe 


Arg 


Leu 


His 


He 




210 










215 










220 








Thr 


Leu 


Phe 


Asn 


Leu 


Asn 


Pro 


His 


Tyr 


Phe 


Lys 


Arg 


Leu 


Val 


Pro 


Arg 












230 










235 










240 


Asp 


Leu 


Ser 


I '' } 'C 


Met 
245 


Gin 


Lys 


Phe 


Leu 


Arg 

250 


His 


Phe 


Ser 


He 


Gly 
255 


Ala 


Glu 


Phe 


Thr 


Pro 


Ser 


Glu 


Arg 


Phe 


Trp 


Val 


Gly 


Leu 


Gly 


Tyr 


Thr 


Pro 








260 










265 








270 






Gin 


He 


Ala 


Gin 


Asp 


Phe 


Glu 


Val 


Glu 


Gly 


Gly 


Asn 


Lys 


Trp 


Gly Gly 






275 










200 










205 






Leu 


Ser 


Ala 


Gly Val 


Gly 


Phe 


Thr 


Ser 


Gly 


Val 


Val 


Arg 


Val 


Gly 


Val 




290 










295 








300 






S<»r 


Al a 


Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


Met 


Cys 


Ser 


Val 


305 










31 0 










315 








320 


Gly 


lie 


Arg 


Lsu 


Asp 
325 


Asp 




Ser 


T3* 


Pne 
330 















(2) IN FORI IATTOH FOR SEQ ID NO: 475 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A) ORGAIJXSM; Porphyromonas gingival is 

( IX } FEATURE : 

(A) W Al IE/ KEY : misc_f eature 
(B> LOCATION 1...32 4 



(Xij SEQUENCE DESCRIPTION: SEQ ID 110:475 



Met 


Cys 


Leu 


Glu 


Pro 


lie 


He 


Ala 


Pro 


He 


Ser 


Ser 


Glu 


Leu 


Leu 


Glu 


1 








5 










10 










15 




Gin 


Glu 


Leu 


Thr 


Ala 


Asp Arg 


Phe 


Leu 


Arq 


Het 


Thr 


Asn 


Lys 


Ala 


Gly 








20 










25 










30 




Asn 


Glu 


He 


Tyr 


Val 


Phe 


Thr 


Ala 


Glu 


Glu 


Ala 


Pro 


Hi* 


Cys 


net 


Lys 






35 










40 










45 




G.LU 


Val 


Gly Arg 


Leu 


Arg 


Glu 


Clu 


Ala 


Phe 


Arg 


His 


Tyr 


Gly Gly Gly 




50 










55 










60 










Thr 


Gly 


Lys 


Ala 


He 


Asp t He 


Asp 


Glu 


Phe 


Asp 


Thr 


Met 


Pro 


Gly 


Ser 


65 










70 










75 








80 


Tyr 


Lys 


Gin 


Leu 


He 


Val 


Trp 


Asp 


Pro 


Gin 


Asn 


Lys 


Ala 


He 


Leu 


Gly 










85 










90 










95 


Gl y Tyr 


Arg 


Phe 


He 


Tyr 


Gly 


Arg 


Asp 


Val 


Ala 


Phe 


Asp Thr Asp 


Gly 








100 










105 










110 




Lys 


Pro 


Leu 


Leu 


Ala 


Thr 


Ala 


Gl u 


Met 


Phe 


Arg 


Phe 


Ser 


Asp 


Ala 


Phe 






115 










120 








125 






Leu 


His 


Asp Tyr 


Leu 


Pro 


Tyr 


Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val 




130 










135 










140 








iier 


Leu 


Gin 


Tyr 


Gin 


Ser 


Thr 


Arg 


Met 


Gly 


Thr 


Ly» 


Ala 


He 


Phe 


Val 


145 










150 










155 










160 


Leu 


Asp 


Asn 


Leu 


Trp 


Asp 


Gly 


He 


Gly Ala 


Leu 


Thr 


Val 


Val 


Asn 


Pro 


Glu 


Ala 






16S 










170 










1/5 


Leu 


Tyr 


Pho 


Tyr 


Gly 


Lys 


Val 


Thr 


Met 


Tyr 


Lys 


Asp 


Tyr 


Asp 








180 










185 










190 




Arg 


Arg 


Ala 


Arg 


Asn 


Leu 


He 


Leu 


Tyr 


Phe 


Leu 


Arg 


Lys 


His 


Phe 


Ser 






l<?5 










200 








205 








Asp 


Pro 


Glu 


Gly 


Leu 


Val 


Lys 


Pro 


He 


His 


Pro 


Let* 


Pro 


He 


Glu 


He 




21 o 










215 










220 










Ser 


Ala 


Glu 


Asp 


Glu 


Ala 


Leu 


Phe 


Ser 


Sqt 


Ser 


A?p 


Phe 


Asp Thr 


A.*n 


225 










230 










235 








240 


Tyr 


Lys 


Thr 


Leu 


Asn 


He 


Glu 


Val 


Arg 


Lys 


Leu 


Gly 


He 


Asn 


He 


Pro 










245 










25^ 










255 




Pro 


Leu 


Val 


Ser 


Ala 


Tyr 


He 


Ala 


L«u 


Ser 


Pro 


Glu 


net 


Arg 


Val 


Phn 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



424/ 490 



10 



15 



25 



30 



55 



60 



260 265 2/0 

Gly Thr Ala Val Asn Glu Ser Phe Gly Glu Val Glu Glu Thr Gly He 

27 L 280 28 S 

Phe lie Ala Val Gly Lys He Leu Glu Glu Lys Lys Gin Arg Hi* He 

295 300 
Glu Ser Phe Tic Leu Ser Arg Asn Glu Lys Lys Gly Leu Asp Ser Ser 
310 31S 320 

Asn Gly Arg Ser 

(C) INFORMATION FOR SEQ ID NO:«l76 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: £47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



MOLECULE TYPE: protein 
20 Uii) HYPOTHETICAL: YES 



(vi) ORIGIIIAr. SOURCE : 

lAI ORGANISM: Porphyromonas gingival Ls 

(ix) FEATURE : 

(A) HAMF./KEY: miscjeature 

( B) LOCATION 1...54 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 47 6 



Met Lys Thr He Val Arg Tyr Ser Arg Leu Pro Val Ala Leu Phe Phe 

i. 10 is 

Cys Leu Leu Gly Ala Val His Leu Ser Val Glu Ala Gin Met Leu A^n 
or: 20 2S 30 

35 Thr Pro Phe Glu Leu Ser Asp Gin He Val Leu Ser Pro Thr Glu Arg 

Gin Tyr Arg Glu He Cys Val Gin Thr Lys Glu Lys Arg Gly Ala Asp 
DU 55 60 
40 65 U ^ ^ ^ ASP Ala Val 

Gly Ser Ala Tyr Gly Asp He Ala Gly Asp Tyr Leu Pro Tyr Asn Gly 

35 9 ° «5 

Am Asn Tyr Ser Ser Leu Ser Leu Glu Ser Gly Gly Arg He Ser Val 

An 100 10S 11D 

45 Arg Asn Tyr Gly Thr Leu Gin Gly Sor Ala Ser Tyr Ser Arg Gly Met 
ia ~* 120 

130 Arg TJe Gly Trp Ala Leu A£g Ai * n AJLa Glu ^ T * r ^ 

50 ill ^ Val A f? Ser Thr G1 V ^p Tyr His Phe Glu Asp 

150 155 ICO 

Tyr Arg Leu Ala Gly Tyr Tyr Ser Phe Arg Ala Gly Arg Leu Pro L-u 

16S 170 175 

Gly He Gly Phe Ser Tyr Arg Gly Glu Val Ala Tyr Arg Leu Thr Asp 

100 190 

Pro Arg Thr Thr Asn Thr Thr Gly Ala Leu Glu Leu Ser Cys Ala Thr 

195 200 205 

Ser Leu Thr Leu Pro Arg Glu Asn Arg Leu s- r Leu Ser Ala Ala Tyr 

215 220 

Leu Tyr HI* Arg Gin His Leu Thr Gin Tyr Asn Trp Arg Pro Gly Gin 



235 240 



Gin Asp Lys Phe Phe Val Ser Tyr Gly Phe Gly Gin Val Asp Val Ser 

Asn Ser Pro Tic Trp Phe Gly He Ser Arg Met Asn Tyr Val Asn Gly 
fi c 260 265 270 

TrP Ly£ Ser Ser Ar * Leu Asp Thr Arg Arg Gly Asp Ala He Gly 

25 280 28s 

Leu Asp Tyr Ser Gly Tyr Phe Leu Asp Thr Glu Glu Arg Ser Ser He 
_ , 295 300 

70 305 ^ Ti ' r Arq L * U <«V Ser Trp 

315 320 



His Leu Ser Asp Phe Asp Phe Ser Phe Sor Ala 



A/*p Tyr Ala Leu Arg 



325 330 33c 

Gin sly He Glu Arq Il s Tyr Glu Asp Tyr Lys Pro Asp Asp A--=n Tyr 

75 Hxs II. Ti r Asp Leu Arg II. L t « II, Arg Ar , 7zp , tat ^ 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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355 










360 










365 








Asn 


Glu 


Phe 


Ser 


Ala 


Gin 


Ala 


Gin 


Ala 


Ser 


Tvr 


Arg 


He 


Arg 


Thr 


Asp 




370 










375 










380 






Arg 


Gly 


Cys> 


Ala 


Leu 


Arg 


Val 


Ser 


Ala 


Gly 


Ser 


Asp 


Phe 


Tyr 


Gly Tyr 


385 










390 










3 95 






400 


Asp 


Glu 


Thr 


Tyr 


Arg 


Lys 


His 


Gly 


His 


His 


Thr 


Ilet 


Ser 


Gly Met 


Leu 










405 










410 










415 




.Aro 


Pro 


Phe 


Ala 


<Uy 


He 


Al* 


Tir 


Asp 


His 


Ala 


Cli- 


Ser 


I.ya 


Leu 


Asp 


Phe 


oiy 




420 










425 










430 




Leu 


Ser 


Leu 


Ser 


Ala 


Ala 


Tyr 


Arg 


list 


val 


Leu 


Thr 


His 


Ser 






435 










440 










445 








Tyr 


Lys 


lie 


Arg 


Thr 


He 


Gin 


Lyz 


Glu 


Gin 


Leu 


Asp 


Tyr 


Gin 


Leu 


Ala 




4 5<"> 










455 










460 








Tyr 


Leu 


Pro 


Tyr 


Al a 


Tyr 


Arg 


Asn 


Arg 


Glu 


Gly 


Val 


Glu 


Val 


Arg 


Ser 


465 










470 










475 








480 


Ser 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


Het 


Gin 


Asr. 


Thr 


His 


Arg 


Leu 


Met 


Thr 


Glu 








485 










4 90 








4 95 




Leu 


Arg 


Leu 


Tyr 


Cly 


Asp 


Leu 


flet 


Lys 


Arg 


Lys 


Asp 


Gly 


He 


Ala 








500 










505 










510 






Tyr 


Gly 


Lys 


Thr 


Pro 


Gly 


Val 


He 


Ser 


HJs 


Tie 


Leu 


Ser 


Asp 


Pro 


Gin 






515 










S20 










525 






Ala 


Glu 


Ary 


Thr 


Scr 


Gly 


His 


Thr 


He 


Gly 


Ala 


He 


Cys 


Asn 


He 


Ser 




530 










535 










540 






Tyr 


Leu 


Phe 




























545 
































<2) 


illfOKMATION 


FOR 


SEQ 


ID NO:477 

















(1) SEQUHNCE CHARACTERISTICS: 

(A) LEIIGTH: 750 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



tii) HOLECULE TYPR: protein 

(iii) HYPOTHETICAL; YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroroonas gingival is 

(ix) FEATURE: 

(A J NAME/KEY: misc_reature 
<B) LOCATION 1...750 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 477 



Met 




Lys 


Leu 


His 


Met 


He 


Ala 


Ala 


Leu 


Ala 


Val 


Leu 


Pro 


Phe 


Cy* 


1 


Thr 






c 










10 










15 




I.eu 


Ala 


Gin 


Ala 


Pro 


Val 


Ser 


Asn 


Scr 


Glu 


Tie 


A3p 


Ser 


Leu 


Ser 








20 










25 








30 






Asm 


Val 


Gin 

35 


Leu 


Gin 


Thr 


Val 


Gin 
40 


Val 


Val 


Ala 


Thr 


Arg 
45 


Ala 


Thr 


Ala 


Lys 


Thr 


Pro 


Val 


Ala 


Tyr 


Thr 


Asn 


Val 


Arg 


Lys 


Ala 


Glu 


Leu 


Ser 


Lys 




50 










S5 










60 








Ser 


Asn 


Tyr Gly Arg 


Asp 


He 


Pro 


Tyr 


Leu 


Leu 


Met 


Leu 


Thr 


Pro 


Ser 


65 










70 










75 








80 


Val 


Val 


Ala 


Thr 


Ser 


Asp 


Ala 


Gly 


Thr 


Gly 


He 


Gly 


Tyr 


Ser 


Gly 


Phe 


Arg 








05 










90 








95 




Val 


Arg 


Gly Thr 


Asp 


Ala 


Asn 


Arg 


He 


Asn 


Ile 


Thr 


Thr 


Asn 


Gly 








100 










105 










110 






Val 


Pro 


Leu 


Asn 


Asp 


Ser 


Glu 


Ser 


Gin 


Ser 


Val 


Phe 


Trp 


Val 


Asn 


Met 


Pro 




115 










120 










125 








Asp 
130 


Phe 


Ala 


Ser 


Ser 


He 
13E 


Glu 


Asp 


Leu 


Gin 


Vol 

140 


Gin 


Arg 


Gly Val 


Cly 


Thr 


Ser 


Thr 


Asn 


Gly Ala 


Gly 


Ala 


Phe 


Gly 


Ala 


Ser 


Val 


Asn 


Met 


145 


Thr 








150 










JS5 








150 


Ary 


Asp 


A."n 


Leu 


Gly 


Leu 


Ala 


Pro 


Tyr 


Gly 


Arg 


Val 


Asp 


Lou 


Ser 


Gly 


Giy 






165 










170 








175 




Ser 


Phe 


Gly 


Thr 


Phe 


Arg 


Arg 


Ser 


Val 


Lys 


Leu 


Gly 


Ser 


Gly 


Arg 


He 




180 










1B5 










190 






Gly 


Arq 


His 


Trp 


Ala 


Val 


Asp 


Ala 


Arg 


Leu 


Ser 


Lys 


He 


Gly 






195 










200 








205 






Ser 


Asd 


Gly 


Tyr 


Val 


Asp 


Ara 


Gly 


Ser 


Val 


Asp 


Leu 


Lys 


S*r 


Tyr 


Phe 


Ala 


21u 










215 










220 






Gin 


Val 


Giy 


Tyr 


Phe 


Gly 


Ser 


Asn 


Thr 


Ala 


Leu 


Arg 


Phe 


He 


Thr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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225 










230 










235 










240 


Phe 


Gly 


Gly 


L Y s 


Glu 


Val 


Thr 


Gly 


He 


Ala 


Trp 


Asn 


Gly 


Leu 


Ser 


Lys 










24 5 










250 










255 




Glu 


Asp 


Glu 


Ala 


Lys 


Tyr 


Gly 


Arg 


Arg 


Tyr 


Asn 


Ser 


Ala 


Gly 


Leu 


Ilet 








260 










265 










270 






Tyr 


Val 


Asp 


Ala 


Gin 


Gly 


Val 


Pro 


His 


Tyr 


Tyr 


His 


Asn 


Thr 


Asp 


Asn 






275 










280 










285 








Tyi 


Glu 


Gin 


Ary 


His 


Tyr 


His 


Al a 


He 


Met 


Thr 


His 


Ser 


Phe 


Ser 


Pro 




290 










295 










300 










Ser 


Val 


lie 


Leu 


Asn 


Leu 


Thr 


Ala 


His 


Tyr 


Thr 


Al* 


Gl* 


Tyr 


Gly 


Tyr 


305 










310 










315 








320 


Thr 


Asp 


Glu 


Tyr 


Arg 


Thr 


Gly 


Arg 


Lys 


Leu 


Lys 


Glu 


Tyr 


Ala 


Leu 


Gin 










32b 










330 










335 




Pro 


Tyr 


Val 


Glu 


Asn 


Ser 


Val 


Thr 


Val 


Lys 


Lys 


Thr 


Asp 


Leu 


He 


Arg 








340 










345 










350 






Gin 


Lys 


Tyr 


Lsu 


Asp 


Asn 


Asp 


Phe 


Gly 


Gly- 


Leu 


He 


Gly Ser 


Leu 


Asn 






355 










360 










365 








Trp 


His 


Thr 


Gly 


Ala 


Trp 


Asp 


Leu 


Gin 


phs 


Gly Ala 


Ser 


Gly 


Asn 


He 




37 0 










315 










350 










Tyr 


Lys 


eiy 


Asp 


His 


Phe 


Gly 


Arg 


He 


Thr 


Tyr 


He 


Lys 


Lys 


Tyr 


Asn 












390 










3°5 










400 


Gin 


Pro 


Leu 


Ala 


Pro 


Asp 


Phe 


Glu 


Tyr 


Tyr 


Arg 


Asn 


Arg 


Ala 


Asp 


Lys 










405 










410 










415 




Arg 


Glu 


Gly 


Ala 


Ala 


Phe 


Ala 


Lys 


Ala 


Asn 


Trp 


Gin 


He 


Thr 


Pro 


Glu 








420 










425 










430 






Leu 


Asn 


Ket 


Tyr 


Ala 


Asp 


Leu 


Gin 


T^r 


Arg 


Thr 


He 


Gly 


Tyr 


Thr 


lie 






435 










440 










445 








Afn 


Gly 


He 


Thr 


Afc-p 


Glu 


Tyr 


Asp 


Glu 


Val 


Gin 


Gly 


Ser 


H-t 


Gin 


His 




450 










455 










460 










U- 


Asp 


Leu 


Asp 


Lys 


Thr 


Phe 


Arg 


Phe 


Lsu 


Asn 


Pro 


Lys 


Ala 


Gly Leu 


465 










470 










475 










480 


Thr 


Tyr 


Ser 


Phe 


Asp 


Asp 


Ala 


His 


Thr 


Ala 


Tyr 


Ala 


Ser 


Val 


Ala 


Val 










485 










490 










495 




hi a 


His 


Arg 


Glu 


Pro 


Asn 


Arg 


Thr 


Asn 


Tyr 


Thr 


Glu 


Ala 


Gly 


He 


Gly 








500 










505 










53 0 






Gin 


Tyr 


Pro 


Thr 


Pro 


Glu 


Arg 


Leu 


He 


Asp 


Tyr 


Glu 


Leu 


Gly Tyr Axg 






515 










520 










525 








Tyr 


Ala 


s-r 


Pro 


Leu 


Leu 


Ser 


Ala 


Gly 


Val 


Gly 


Leu 


Tyr 


Tyr 


Met 


Gin 




530 










535 










540 










Tyr 


Lys 


Asp 


Gin 


Leu 


Val 


Leu 


Asp 


Gly 


Arg 


Leu 


Sfer 


Asp 


Val 


Gly 


Gin 


545 










550 










555 










560 


list 


Leu 


Thr 


Ser 


Asn 


Val 


Pro 


Asp 


Ser 


Tyr 


Arg 


Het 


Gly 


Leu 


Glu 


Leu 










565 










57 0 










575 




Thr 


Leu 


Gly 


Trp 


Gin 


He 


Lsu 


Pro 


Arg 


Leu 


Leu 


Arg 


Trp 


Asp Ala 


Ser 








580 










585 










590 






Phe 


Thr 


Uet 


Set- 


Arg 


As u 


Lys 


He 


Asp 


Arg 


Tyr 


Val 


Gin 


Tyr 


Thr 


Ser 






595 










600 










605 








Val 


Tyr 


Asp 


Ala 


Asp 


Tyr 


Asn 


Trp 


Leu 


Glu 


Leu 


Lys 


Glu 


Glu 


Thr 


Leu 




63 0 










615 










620 










Glu 


Ser 


Thr 


Asp 


lie 


Ala 


Tyr 


Ser 


Pro 


A-»n 


Val 


He 


Ala 


Gly 


Ser 


llet 


67.5 










630 










635 










640 


Leu 


Thr 


Leu 


Ser 


His 


Ala 


Gly 


Phe 


Glu 


Met 


Ala 


Trp 


Thr 


Ser Arg 


Phe 










645 










650 










655 




Val 


Ser 


Lys 


Gin 


Tyr 


Leu 


Asp 


Asn 


Thr 


Gin 


Acq 


Ser 


Asp Arg Met 


Leu 








660 










665 










67 0 






Sor 


Ser 


Tyr 


Trp 


Val 


Asn 


Asp 


Leu 


Arg 


Leu 


Gly 


Tyr 


Val 


Leu 


Pro 


Val 






675 










680 










685 








His 


Phe 


Val 


Lys 


Arg 


Val 


Ala 


Leu 


Gly 


Val 


Gin 


Leu 


Asn 


Asn 


Leu 


Phe 




690 










695 










700 










Asn 


Leu 


Met 


Tyr 


Ala 


Ser 


Asn 


Ala 


Tyr 


He 


Tyr 


Asp 


Ala 


Gly 


Tyr 


Val 


705 










710 










715 








720 


Gin 


Ala 


Ser 


Gly 


Glu 


Leu 


Ser 


Ala 


Tyr 


Ala 


Asp 


Leu 


Arg 


Tyr 


Tyr 


Pro 










725 










730 










735 




Gl n 


Ala 


Gly 


Phe 


Asn 


Ala 


Leu 


Gly 


Ser 


Leu 


Thr 


He 


Asp 


Phe 












740 










745 










750 






(2) 


INFORMATION 


FOR 


SEQ 


ID NO:47B 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 



MOLECULE TYPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



427/ 490 



(iiij HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAMISH: Porphyromonas gingival! s 

( i x ) FEATURF. : 

(A) NAME/KEY: miscf eature 
{ B ) LOCATION 1. . . 494 

(xi> SEQUENCE DESCRIPTION: SEQ ID HO: 478 

llet Lys Arg Arg Phe Leu Sex. Leu Leu Leu Leu Tyr lie Leu Ser S~r 

1 5 10 !& 

Tie Ser l.eu Scr Aln Gin Arg Phe Pro Met Val Gin Gly He Glu L-u 

20 2S 30 

Asp Thr Asp Ser Leu Phe Ser Leu Pro Lye Arg Pro Trp Arg Ala He 

35 40 4i> 

Gly l.ys "nr Tie Gly Val Asn Leu Ala Val Trp Gly Phe Asp Hi- E>h- 

50 55 60 

He llet Asn Glu Asp Phe Ala Asp He Ser Trp Gin Thr He Lys S-r 
65 70 7b go 

Asn Ph« Gin Thr Gly Phe Gly Trp Asp Asn Asp Lys Phe Val Thr Asn 

85 go 05 

Leu Phe Ala His Pro Tyr Hxs Gly Ser Leu Tyr Phe Asn Ala Ala Arq 

3 00 105 HO 

Ser Asn Gly Leu Ser Phe Arg His Ser Ala Pro Phe Ala Phe Phe Gl" 

115 120 125 

Ser Leu Met Trp Glu Leu Leu Met Glu Asn Glu Pro Pro S-r He A*n 

130 135 140 

Asp Leu Cys Ala Thr Thr He Gly Gly He Ala Leu Gly Glu Met Gly 
145 155 i6o 

His Arg Leu Ser Asp Leu Leu He Asp Asn Arg Thr Thr Gly Trp Glu 

16b 170 17c 

Arg Met Gly Arg Glu Val Ala He Ala Leu He Asn Pro Met Arg P»~ 

180 185 190 

Leu Asn Arg Leu Thr Ala Gly Glu Val Thr Ser Val Gly s~r Arg Ser 

195 200 205 

Gly Gin He Phe Gin Ser Val Pro He Asn He Val Val Asp Ala Glv 

210 215 220 

Phe Arg Phe Leu Ala Asp Lys Arg His Ala Arg Thr Gly Ala Thr Ala 
225 230 235 24Q 

Leu Thr Leu Asn Leu Arg Phe Asp Tyr Gly Asp Pro Phe Arg Ser Glu 

245 250 255 

Thr Phe Ser Pro Tyr Asp Phe Phe Gin Phe Lys Ala Gly Leu Ser Phe 

260 265 270 

Ser Glu Ser Gin Pro Leu Leu Ser Gin He Asn Leu He Gly He Leu 

- 7 " 200 2B5 

Ser Gly Cys Gin Leu Leu Ala His Glu Ara Thr Val Leu Val Glv G1-- 
295 300 ^ ' 

L-u Ph- Gin liis Phe Asp Tyr Tyr Asn Ser Glu Lys Arg He Ser Lv« 
305 310 315 

Asn Ser Glu Glu Val Leu Val Thr Pro Tyr Arg He Ser Gin Val Ala 

325 330 335 

Ala Leu Gly Gly Gly Leu He Phe Gin His His Gly Lys Phe Arg Arg 

3*0 345 350 

Arg Pro Leu Glu Leu Tyr Ala Glu Thr Tyr Leu Asn Val Val Pro M-t 

355 360 365 

Gly Ala Ser Leu Ser Asp His Tyr Asn Val Asp Aen Axg Asp Tyr Asn 

370 375 3B0 

Lru Gly Ser Gly Leu Ser Gly Lys Leu Tyr Leu Gly Ala Thr Tyr Asn 
385 390 395 400 

Asp Leu Trp Ser Trp Leu Leu Gly Val Glu Ser Tyr Arg Leu Tyr Thr 

40S 410 435 

Trp He Gly Tyr Glu Glu Pro His Gin Lys Asn Thr Asp Val Ser Ser 

420 425 430 

Phe Met Val Gin Gly Asp Glu Ser Lys Ala Arg L^u Leu Val Thr S-r 

435 440 445 

Scr Glu Fhe Ala Phe His Pro Gly Pro Trp His Val Ala He Val Ala 

450 455 46 Q 

Arg Arg Phe He Arg Lye Thr Ala Tyr Gin Phe Tyr Pro A.-n Val S-r 
4 f 5 470 475 480 

Phe Asp Thr Gly Asp He Gin Leu Arg Val Gly Phe His Ph« 
4 85 490 

(2) INFORMATION FOR SEQ ID NO: 47 9 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



WO 99/29870 PCT/AU 98/0 1023 

428/490 



fi> SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 2 94 ammo acidr 

(B) TYPE; amino acid 
{ D ) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

|vi) ORIGINAL SOURCE: 

\K\ ORGANISM; Porphyromonas gingival! s 



(ix) FEATURE: 
15 (A) NAME/ KEY: miscf eatucc 

{ D > LOCATION I. . .294 



(>.i) SEQUENCE DESCRIPTION: SEQ ID NO: 47 9 



Met 




Arg 


Leu 


He 


Val 


Phe 


Leu 


A.la 


Met 


Gly Gly Leu 


L^u 


Phe Thr 


1 








5 










10 










15 


Leu 


Ala 


Asn 


Ala 


Gin 


Glu 


Ala 


Asn 


Thr 


Ala 


Ser 


Asp 


Thr 


Pro 


Lys Lys 








20 










25 










30 


Asp 


Trp 


Thr 


lie 


Lys 


Gly 


Val 


Thr 


Gly 


Leu 


Asn 


Ala 


Ser 


Gin 


Thr Sar 






35 










40 










45 






Leu 


Thr 


Arn 


Trp 


Ala 


Ala 


Gly 


Gly 


Glu 


Asn 


Thr 


Val 


Ala 


Gly 


Asn Leu 




50 










55 










60 








Tyr 


Leu 


Asn 


He 


Asp 


Ala 


Asn 


Tyr 


Leu 


Lys 


Asp 


Lys 


Trp 


Ser 


Trp Asp 


65 










70 










75 








80 


Asn 


CI y 


Leu 


Arg 


Thr 


Asp 


Phe 


Gly 


Leu 


Thr 


Tyr 


Thr 


Thr 


Ala 


Asn Lys 










85 










90 










95 


Trp 


Ash 


Lys 


Ser 


Val 


Asp 


Lys 


He 


Glu 


Leu 


Phe 


Thr 


Lys 


Ala 


Gly Tyr 








100 










105 










110 




Glu 


lie 


Gly 


Ly-s 


nts 


Trp 


Tyr 


Gly 


Ser 


Ala 


Leu 


Phe 


Thr 


Phe 


Leu Ser 






115 










120 










125 






Gin 


Tyr 


Ala 


Lys 


Gly 


Tyr 


Glu 


Lys 


Pro 


Ser 


Asp 


His 


Leu 


Thr 


Gly Val 




130 










135 










140 








Lys 


His 


lie 


Ser 


Asn 


Phe 


Phe 


Ala 


Pro 


Ala 


Tyr 


Leu 


Thr 


Leu 


Gly He 


145 










150 










155 








160 


Gly Ala 


Asp 


Tyx 


Lys 


Pro 


Asn 


Glu 


Lys 


Phe 


Ser 


Leu 


Tyr 


Leu 


S~r Pro 










165 










170 










17 5 


Thr 


Thr 


Gly 


Lys 


Leu 


Thr 


Val 


Val 


Ala 


Asp 


Asp 


Tyr 


Leu 


Ser 


Ser Leu 








180 










185 










190 




Gly Ala 


Fhe 


Gly 


Val 


Lys 


Val 


Gly 


Glu 


T.ys 


Thr 


Met 


Phe 


Glu 


Leu Gly 






195 










200 










205 




Ala 


Leu 


Vat 


Val 


Gly 


Ser 


Ala 


Asn 


He 


Asn 


Leu 


Met 


Glu 


Asn 


Val Asn 




210 










215 










220 








Leu 


lie 


Thr 


Lys 


Ala 


Ser 


Phe 


Phe 


Scr 


Ala 


Tyr 


Thr 


Hi/? 


Asp 


Phe Gly 












230 










235 






240 


Asn 


lie 


Asp 


He 


Asn 


Trp 


Glu 


Ala 


Met 


Leu 


Ala 


Met 


Lys 


He 


Asrv Lys 










245 










250 










255 


Phe 


Leu 


Thr 


Ala 


Thr 


He 


Ala 


Thr 


Asn. 


Leu 


He Tyr 


Asp 


Asp 


Asp Val 








260 










265 










270 




Lys 


lis 


Asn 


Asp 


Gly 


Pro 


Lys 


He 


Gin 


Phe 


Lys 


Glu 


Val 


Val 


Gly Val 






275 










280 










285 




Gly Val 


Ala 


Tyr 


Thr 


Phe 






















2 90 





























{2) INFORMATION FOR SEQ ID NO: 4 B0 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 204 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

<iii) HYPOTHETICAL: YES 

70 (vi) ORIGINAL SOURCE: 

1A) ORGANISM: Porphyromonas gingival is 

li:<) FEATURE: 

(A) NAME/ KEY: miso_f eature 
75 (B) LOCATION 1...204 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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tx\) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



Met 


Lys 


Lys 


Met 


He 


Leu 


Ala 


Ala 


Thr 


Met 


T.eu 


1.SSU 


Ala 


Thr 


He 


Gly 


1 








5 










10 










15 




The 


Ala 


Afn 


Ala 
20 


Cln 


Ser 


Arq 


Pro 


Ala 
25 


Leu 


Arg 


L*u 


Asp 


Ala 
30 


Asn 


Phe 


Val 


Gly 


Ser 
35 


Ai ri 


Leu 


Met 


Gin 


Lys 
40 


Val 


Al a 


Asn 


Thr 


Ser 
45 


Val 


Asn 


Asn 


Lyn 


Met 


Tl « 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 


Ala 


Glu 


Phe 


Ala 


Leu 


SO 










55 










60 










Ser 


Asn 


Asp 


Gl/ 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


Ala 


Tyr 


Thr 


Met 


Arg 


65 










70 










75 










80 


Gly 


Ala 


Lys 


Met 


Glu 
85 


Ser 


Leu 


Ser 


Glu 


Thr 
90 


Thr 


Thr 


Axg 


Leu 


His 

95 


Tyr 


Leu 


■31 n 


He 


Pro 

100 


Val 


Asn 


Ala 


Gly 


Her. 
105 


Arg 


Phe 


S-r 


Phe 


Ala 
110 


Asp 


Asn 


Hot 


Ala 


Tlo 
115 




L~.u 


Glu 


Ala 


Gly 
120 


Pro 


Tyr 


Phe 


Ala 


Tyr 
125 


Gly 


Val 


Ala 


Gly Thr 


He 


Lys 


The 


Lys 


Vdl 


Ala 


Gly 


Val 


Thr 


Ala 


Ser 


val 


Asp 


Ala 




130 










135 










140 










Phe 


Gly Asp 


Asn 


Gly 


Tyr 


Asn 


Arg 


Phe 


Asp 


Leu 


Gly Leu Gly 


Leu 


Ser 


145 










150 










155 










160 


All 


Ala 


Leu 


Ser 


Tyr 
16b 


Asp 


Arg 


Tyr 


Tyr 


Val 
170 


Gin 


He 


Gly 


Tyr 


Glu 
175 


His 


Gly 


Leu 


Leu 


Am 
1B0 


Met 


Leu 


Lys 


Asp 


Ala 

185 


Pro 


Asp 


Lys 


Thr 


Ssr 
190 


Leu 


Arg 


Asn 


Hie 


Asp 
195 


Phe 


Phe 


Val 


Gly 


Leu 

200 


Gly 


Val 


Arg 


Phe 











(2) INFORMATION FOR SEQ ID NO: 4 61 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 3 amino acids 

( B) TYPE: amino acid 
(Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphytomonas gingivalis 

<ix) FEATURE: 

(A) NAME/KEY: mise_feature 

(B) LOCATION 1...243 



<xi) SEQUENCE DESCRIPTION: SEQ ID MO: 4 81 



Met 


Lys 


Arq 


He 


Phe 


Thr 


Val 


Ala 


Leu 


Val 


Leu 


Leu 


Ala 


Ser 


Val 


Thr 


1 








5 










10 










15 




Met 


Ala 


He 


Gly 


Gin 


Ser 


Arg 


Pro 


Ala 


Leu 


Arg 


Val 


Asp Ala Asn 


Phe 








20 










25 










30 






Val 


Gly 


Ser 


Asn 


Gin 


Ser 


Met 


Lys 


Arg 


Asp 


Gly 


Tyr 


Val 


Trp 


A.*p 


Thr 






35 










40 










45 








Lys 


Met 


Asn 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 


Ala 


Glu 


Phe 


Met 


He 




50 










55 










60 










Gly 


Sec 


Arq 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


Asn 


Tyr Thr 


M«t 




65 










70 










75 










80 


Gly 


Ser 


Lye 


Thr 


Gl u 


Trp 


Asp 


He 


Pro 


Glu 


Met 


Val 


Pro Gly Thr 


Tyr 










85 










90 










95 


He 


Thr 


Met 


Val 


Ser 


Thr 


Arg 


Leu 


His 


Tyr 


Leu 


Gin 


Leu 


Pro 


He 


Asn 








100 










105 










110 






Ala 


Gly 


Met 


Arg 


Phe 


Asp 


Leu 


Met 


Asn 


Asp 


Met 


Ala 


Val 


Ser 


He 


Glu 






115 










1 20 










125 








Ala 


Gly 


Pro 


Fhe 


Leu 


Ala 


Tyr 


Gly 


He 


Tyr 


Gly 


Thr 


Tyr 


Arq 


Gin 


Lys 




130 










135 










14C 










Leu 


Glu 


Gly 


Trp 


Lys 


Pro 


Asn 


Asn 


Tyr 


Ser 


Thr 


Gig 


Phe 


Phe 


Gly 


Pro 


145 










150 










155 








160 


Thr 


Leu 


Gly 


Gly 


Pro 


Thr 


Asn 


He 


Arg 


Trp 


Asp 


He 


Gly 


Ala 


PiSn 


He 










165 










170 










175 




He 


Ala 


Ala 


Phe 


His 


Tyr 


Lys 


Arg 


Tyr 


Tyr 


He 


Gin 


Ile 


Gly 


Tyr 


Glu 








180 










135 










19C 






His 


Gly 


PL? 


Val 


Afp 


He 


Vftl 


Ser 


Gly 


Gly 


Gly 


Ser 


Asp 


He 


Pro 


Ar-g 
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195 200 20S 

Leu Asn Asp Asn Acq Gin Ser Ser Ser TJir Thr Ala Leu Arg Glu Lye- 

710 215 220 

Gly Asn Acn Glu Tyr Ala Tyr A^n Arg A.^p Phe Phr Val Gly lie Glv 

22! > 230 235 24 l 
Tyr Arg Phe 



U) INFORMATION FOR SEQ ID HO: 4 82 

(1) SRQUE1ICE CHARACTERISTICS: 

(A) LEHGTH: 207 amino acid.^ 

(B) TYPE: amino acid 
(P) TOPOLOGi: linear 

Ui) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival 1s 

(i?:) FEATURE: 

(A) NAME/ KEY: raisc_feature 
<B) LOCATION 1 . . .207 

<xi) SEQUENCE DESCRIPTION: SEQ ID HO : 4 82 

Met Lys Arg Met Leu Leu Leu Leu Val Val L-u Leu Tyr Gly He Ala 

J- b xo X 5 

Gly Arg Leu Ala Ala Gin Asp Val He Arg Pro Trp Ser Leu Gin Val 

20 25 30 

Gly Ala Gly Tyr Ser Acp Thr Glu A.-r. He Pre Gly Gly Phe Thr Tvr 

35 40 45 

Gly Phe Tyr Leu Gly Lys Arg Met Gly Ser Phe Leu Glu Val Gly L°u 

SO 55 oo 

Ser Met Tyr Asn Ser Thr Arg Gin Thr Ala Asn Asn Ala Asp S-r Phe 
65 70 



? 5 30 



Ala Ser Asn Clu Gly Asp Gly Ser Phe Gin Val Asn Met Ser Ser Pro 



85 



90 95 



Asn Glu Lys Trp Ser Phe Phe Asp Ala Gly Ser Ala Asn Cys Tyr w-t 

10O 105 

He Val Val Gly Val Asn Pro Leu Has Leu Phe Trp Gin Asn Ser Arg 

115 120 125 

His Asn Leu Phe Leu Ala Val Gin Ala Gly Leu Ser Asn Lys His Asn 

130 135 i4n 

He Hir Ph- He Tyr Gly Asp Lys Gly Ala Lys Val Ser He Tyr Thi 



1-15 150 



1S5 160 



A;m Ser Asn Thr Tyr He Gly Tyr Gly Ala Ara Val Ala Tyr Glu Tyr 

165 -jvo 175 

Gin He His Lys Asn Val Gly Ala Gly Ala Ala Yal Met Tyr Asp Hi-* 

180 185 loo 

uly Asn Lys Met Leu Thr Ala Met Ala Thr Leu Ser Thr His Phe 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 4 83 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 amino acids 
(B> TYPE: amino acio 
(D> TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

[vi} ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ixl FEATURE: 

(Af NAME/ KEY: misc featur- 
{B} LOCATION 1 ... 951 

(>:i) SEQUENCE DESCRIPTION: SEQ 10 110:4 83 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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431/490 

llet Arg Val Sei Asp Leu Cys Scr Arg Lou Ser Trp Leu Leu Pro Val 

1 5 10 15 

He Lou Val Gly Leu Leu Cys Ala Thr Leu Val Ala Ala Glu Arq Pro 

20 25 30 

llet Ala Gly Ala Val Gly Leu His His Arg Arg His Ala Ala Leu s*r 

3S 40 45 

Asp Ser Thr Ala Lys Asp Thr Val Pro Leu Ala Lys Pro II- Pro Asp 

50 55 so 

Ser Ala Phe Arg Asp Ser Leu Pro Ala Asp Ser Tht Gly Ser Met A-g 
6* 70 75 80 

Gin Asp Ser Val Tyr Asp Asp Glu Phe Glu Leu Clu Asp lie Val Glu 

85 go 95 

Tyr Glu Ala Ala Asp Ser lie Val Leu Leu Gl> Gin Aei; Arg Ala Tyr 

lOO 105 no 

Leu Phe Gly Lys Ser Tyr Val Ser Tyr Gin Lys Ser Arq Leu Glu Ala 

115 120 125 

Asn Phe Met Tyr Leu Asn Thr Asp Ser Scr Thr Val Tyr Thr Ara T^-r 

130 135 HQ " 

Val Leu Asp Thr Ala Gly Tyr Pro Met Ala Phe Pro Val Phe Lys Asp 
"5 150 155 i6 S 

Gly Glu Gin Ser Phe Glu Ala Lys A-n Phe Thr Tyr Asn Phe Arg Thr 

165 170 175 

Glu Lys Gly He He Ser Gly Val He Thr Gin Gin Gly Glu Gly Tyr 

180 105 19Q 

Leu Thr Ala Gly Lys Thr Lys Lys Met Pro Asp Asn lie Met Phe Met 

195 200 205 

Gin Gly Gly Arg Tyr Thr Thr Cys Asp Asn His Asp His Pro His Ph- 

210 215 220 

Tyr He Asn Leu Ser Lys Ala Lys Val His Pro Glu Lys Asp He Val 
225 230 235 74 0 

Thr Gly Pro Val Asn Leu Val He Ala Asp Met Pro Leu Pro He Gly 

245 25o 2 5S 

Leu Pro Phe Gly Tyr Phe Pro Phe Ser Asn Lys Tyr Ser Ser Gly He 

260 265 270 

Leu Met Pro Thr Tyr Gly Glu Asp Asn Arg Tyr Gly Phe Tyr Leu Arq 

275 280 285 

Asn Gly Gly Tyr Tyr Phe Ala Phe Ser Asp Tyr He Asp Leu Ala L-u 

290 295 300 

Arg Gly Glu He Phe Ser Lys Gly Ser Trp Gly He Ser Ala Gin S=r 
305 310 315 

Lys Tyr Lys Lys Arg Tyr Lys Tyr Asn Gly Ser Phe Glu Ala Asn Tyr 

325 330 335 

Leu Val Ser Lys Ser Gly Asp Lys Tyr Val Pro Gly Asp Tyr Ser L-/S 

340 345 350 

Thr Thr Ser Leu Asn He Arq Trp Thr His Ser Gin Asp Pro Lys Ala 

355 360 365 

Asn Fro Leu Gin Thr Leu Ser Ala Asn Val Asn Phe Ala Thr Gly Ser 

370 375 3eo 

Tyr Phe Gin Asn Ser Lsu Asn Thr Thr Tyr Asp Val Asn Ala Arq Thr 
385 390 395 400 

Ala Thr Thr Arg Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe Pro Gly 

405 410 415 

Thr Pro Phe Ser He Thr Gly Ser Met Asp He Ser Gin Asn Met Arg 

420 425 4 3o 

Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser lie Asn H«*t Ser 

435 440 445 

Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu Arg Trp 

•ISO 455 460 

Tyr Glu Lys L*u Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn Ser He 
465 4/0 475 48Q 

Leu Thr Lys Glu Lys Asp Leu Leu Gin Ser Asn Leu Val Arg Asp Trp 

485 490 495 

Lys Asn Gly Met Arg His S-r Val Pro He Ser Leu Thr Val Pro Leu 

500 505 510 

Leu Asp Tyr He Asn Leu Thr Met Gly Val Asn Tyi Asn Glu Trp Trp 

515 520 • 525 

Tyr Thr Lys Gly He Arg Lys Ser Trp Asn Glu Asp Lys Lys Thr Phe 

530 535 540 

Leu Pro Ser Asp Thr Thr Tyr Lys Phe Arg Arg Leu Tyr Asp Tyr S~r 
545 550 555 56Q 

Leu Ser Ala Gly Leu Ser Thr Thr Leu Tyr Gly Het Phe Lys Pro Trp 

565 S70 575 

Lys Pro Phe Ser Phe Gly Gly Asn Leu He Met lie Arg His Arg Ph- 

580 585 590 

Thr Pro Thr Val Ser Phe Ser Tyr Met Pro Asp Phe Thr Lys Arg Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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65 



432/490 

, 5?5 600 605 

Tyr Gly Ph. Trp Glu Leu Leu Glu Hi, T hr A,p Gin Asn Gly Ly» Leu 

H« Thr Leu Leu Tyr Ser Pro Tyr Phe Glu Gin ??« p he Gly Ala Pro 

SeC Het Gli ' A£n S ^ Val *™ ^ Phe Asp A «n A-n Leu 

6g5 6SO 



640 
Leu 

Glu Ala Lys lie Lyi Ser Lys Ser Asp S^r Thr Gly He Ly S Lys H e 

10 Ser L eu II, A, P Gin Phe Thr Trp |« Thr Ser Tyr A,n Met Phe Ala 
J 680 €85 

A.-p Ser n. Arg Trp Ser £„ lle Sei Mj s „ flla ^ ^ 

is 70s Lys Ser P,,e Thr }™ a;9 ,-eu ser Giv i~ ^' P " T ^ 

Thr Ly. Tyr Tyr Glu Gly Glu A S p Gly Lys i" lis Pro Tyr Lya ™ 
Am Asp Leu Arg He Phe A*n Gly ty» £?. Leu Ma Arg Leu ^ 
20 Thr Gly Thr Ser Phe Ser Tyr Thr Zen A,n Ly, clu Ser Leu Ser Gly 
Leu II. Ala L,u Phe Ser Gly LyB GIu Acg ^ ™= fllu lye Ly£ 

25 70S H ~ G,U ^ *«P «• Asp II, !.e u 

G '" W " S1 " At3 a P f° Gi " *•» 8Iu Ser Oli «f Ser Leu Leu Glu A^ 
Asn Arg Gin Gly Gly Aia Vdl ^ Gly Tyr phe Ala ^ 

30 He Pro Trp S« Leu Ser Phe A*p Ty r Ser Trp A,n Xle "! Thr Asp 

Tyr £n Arg Tyr Asn Val ton Ly S Met Glu Hi. Tyr Tyr Arg Val Thr 

Gly 
Tyr 
Arg 

40 Ser Phe lie Pro He Gly Ala Tyr s=r ^ p ^ 

920 q^c 

Val Ly« Scr Ser Leu Leu Gin Asp Leu Ly, Tyr Gin Gin Se r A*n Ary 

45 in IX * Thr Asn Thr 2s 



35 865 ^ ^ r g^A ^ Ils «„ P« Pxo Asn Trp S er 



Phe Gly Phe Asn Ala A*n Tyr A*n Phe Acp Leu Ly S Lys Ile Thx Ser 



Leu Thr Cy S A,n vll Thr Arg Asp Met tils Cy, Trp Ala He llr Ala 



(2) INFORMATION FOR SEQ ID NO: 484 

c n (1 > SEQUENCE CHARACTERISTICS: 

°° «A) LENGTH: 1226 amino acid^ 

(B) TYPE: amino acid 

ID) TOPOLOGY: linear 

55 MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
60 <A) ORGWIIStl: t^orphyroznonas gingival!* 

U:0 FEATURE: 

(A) NAI IE/KEY : mite- featur- 

(B) LOCATION 1. . .122 6 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HO:484 

„ et „ et T .y.~ Arg Tyr Thr He He Leu Ala Val Phe Leu Leu Pne ^ 
5 10 



7Q Thr Val Phe Thr Phe Gin He Lys Ala Ara Pre Tyr^lu Arg Phe Ala 
Asp Val Glu Lys Pro Trp I ie G j n ^ IUs Swc t|et ^ ^ r Lyc Lau 
Val Pro Ala Asn Lys Gly Leu He Gin Ala Glu He Val Tyr Gin 

75 ser Val Ser Glu His Ser Asp Leu Val He Ser Pro Val A-n Glu 11. 
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20 



25 



65 70 75 e0 

Arg Pro Ala Asn Arg Phe Pro Ser His Arg Lys Ser Phe Phe Ala Glu 

85 90 «5 

Asn Leu Arg Ala Ser Pro Pro Val Val Pro Val Aid Val Asp Ly- Tyr 
5 100 105 no 

Ala Val Pro Val Ala Asn Pro Mot Asp Pro Glu Asn Pro Asn Ala Trp 

120 125 
Asp Val Thr Leu Lys He Thr Thr Lys Ala Val Thr Val Pro Val Asd 
- n 130 i 3S uo 

1U Val Val Met Val He Asp Gin Ser Ser Ser Met Gly Gly Glu Asn He 
14S ISO 155 160 

Ala Arg Leu Lys Ser Ala He Ala Sec Gly Gin Ary Phe Val Lys Lys 

165 170 17 5 

Het Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg He Ala L~u Val 

180 185 190 

Ser Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys A<*p Thr 

195 200 205 

Ala Phe Leu Cy.« Gin Lys He Arq Ala Leu Thr Pro He Trp Gly Thr 

210 215 220 

His Tnr Gin Gly Gly Leu Lya Met Ala Arg Asn He Met Ala Thr Ser 
225 230 235 240 

Thr Ala Val Asp Lys HI.- He He Leu Met Ser Asp Gly Leu Ala Thr 

245 250 255 

Glu Gin Tyr Fro Val Lys Asn Val Thr Thr Ala Asp Phe He Gly Lys 

260 205 270 

Thr Gly Asn Ala Asn Asp Pro He Asp Leu Val He Gin Gly Ala 11 = 

275 280 205 

Asn Phe Pro Thr Asn Tyr Val Ser Asn Asn Pro Scr Thr Pro L-u Thr 
Qn 290 295 300 

Pro Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro 
3 °5 310 315 320 

Glu Ser Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg He Thr Phe Asp 

325 330 
Gly Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr 
35 340 34S 35o 

Tyr Tyr Tyr Phe Pro Cys Asn Ala Ala He Asn Glu Ala Gin Ph~ Ala 

355 360 365 

Lys Asn Ser Gly Tyr Thr He His Thr He Gly Tyr Asp Leu Gly Asp 
An 370 375 380 

W Phe Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn 
385 390 395 400 

His Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn 

405 410 415 

He Ala Gin Thr He Asn He Gly He Gin Arg Gly G) u Val Thr Asp 

4 20 4 25 430 

Phe Val Ala Pro Gly Phe He Val Lys Asn Leu Thr Gin Ser Gly Asp 

435 440 445 

Val Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val 

450 455 460 

Ser Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser S~r S-r 
465 470 4/5 480 

Glu Ala Thr He Thr Tyr Arg He Tyr Ala Asp Leu Asp Tyr He Gin 

4 85 490 4 

Asn Asn Asp He Pro Val Asn Thr Thr Ser Ala He Gly Pro Asp Leu 

500 505 510 

Gly Gly Phe Aep Thr Asn Thr Glu Ala Lys Leu Thr Tyr Thr Asn Ser 

515 520 525 

Afin Gl V Giu p ro Asn Gin Gin Leu II « Phe Pro Arg Pro Thr Val Lvs 

530 535 540 

Leu Gly Tyr Gly Val He Lys Arg His Tyr Val Leu Val Asn Lys Asp 
!L 4S „, 550 555 560 

Gly oln Pro He Gin Ala Asn Gly Thr Val Val Ser Ser Leu Ser Glu 

565 570 575 

Ala His Val Leu Gin Ser G)n Asp Phe Phe Leu Pro Ser Gly Gly Gl" 

±>90 5B5 590 

His Tie Val Pro Lys Trp lie Lys Leu Asp Lys Thr Thr Glu Ala Leu 

595 600 605 

Gin Tyr Tyr Ser Val Pro Pro Thr Asn Thr Val He Thr Thr Ala A*n 

610 615 620 

Gly Lys Arg Tyr Arg Phe Val Glu Val Pro Gly Ser Thr Pro A.-n Pro 
6f 5 630 635 64() 

Gly Gin Tie Gly He Ser Trp Lys Lys Pro Ala Gly Asn Ala Tyr Ph- 

64S 650 655 

Ala Tyr Lys Leu Leu Arn Tyr Trp Met Gly Gly Thr Thr Asp Gin Gin 
fi 60 665 670 



45 



50 



55 



60 



65 



70 



75 
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Ser Glu Trp Asp Val Thr Sec Asn Trp Thr Gly Ala Gin Val Pro Leu 

675 680 685 

Thr Gly Glu Asp Val Glu Phe Ala Thr Thr Glu Asn Phe Gly Ser Pio 
6?0 695 700 

5 Ala Val Ala Asp Leu His Val Pro Thr Thr Asn Pro Lys He He Gly 
705 710 715 720 

Asn Leu He Asn Asn Ser Asp Lys Asp Leu Val Val Thr Thr Ser Ser 

725 730 73S 

Gin Leu Thr He Asn Gly Val Val Glu Asp Asa Am Pro Asn Val Gly 
10 740 745 750 

Thr He Val Val Lys Ser Ser Lys Asp Asn Pro Thr Gly Thr Leu Leu 

755 760 765 

Phe Ala Asn Pro Gly Tyr Asn Gin Asn Val Gly Gly Thr Vsl Glu Phe 
77 0 775 7 90 

15 Tyr Asn Gin Gly Tyr Asp Cys Ala Asp Cys Gly Met Tyr Arg Arg Ser 
785 790 795 800 

Trp Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro lie Asn 

805 810 815 

Asp Val Gly Gly Asn Glu Thr Val Asn Gin Trp Val Glu Pro Phe Asn 
20 920 825 830 

Gly Asp Lye Trp Arg Pro Ala Pro Tyr Ala Pro Asp Thr Glu Leu Gin 

835 840 8-15 

Lys Phe Lys Gly Tyr Gin He Thr Asn Asp Val Gin Ala Gin Fio Thr 
850 855 8*0 

25 Gly Val Tyr Ser Phe Lys Gly Met He Cys Val Asp Al a Phe Leu 

865 970 fi/5 880 

Asn Leu Thr Arg The Ser Gly Val Asn Tyr Ser Gly Ala Asn Leu He 

805 89Q 895 

Gly Asn Ser Tyr Thr Gly Ala He Asp He Lys Gin Gly He Val Phe 
30 900 905 910 

Pro Pro Glu Val Glu Gin Thr Val Tyr Leu phe Asn Thr Gly Thr Ary 

915 920 925 

Asp Gin Trp Arg Lys Leu Asn Gly Ser Thr Val Ser Gly Tyr Arg Ala 
930 935 94 0 

35 Gly Gin Tyr Leu Ser Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Leu 
945 950 955 960 

Pro Asp ArQ He Pro Ser Met. His Ser Phe Lou Val Lys Met Gin Asn 

965 970 97 5 

Gly Ala Ser Cys Thr Lsu Unfc He Leu Tyr Asp Lys Leu Leu Lys Asn 
40 900 985 990 

Thr Thr Val Asn Asn Gly Asn Gly Thr Gin He Thr Trp Arg Ser Gly 

995 1O0O 1005 

Asn Ser Gly Sex Aid Asn llet Pro Ser Leu Val Met Asp Val Leu Gly 
1010 1015 1020 

45 Asn Glu Ser Ala Asp Arg Leu Trp He Phe Thr Asp Gly Gly Leu Ser 

1025 1030 1035 1040 

Phe Gly Phts Asp Asn Gly Trp Asp Gly Arg Lys Leu Thr Glu Lys Gly 

1045 1050 1055 

Lou ser Gin Leu Tyr Ala llet Ser Asp He Gly Asn Asp Lys Phe Gin 
50 1060 1065 1070 

Val Ala Gli Val Pro Glu Leu Asn Asn Leu Leu He Gly Phe Asp Ala 

1075 1080 1085 

Asp Lys Asp Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Asp His Phe 
1090 1095 1100 

55 Ala Lys Gly Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Ly? Ills 

llOS 1110 1115 1120 

Arg He Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Lys Arg Gly Asp 

1125 1130 1135 

Ser Gly Ala Arg Phr* Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp 
60 1140 1145 1150 

Acp S-r His Val Val Ser Thr Asn Gly Arg Glu He He He Leu Asn 

U5S 1160 1165 

Gin Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr He Glu Gly Lys 
1170 1175 1180 

65 Leu Lou Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val ttet Lys 

1185 1190 1195 1200 

Val Gin Thr Gly Gly Ala Tyr He Val His Leu Gin Arn Ala Phe Thr 

1205 1210 1215 

Asn Asp Val His Lys Val Leu Val Glu Tyr 
70 1220 1225 

(2) LtlFORllATIOM FOR SF.Q ID HO: 4 85 

(i) SEQUENCE CHARACTERISTICS : 
75 (A) LENGTH: 1225 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uri) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Po rphyromonas ginqivalis 

[ix) FEATURE: 

(A) IJAHE/KEY: misc_f eature 

(B) LOCATION 1...1225 

(xi) SEQUENCE DESCRI PTIOII : SEQ ID NO: 485 

Met Lys Arq Tyr Thr He II* Leu Ala Val Phe Leu Leu Phe Cys Thr 

15 10 15 

Val Phe Thr Phe Gin He Lys Ala Arg Pro Tyr Glu Axg Phe Ala Asp 

20 25 30 

Val Glu Lys Pro Trp lie Gin Lys His Ser Met Asp Ser Lys Leu Val 

35 40 45 

Pro Ala ten Lys Gly Asn Lc-u He Gin Ala Glu Tie Val Tyr Gin Ser 

5V 55 60 

Vnl Glu His Ser A-p Leu Val He Ser Pro Vai Asn Glu He Aiq 

65 70 75 80 

Pro Ala Asn Arg Phe Pro Ser His Arg Lys Scr Phe Phe Ala Glu Asn 

05 9Q 95 

Leu Arg Ala Ser Pro Pro Val Val Pro Val Aia Vai Asp Lys Tyr Ala 

100 105 no 

Val Pro Val Ala Asn Pro Met Asp Pro Glu Asn Pro Asn Ala Trp Asn 

115 120 125 

Val Thr Leu Lys He Thr Thr Lys Ala Val Thr Val Pro Val Asp Val 

130 135 ijo 

Val Met Val He Aep Gin Scr Ser Ser Met Gly Gly Gin Asn He Ala 
14S 150 155 150 

Axq Leu Lys Ser Ala He Ala Ser Gly Gin Arg Phe Val Lys Lys Met 

165 170 i"/b 

Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg He Ala Leu Val S-r 

180 195 190 

Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys Asp Thr Ala 

195 200 205 

Phe Leu Cys Gin Lys He Arg Ala Leu Thr Pro He Trp Gly Thr Hi- 

210 215 220 

Thr Gin Gly Gly Leu Lys Met Ala Arg Asn He Met Ala Thr Ser Thr 
225 230 235 2^0 

Ala Val Asp Lys His Tie He Leu Met Ser Asp Gly Leu Ala Thr Glu 

24 5 250 255 

Gin Tyr Pro Val Lys Asn Val Thr Thr Ala A^p Phe He Gly Lyc Thr 

260 26b 270 

Gly Asn Ala A£-n A.*^p Pro He Asp Leu Val He Gin Gly Ala He Asn 

2*75 280 2B5 

Phe Pro Thr Asn Tyr Val Ser Asn Asn Pro Ser Thr Pro L~u Thr Pro 

255 300 
Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro Glu 
305 310 315 320 

Ser Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg Tic Thr Phe Asp Glv 

325 330 335 

Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr Tyr 

340 345 350 

Tyr Tyr Phe Pro Cys Asn Ala Ala He Acn Glu Ala Gin Phe Ala Ly- 

35S 360 365 

Asn S-r Gly Tyr Thr He His Thr He Gly Tyr Asp Leu Gly Asp Phe 

370 375 380 

Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn His 
385 390 395 400 

Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn 11= 

405 410 4i 5 

Ala Gin Thr He Asn 11 = Gly He Gin Arg Gly Glu Val Thr Asp phe 

420 425 430 

Val Ala Pro Gly Phe He Val Lys Asn Leu Thr Gin Ser Gly Asp Val 

435 440 445 

Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val S-r 

4S0 455 .460 

Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser Ser Ser Giu 
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4 65 

Ala Thr lie Thr T yr Arg He Tyr Ala Arp ^ ^ p Ty[ ^ 



A3,, top He Pro Val Asn Thr Thr Scr Jl" Xln Gly Pro Asp Leu Gly 

Asp Thr Asn Thr Glu AJLa 

515 520 S2S 



Gly Phe Asp Thr Asn Thr Glu Ala Lys Leu Thr Tyr Thr As,, Ser Asn 

520 

Gly Glu Pro ten Gin Gin Leu 11= Ph. Pco ^ Prc Tnr Val Lys L=>j 

10 Gly Tyr Gly Val He Lys Arg Hi. Tyr Val Leu "? Asn Lys Asp Gly 

Gin Pro lie Gin Ala Asn Gly Thr Val Val Ser Ser Leu Ser Glu S 

570 

lg Ha. Val Leu Gin S . r Gin Asp Phe Phe Leu Pro Ser Gl» Gly Gly His 

He Val Pro Lye Trp He ,.y. Leu Asp Lys Thr Thr Glu Wa Leu Gin 

Tyr Tyr S er Val Pro Pro Thr A^n Thr Val He Thr Thr Ala Asp Gly 

Ly S Arg Tyr Arg Phe Val Glu Val Fro Gly Ser Thr Pro Asn Pro Gly 
630 635 <z trt 

Gin 11, Gly He Ser Trp Lys Lys Pro Ala Gly Asn Ala Tyr Phe 111 

25 ^ L '' S I-eU gin A -*" Ti ' r TrP *g Thr Thr A *P Gin Gin Ser 

Glu Trp Asp Val Thr Ser A... Trp Thr Gly Ala Gin Val Pro L SU Thr 

680 6 B 

Gly Glu Asp Val Glu Phe A^a Thr Thr Glu Asn Phe Gl-/ S er Pro Ala 

30 Val Ala Asp Leu „„ Val Pro Thr Thr Asn Pro L^-s He He G,y Asn 
Leu He Asn Asn Sor Asp Lys A* P Leu Val Val Thr Thr Ser Ser Gin 
25 "0 735 



35 Leu Thr " e Tao ° ly Val Val Glu A ff Asn A£ " p " Asn Yfi Thr 

755 760 7S5 



He Val Val Lys Ser Ser Lys Asp Asn Pro Thr Gly Thr Leu Leu Ph- 

760 765 
Ala Asn Pro Gly Tyr Asn Gin Asn Val Gly Gly Thr Val Glu Phe Tyr 

40 Asn Gin Gly Tyr Asp Cys Ala Asp Cys Gly Met T yr Arg Arg Ser Trp 



Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro He Asn Asp 

810 

45 G1V *|S G1U Val AE " G1 " T ^P V ^ <=lu Phe Asn Gly 

825 pin 

Asp Lys Trp Arg Pro Ala Pro Tyr Ala Pro Asp Thr Glu Leu Gin L- 
„ , UJD 840 oje * 

Ph- Lys Gly Tyr Gin He Thr Asn Asp Val Gin Ala sin" P ro Thr Gly 

50 val Tyr Ser Phe Lys Gly llat „. Cys Val cy ., ™ ^ ^ ^ ^ 
Leu Thr Arg Thr Ser Gly Val Asn Tyr Ser 5" Ala Asn Leu He "y 

55 ^ 31V Ald "« As P "J ™ Oln GIy He val Phe Pro 

Pro Glu Vol Glu Gin Thr Val Tyr Leu Phe Asn Thr Gly %° c Ar g Asp 

Gin Trp Arg Lys Leu Asn Gly Ser Thr Val ser Gly Tyr Arg Ala Gly 

AO 93 5 94 0 

bU Gin Tyr Leu Ser Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Leu Pro 
Asp Arg He Pro Ser Met His Ser Phe Leu Val Lys Met Gin Asn Gly 

» 7 ° 97S 



65 ^ ™£ L9U Unk Ile Leu ^r Asp Lys Leu Leu Lys £n Thr 

Thr Val Asn Asn Gly Asn Gly Thr Gin lie Thr Trp Arg s'er Gly Asn 

~ " 1000 100s 

ser «ys.r Ala Asn Met Proper Leu Val Mot Asp Val Le U Gly Asn 

70 Glu^er Ala Asp Arg Le^Trp He Phe Thr Asp Gly ? Gly Leu Ser Phe 

Gly Phe Asp As 11 oiyrrp Asp Gly Arg Lys Le^Thr Glu Lys Gly 
75 Ser G!n ,.-„ Tyr Ala „e t Ser Asp H Gry^sn Asp Lys Phe Gl^Va! 

ioc^ iovo 
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Ala Gly Val Pro Glu Leu Asn Asn Leu Leu II- Gly Phe Asp Ala Asp 

1075 1080 1085 

Lys Asp Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Asp His Phe Ala 
10<?0 109S U00 

5 Lys Giy Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Lvs His Ara 

UO^ 1110 1115 lizo 

11* Thr Asn 3-r Thr Ser Tyr Ser Phe Asp Ala Ly* Arg Gly Asp Ser 

1125 iijo ii 3L 

Gly Ala Arg Phe Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp 
10 11^0 1145 1150 

Ser His Val Val Ser Thr Am Gly Arg Glu He* He He Leu Asn Gin 

1155 1160 n 6i> 

Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr He Glu Gly Lys L-u 

1170 1175 H80 

Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys Val 
1185 H90 U95 1200 

Gin Thr Gly Gly Ala Tyr He Val His Leu Gin Asn Ala Phe Thr Asn 

120b 1210 1215 

Arp Val His Lye Val Leu Val Glu Tyr 

1220 1225 

(2) INFORMATION FOR SEQ ID MO: 486 



15 



20 



(1) SEQUENCE CHARACTERISTICS : 
25 (A) LEIIGTM: 425 amino adds 

(B) TYPE: amino acid 
( D) TOPOLOGY : lineax 



30 



35 



50 



60 



65 



70 



75 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonn.*? gingivalls 

(lx) FEATURE: 

(A) WAME/KEY: miscjeature 

(B) LOCATIOH 1...425 



40 |xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 

I let Glu Val Lys Lyr Asn Thr Val Val Leu Arg Leu Leu He Trp Phe 
15 io 15 

Val Ala He Leu Leu Phe His Ser Ser Arg Leu Trp Gly Gin Glu Gly 
^3 20 25 30 

Glu Gly Ser Ala Axg Tyr Arg Phe Lys Gly Phe Val Asp Thr Tyr His 

J5 40 45 

Ala Val Arg Ser Ser Ser Pro Phe Asp Phe M-L Ser Sor Arg Thr Arg 

50 55 60 

Val Arg Gly Glu Leu Glu Arg S-r Phe Gly Asn Ser Lys Val Ala Val 
65 "'0 75 8" 

Ser Val Asn Ala Thr Tyr Asn Ala Leu Leu Lys Asp Glu Thr Gly Leu 



85 



90 os 



Arg Leu Arg Glu Ala Phe Phe Glu His Gin Glu Glu His Trp Glv L-u 

10O 105 nS 

Arg Leu Gly Arg Gin lie Val He Trp Gly Ala Ala Asp Gly Val Arq 

115 120 125 

He Thr Acp Leu Tie Ser Pro Met Asp Met Thr Glu Phe Leu Ala Gin 

130 135 140 

Asp Tyr Asp Asp He Arg Het Pro Val Asn Ala Leu Arg Phe Ser Val 
It 150 155 16Q 

Phe Asn Glu Ser I let Ly^ Val Glu Val Val Val Leu Pro Val Phe Giu 

165 170 175 

Gly Tyr Arg Leu Pro Val Asp Pro Arg Asn Pro Trp Asn Tie Phe <^r 

180 185 190 

Lsu Ser Pro He Ala Gin Gly Met Asn He Val Trp Lys Glu Glu Ala 

195 200 205 

Gly Lys Pro Ala Phe Lys Val Ala Asn He Glu Tyr Gly Ala Ara Trt> 

210 215 270 

Ser Thr Thr Leu Ser Gly He Asp Phe Ala Leu Ala Ala Lou His Thr 



225 230 



235 2 10 



Trp Asn Lys Het Pro Vol He Glu Val Gin Gly He Val Pro Thr Glu 

2*15 250 255 

He He Val Ser Pro Arg T>r Tyr Arg Met Cly Ph* Val Gly Gly Asp 
260 265 p7Q 
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Leu 


Ser 


Val 


Pro 


Val 


Gly Gin 


Phe 


Val 


Phe 


Arq 


Gly 


Glu Ala 


Ala 


Phe 






275 








280 










285 






Asn 


lie 


Asp 


Lys 


His 


Phe Thr 


Tyr 


Lys 


Ser 


His 


Ala 


Glu Gin 


Glu 


Gly 




290 








?95 










300 








Phe 


Gin 


Thr 


He 


Asn 


Trp Leu 


Ala 


Gly Ala 


Asp 


Trp Tyr Ala 


Pro 


Gly 


305 










310 








315 








320 


Glu 


Trp 


M^t 


He 


Ser 


Gly Gin 


Phe 


Ser 


Met 


Glu 


Ser 


He Phe 


Arq 


T*yr 










325 








330 








335 




Arg 


Asp 


Phe 


He 


Ser 


Gin Arg 


Gin 


His 


Ser 


Thr 


Leu 


lit; Thr 


Leu 


Asn 








340 








345 








350 






Val 


Ser 


Lys 


Lys 


Phe 


Phe Gly 


Ser 


Thr 


Leu 


Gin 


Leu 


Ser Asp 


Phe 


Thr 






355 








360 










365 






Tyr 


Tyr 


A*p 


Leu 


Thr 


Gly Lys 


Gly Trp 


Phe 


Ser 


Arg 


Phe Ala 


Ala 


Asp 




37 0 








3/5 










380 








Tyr 


Al a 


IiPii 


A.-n 


Aj?p 


Gin He 


His 


Leu 


Met 


Ala 


Gly 


Tyr Asp 


Trp 


Phe 


3es 










390 








395 








400 


Ser 


Ser 


Lys 


Gly 


Ser 


Gly He 


Phe 


Asp 


Arg 


Tyr 


Lys 


Asp Asn 


Ser 


Glu 










4 05 








410 








41b 




Leu 


Trp 


Phe 


Lys 


Ala 


Arg Tyr 


Ser 


Phe 















420 4 25 



(2) IHFORIIATIOH FOR SEQ ID NO: 4 07 

U) SEQUEJICE CHARACTERISTICS: 

[A) LEJJGTH: 4 04 amino acid* 
{ B) TYPE: ammo acid 
(D) TOPOLOGY : linear 

(xi) tlOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE: 

(A> ORGANISM: Porphyromonae glngivalis 

( 1 X J FEATURE : 

(A) MAMS/ KEY: misr.J eature 
(B| LOCATION 1. . .404 





txi) 


SF.QUF.HCE DESCRIPTION: SEQ ID NO: 4 87 










Ilet 


Ser 


Ser 


Cys 


Glu 


Val 


Ala 


Tyr 


Phe 


Ser 


Leu 


Lys 


Pro 


He 


Asp 


Leu 


1 








5 










10 










15 




Gin 


Asn 


He 


Aro 


Glu 


Arg 


Asn 


His 


Ser 


Ser 


Asp 


He 


Ala 


Leu 


Ser 


Asn 








20" 










25 










30 






Leu 


Leu 


Aso 


Asn 


Ser 


Asn 


Gin 


Leu 


Leu 


Ala 


Thr 


He 


Leu 


He 


Gly 


Asn 






35 










40 










45 








Asn 


Val 


T1 a 


A^n 


Val 


Ala 


He 


Val 


He 


Leu 


Ser 


Am 


Tyr Ala 


He 


Glu 




50 










55 




















Gin 


Thr 


Phe 


Val 


Phe 


Ser 


Ser 


Pro 


He 


He 


Gly 


Phe 


Leu 


He 


Gin 


Thr 


65 










70 










75 










80 


He 


Leu 


T.eu 


Thr 


Thr 


Val 


Leu 


Leu 


Leu 


Phe 


Gly 


Glu 


He 


Leu 


Pro 


Lys 










85 










90 










95 




Val 


Tyr Ala 


Arg 


Lys 


Asn 


Pro 


Leu 


Gin 


Tyr 


Ser 


Arg 


Phe 


Ser 


Ala 


Ala 








100 










105 










110 






Ala 


Met 


Ser 


Val 


He 


Tyr 


Lys 


He 


Leu 


Ser 


Pro 


Phe 


Ser 


Lys 


Leu 


Leu 






115 










120 










125 








Val 


Lys 


Ssc 


ThE 


Gly 


11* 


Val 


Thr 


Arg 


Gly 


He 


Ser 


Lys 


Lys 


I.ys 


Tyr 




130 










135 










110 










Asp 


ller 


Ser 


Val 


Asp 


Glu 


Leu 


Ser 


Lys 


Ala 


Val 


Ala 


Leu 


Thr 


Thr 


Thr 


145 










150 










155 










160 


GlU 


Gly Glu 


Pro 


Glu 


Glu 


Lys 


Glu 


Met 


He 


Asn 


Glu 


He 


He 


Lys 


Phe 










165 










170 










175 




Tyr 


Asn 


Lys 


Thr 


Ala 


Cys 


Glu 


He 


Met 


Val 


Pro 


Arg 


He 


Asp 


He 


Val 








180 










185 










190 






Asp 


Val 


Acp 


Leu 


Ser 


Trp 


Pro 


Phe 


Arg 


Lys 


Met 


Leu 


Asp 


Fhe 


Val 


Val 






1?5 










200 










205 








Ser 


Ser 


Gly 


Tyr 


Ser 


Arg 


Leu 


Pro 


Vai 


Ser 


Glu 


Gly 


Ser 


Glu 


Asp 


Asn 




210 










215 










220- 










He 


Lys 


Gly 


Val 


lift 


Tyr 


He 


Lys 


Asp 


Leu 


He 


Pro 


His 


net 


Asp 


Lys 


225 










230 










235 










240 


Gly 


Asp 


Glu 


Phe 


Asp 


Trp 


Hir 


Pro 


Leu 


He 


Arg 


Lys 


Ala 


Tyr 


Phe 


Val 










245 










750 










255 




Pro 


Glu 


Asn 


Lys 


Arg 


He 


Asp 


Asp 


Leu 


Leu 


Glu 


Glu 


Phe 


Arg 


Ala 


Asn 








260 










265 










27 0 
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Lys 


Val 


Mis 


Val 


Ser 


lie 


Val 


Val 


Asp 


Glu 


Phe 


Gly 


Gly Thr 


Cya 


Gly 






275 










280 








285 




Leu 


lie 


Thr 


Met 


Glu 


Arp 


Tie 


Leu 


Glu 


Glu 


He 


Val 


Gly Glu 


He 


Thr 




290 










295 










300 




Asp 
305 


Glu 


Tyr 


Asp 


Glu 


Glu 

MO 


Glu 


Leu 


Pro 


Phe 


Lys 
315 


Val 


Leu Gly Asp 


Gly 
320 


Ser 


Tyr 


Leu 


Phe 


Glu 


Gly 


Lys 


Thr 


Ser 


Leu 


Ser 


Asp 


Val Arg 


His 


Tyr 










325 










330 






335 


Leu 


Asp 


Leu 


Pro 


Glu 


Asn 


Ala 


Phe 


Gly 


Glu 


Leu 


Gly 


Asp Glu 


Val 


Arp 








340 










345 








350 




Thr 


Leu 


Ser 


Gly 


Leu 


Phe 


Leu 


Gl u 


lie 


Lys 


Gin 


Glu 


Leu Pro 


His 


Val 






355 










360 










365 


Gly 


Asp 


Thr 


Ala 


Val 


Tyr 


Glu 


Pro 


Phe 


Arg 


Phe 


Gin 


Val Thr 


Gin 


llet 




370 










37 5 
















Asp Lys 


Arg 


Arg 


lie 


lie- 


Glu 


Tie 


Lys 


Tie 


Phe 


Pro 


Phe Glu 


Arg 


Thr 


385 










390 










395 






4 00 


Trp 


GlU 


Val 


Glu 























(2) INFORMATION FOR SEP ID 110:488 

(i) SEQUENCF. CHARACTERISTICS : 

(A) LEHGTH: 260 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOG/: linear 

(ii) MOLECULE TYPE: protein 

{iii) HYPOTHETICAL: YES 



IvI) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) HANG/ KEY: misc^feature 

(B) LOCATIOI1 1. . ,2 60 

Ui) SEQUENCE DESCRIPTION; SEQ ID 110:488 

Met Lys Leu Leu Lou Tyr Leu Leu Leu Val Leu Ser Thr Leu S-r Pro 

15 IQ 1S 

Met Tyr Ser Gin Met Leu Phe Ser Glu Asn Leu Thr Met- Am Ho Asp 

20 25 3 n 

Sen Thr L/s Thr He Gin Gly Thr He Leu Pro Val Leu Asp Phe Lv* 

35 40 45 

Thr Glu T.ys Glu Asn Val Phe Thr Phe Lys A*n Thr Ala Asn Leu Asn 

55 fan 
Leu Leu He Lys Hi* Gly Gin Val He Asn Leu He Asn Lys Leu Glu 
65 70 7E 80 

Phe Scr Thr Tyr Gly Asn Lys Val Thr Val Ser Gly Gly Tyr V«al His 

85 go 95 

Thr Glu Tyr Arg Tyr Leu Leu His His Val Phe Glu Val Tyr Pro Tvr 

100 105 no 

Val Glu Ser Gin Trp Ala Glu Ser Arg Gly Met Lys Tyr Lys Val S=r 

115 120 125 

Thr Gly Leu Gin Ser Arg Tyr Arg Leu Val Asn Ser Asp Asn Cy S Leu 

130 135 140 

Met Phe Ala Thr Leu Gly Val Phe Phe Glu Phe Glu Lys Trp Glu Gin 
14S 150 155 160 

Pro Ala Thr Ser Leu Phe Ala Gly Thr Tyr Ala Tyr Ser Arg Ser 11- 

165 i7n 175 

Lys Ser His Leu Ser He Ser Phe Arg His Ara Leu Gly Glu His Trp 

180 185 ' io C 

Glu Phe Thr Thr Thr Ala He His Gin Gly Lys Pro Asp Ser Tyr Ph- 

195 20O 205 

Lys Lys Ala Arg Phe Gly Gly Ala He Asp L«u Lys Tyr His He Thr 

210 215 220 

Pro Thr He Gly He Arg Gly Ala Tyr Axg lie He Tyr Asp Thr Ala 



225 230 



235 - 24 0 



Pro He Val Pro VaL Arg Lys Asp Tyr Asn Thr Yai Asp Val Gly He 
24 5 2S5 

Asp He Ser Phe 
260 

(2» III FORMATION FOR SEQ ID MO: 4 89 
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(1> SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH; 834 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii» MOLECULE TYPE: protein 

(iii» HYPOTHETICAL: YES 

|vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: Porphyronionas gingivalis 

(ix) FEATURE: 

{A) WA1 IE/KEY: misc feature 
t B > LOCATION 1. , .834 

(xi) SEQUEIICR DESCRIPTION: SEQ ID HO: 4 8 9 

Met Lys Arq lie Val Leu Ser Ser Phe Leu Phe Val Leu Ser Il~ L=u 
1 & 10 15 ~ 

Ser Leu list Ala Gin Asn Asn Thr Leu Asp Val His II- Ser Gly Thr 

20 25 30 

lie Lys Asp Ala Ser Ser Gly Glu Pro Val Pro Tyr Ala Thr Val Ser 

35 40 45 

He Arg Leu Thr Gly Ala Asp Thr Thr Gin Val Phe Arq Gin Val Thr 

SO 55 60 

A5 P G1 > ^ n G *V Tyr Phe Val He Gly Leu Pro Ala Ala Pro Ser Tyr 
*- r ' 7 0 75 8 J 

His Leu Thr Ala Ser Phe Val Gly Met Lys Thr His Thr Met Gin He 

85 90 95 

Ser Arg Gly Asn Gly Gin His Asp He Lys Ser He Asp He S~r Leu 

100 105 no 

Glu Ser Glu Asp Lys Gin Leu Ser Thr Val Thr Val Ser Ala Ala Arq 

H5 120 125 

Pro Leu Val Lys Met Glu He Asp Arg Leu Ser Tyr Asn Met Lys Asp 

130 135 140 

A**p Pro Ala Ala Lys Thr Asn Asn Leu Leu Glu Met Leu Arg Asn Val 
145 150 155 160 

Pro Leu Val Thr Val Asp Gly Gin Gly Asn He Gin Val Lys Gly S-r 

165 170 175 

Ser Asn Ph~ Lys He His Leu Asn Gly Arg Fro Ser Thr Met Val S~r 

180 185 190 

Ser Asn Pro Lys Glu Val Phe Arg Ser Tic Pro Ala Hi* Thr He L— 

195 200 205 * 

Arg Val Glu Val Hp Thr Asp pro Gly Val Lys Tyr Asp Ala Glu Gl" 

210 215 220 

Thr Ser Ala He Leu Asp He Val Thr Glu Glu Gly Lys Lys Leu Glu 
225 230 235 2 40 

Gly Tyr Ser Gly Ser II- Thr Ala Ser Val Ser Asn Asn Pro Thr Ala 

245 250 255 

Asn Gly Ser He Phe Leu Thr Ala Lys Ser Gly Lyst Val Gly Leu Thr 

260 265 270 

Thr Asn Tyr Asn Tyr Tyr Gly Gly Lys Asn Lys Gly Ser Arg Tyr Ph° 

275 280 205 

Thr Glu Arg Thr Thr Ser ?I=t Leu Gin Thr He Glu Glu Gly Lys Gl- 

2?0 295 300 

Gin Glu Thr Phe Gly Gly His Phe Gly Asn Ala Leu Leu Ser Ph- Glu 
305 310 315 320 

He Asp Ser Leu Acn Leu Phe Thr Val Gly Gly Asn Val Arg Leu Trp 

325 330 335 

Glu Met Thr Thr Asp Arg Aeix Ser Val Glu Lye Ser Phe Ala Gly Ser 

3^0 345 350 

Asn Leu Met Ser Tyr lie Asp Arg Lys Leu Lys Thr Gin Met Asp Ala 

355 360 365 

Gly Ser Tyr Glu Leu A&n Ala Asp Tyr Gin Hj s Ser Thr Arg Leu Pro 

370 3^5 380 

Gly Glu Leu Leu Thr Val Ser Tyr Arg Phe Thr His Asn Pro Asn Asn 



385 390 



395 - jco 



Ser Glu Thr Phe He Asp Gin Trp Lys Arg As»p Pro Leu A?n Thr Ala 

405 4 io 415 

Awn Thr He Gin Tyr Ala Gly Gin His Ser Lys Ser Asp Ala Gly Met 

420 425 430 

Asp Glu His Thr Ala Gin Val Asp Tyr Thr Ara Pro Leu Gly Gin Ala 

435 440 " 445 
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HI s 


Ser 


Leu 


Glu 


Ala 


f-;i v 


Leu 


Lys 


Tyr 


He 


Tyr 




His 


Ala 


Thr 


or 














455 






















Pro 


Leu 


Tyr 


Glu 


He 




Pro 


Ser 


Glu 






Pro 


Trp 


Gin 




4 65 










rl^ 




















a Tin 


Gly 


Ser 


Leu 


Tyr 


Ala 






Pro 


Ser 




Gly 


Lys 


Phe 


Arg 


His 


Asp 










4 BS 










490 










495 




Gin 


Tyr 


lie 


Gly Ala 


Al a 


Tyr 


Ala 


Gly 


Tyr 


n 


Tyr 


Arg 


Lys 


Asp 


n 








500 










505 










510 






Tyr 


* " r 


L**u 


Gin 


Thr 




Leu 


Arg 


Val 


Glu 


Ser 


er 


Arg 


Leu 


Lys 


Ale* 






515 










520 










525 












Pro 


Glu 


Asn 


Ala 


Al a 


Ala 


Asp 


Ph 


Ser 


Hi c 


Asn 


Ser 


Phe 


Asp 




530 






























Trp 


Val 


Fro 


Gin 


Lpu 


Thr 


Leu 


Gly Tyr 


Thr 


Pro 


Ser 


Pro 


Met 


Lys 


Gin 


545 










550 










555 












L?u 


Lys 




Ala 


Tyr 


Asn 


Ph** 


Arg 


He 


Gin 


Arg 


ro 


Ala 


Tie 


Gly 












555 










57 0 










57 5 




Lsu 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gl n 


Thr 


Asn 


Asp 


lyr 


Gin 


Val 


Gin 


Tyr 


<jiy 








sao 










505 










590 






Asn 


Pro 


Asp 


Leu 


Lys 


Ser 


Glu 


Lys 


Arg 


Hie 


Hia 


Val 


Gly 


Leu 


Ser 


Tyr 






595 










600 










605 








Asn 


Gl T\ 


Tyr 


Gly 


Ala 


Lys 


Val 


Met 


Leu 


Thr 


Ala 


Ser 


Leu Asp Tyr 


Asp 




61 0 










615 










620 










Phe 


Cys 


Asn 


As ii 


Ala 


He 


Gin 


Asn 


Tyr 


Thr 


Phe 


Ser 


Asp 


Pro 


Ala 


Asn 


62 5 










630 










635 










6 4 0 


Pro 




Leu 


Phe 


iiis 


Gin 


Thr 


Tyr 


Gly 


Asn 


lift 


Gly 


Aru 


Glu 




Ser 










645 








650 










655 




Phe 


Ser 


Leu 


Asn 


Thr 


T-r 


Al a 


Met 


Tyr 


Thr 


Pro 


Al a 


Val 


Trp 


Val 


Ar~ 








G60 










665 










67 0 






lie 


Met 


Leu 


Asn 


Gly 


Asn 


He 


Asp 


Arg 


The 


Phe 


Gin 


Lys 


Ser 


Glu 


Ala 






€7 5 










680 










685 








Leu 


Gly 


He 


Asp 


Val 


Asn 


Ser 


Trp 


Ser 


Gly 


Met 


Val 


Tyr 


Ser 


Gly 


Leu 




690 










695 










700 










Met 


Phe 


Thr 


Leu 


Pro 


Lys 


Asp 


Trp Thr 


Val 


Asn 


Leu 


Pho 


Gly Gly 


Tyr 


705 










710 










71b 










720 


Tyr 


His 


Gly 


Gly Axg 


Scr 


Tyr 


Gin 


Thr 


Lys 


Tyr 


Asp 


Gly Asn 


Val 


Phe 










725 










730 










735 




Asn 


Asn 


lie 


Gly 


lie 


Ala 


Lys 


Gin 


Leu 


Phe 


Asp 


Lye 


Lys 


Leu 


Arg 


Val 








740 










745 










750 






Ser 


Leu 


Ser 


Ala 


Asn 


Asn 


He 


His 


Ala 


Lys 


Tyr 


S~r 


Thr 


Trp 


Lys 


Ser 






755 










760 










765 








Arg 


Thr 


lie 


Gly 


Asn 


Gly 


Phe 


Thr 


lie 


Tyr 


Sex 


Glu 


Asn 


Ala 


Gly 


11 = 


770 










775 










780 










Gin 


Axy 


Sei 


Val 


Ser 


Leu 


Ser 


Leu 


Thr 


Tyr 


Ser 


Phe 


Gly Lys Met 


Asn 


785 










790 










795 










300 


Thr 


Gin 


val 


Arg 


Lys 


Val 


Glu 


Arg 


Thr 


He 


Val 


Asn 


Asp Asp 


Leu 


Lys 










805 










810 










815 




Gin 


Thr 


Ser 


Scr 


Gin 


Gly 


Gin 


Gin 


Gly 


Gly 


Gly 


Gin 


Gly Asn 


Pro 


ThL 








82 0 




















830 






Gly 


Asn 































(2) INFORMATION FOR SEQ ID NO: A 90 

(1J SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 
( B > TYPE: amino acid 
(D| TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iia) HYPOTHETICAL: YES 

*vi* ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(Is) FEATURE: 

(A) NAME/KEY: misc_f eature 
(R) LOCATION 1 . . . 399 

*Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 4 90 

Met Arg Lou S^r Ala He Leu He Ala Leu He Val Met Leu Pro Ala 
15 10 15 

Val Leu Sex: Gly Gin His Tyr Tyr Ser (let Ala Gly Glu Arg Leu Glu 
'JO 25 30 
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Thr Asp Ser lie Arg Pro Asn Glu Leu Ser Ala Str lie Arg Sfer Ala 

35 40 45 

Leu Phe Phe Arg Asn Asn Glu Tyr I\m Ala Arg Ser Val Lys Cly Tyr 

50 55 60 

Thr Lnu Pro Gly Ala Arq Val Ser Ala Phe Ala Ser Tyr Ser Leu Pro 
65 70 75 00 

Ala Ala His Gly Val Lys Leu Ser Lou Gly Yal Ser Thr Leu Asn Tyr 

05 90 95 

Trp Gly Ala Ser Arq Tyr Pro Ala Gly lie Ala Tyr Ser Asp Leu Pro 

100 105 110 

Tyr Trp Thr Asp Tyr Asn Asp Tyr Val Arg Leu Arg lie- Lou pro Tyr 

115 120 125 

Val Gin Ala Met Leu Lys Pro Thr Ala Thr Thr Ala Leu Met Leu Gly 

130 135 1-10 

Arn lie Ala Gly Gly Thr Ala His Gly Leu 11^ GJu Pro lie Tyr Asn 
145 150 155 160 

Pro Glu Leu Asp Leu Thr Ala Asp Pro Glu Ala Gly Val Gin Phe Arg 

165 170 175 

Gly Asp Trp Thr Arg Phe Arg Met Asp Val Trp Val Arn Trp Met Ser 

180 185 190 

thst ~lo Phe Lys Arn Asp Asn His Gin Glu Ser Fhe Val Pr.e Gly Leu 

195 200 2Q5 

Ser Thr Thr Ser Lys Leu Leu Ser Gly Glu Gly Lys Trp Arg Leu Glu 

210 215 220 

Leu Pro Leu Gin Ala lie Ala Thr His Arg Gly Gly Giu Tyr Asn Trp 
225 230 235 2-10 

Aj.3 olr. Gin Asp Thr Val His Thr Trp Val Asn Gly Ala Val Gly Leu 

245 250 255 

Lys Leu Ser Tyr Arg Pro Arg Thr Acp Lys Pro list Gin Tie Trp Gly 

250 265 270 

Set Ala Tyr Gly Val Ala Ala Leu Ser Ser Gly Gly Tyr Pr.e Pio Tyr 

275 260 285 

Gl'i Arg Gly Trp Gly Gly Tyr Leu Ser Leu Gly Met Asp Lsu Glu His 

290 295 300 

Phe Ala Phe Arg Thr Asp Tyr Trp Tyr Gly Arg His Tyr Val Ser Pro 
305 310 315 320 

Phs Ala Ala Pro Phe Ala Asn Ser Leu Thr Tyr Asp Lys Gin Pro Leu 

325 330 335 

Thr Am Gly Trp Gly Asp Tyr lie Arg Leu Tyr ALa Asp Tyr Ser Trp 

340 345 350 

Arg Het Ala Accj Ser Val Ser Leu Ala Ala Val Ala Arg Val Trp Phe 

355 360 365 

Gin Pro Ser Asp Arg Phe Ala liet Ser His Ala Leu Glu Lsu Thr Met 

370 375 380 

Arg Tie Asp Pro Lys Phe Pro He Ala Phe Leu Lys Gly Asn His 
3B5 390 395 

(2) IllFORMMMOM FOR SEQ ID NO: 491 

(1) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 382 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{ii> MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingivalis 

(lx) FEATURE: 

(M HAME/KEi: misc_teature 
(P.) LOCATION 1 . . . 382 

(Xi) SEQUENCE DESCRIPTION: SEQ 10 UO:4 9L 

I let Asn Lys Ser Leu Leu Ser Leu Ala Cys Leu lie Leu Cys Gly I let 

15 10 IS 

Pro Ala lie Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu Pro 

20 2b 31 

Ser Leu Ala Glu Arg Val Phe Gly Leu Glu Gin Lys Gin L/s Lys Leu 

35 40 45 

Lys Val Tyr Leu Gly He Gin Ser Phe Tyr A-p Gin Pro Leu Val Asp 
50 bb 60 
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Asp 


Glu 




His 


He 


Gly 


His 


Phe 


Lys 


Val 


Gin 


Glu 


Leu 


Arg 


llet 


Ser 


65 










70 










75 








80 


Ala 


His 


Gly 


Glu 


Leu 


Asn 


Arg 


Hi s 


L»u 


Ser 


Phe 


Asp 


Trp Arg 


Gin 


Arg 










85 










90 










95 


Leu 


Asn 


Arg 


Ala 
100 


Ala 


Asp 


Gly 


Thr 


Ser 
105 


Phe 


Alo 


Asp 


Asn 


Leu 
110 


Ser 


Asn 


Ala 


He 


Asp 


He 


Ala 


Gly Val 


Asp 


Trp 


His 


Pro 


Asn 


Asp 


Lys 


Val 


Ser 






lib 










120 










125 






Phe 


Phe 


Phe 


Gl y 


Arg 


Gin 


Tyr 


Ala 


Arg 


Phe 


Gly 


Gly 


He 


Glu 


Tyr 


Asp 




130 










135 










140 






Met 


Asn 


Pro 


Val 


Glu 


He 


Tyr 


Gin 


Tyr 


Ser 


Asp 


LfMl 


Val 


Asp 


Tyr 


Met 


145 










lbO 










155 






160 


Thr 


Cys 


Tyr 


Thr 


Ser 


Gly Val 


Asn 


Phe 


Ala 


Trp 


Asn 


Phe 


His 


Pro 


Glu 










165 










170 










175 




Gin 


Gin 


Leu 


Gin 

mo 


Leu 


Gin 


Vr.l 


Leu 


Asn 
185 


Ala 


Tyr 


Asn 


Asn 


Arg 
190 


Phe 


Ala 


Asp 


Arg 


Tyr 


His 


Val 


Thr 


Pro 


Asp 


Val 


Ala 


Thr 


Ala 


Thr 


Ser 


Tyr 


Pro 






195 










200 










20b 






Leu 


Leu 


Tyr 


Ser 


Al a 


Gin 


Trp 


Asn 


Gly 


Thr 


Leu 


Leu 


Gly Gly Ala 


Leu 




210 










215 










220 










Hif 


ll-t 


Arg 


Tyr 


Ala 


Val 


Sec 


Met 


Al- 


His 


Gin 


Ala 


Gin 


Glu 


Arg 


Asn 


225 










230 










235 








240 


Met 


Trp 


Tyr 


Phe 


Thr 


Ala 


Gly Asn 


Leu 


Phe 


Asn 


Pro 


Gly 


Lys 


Arg 


He 










245 










250 








255 




Asn 


Gly 


Tyr 


Leu 


Asp 


Leu 


Thr 


Tyr 


Ser 


He 


Glu 


Gly 


Leu 


Asp 


Asp 


Lys 








260 










26b 










270 




Gly 


Tle 


Hc-t 


Thr 


Ala 


Arg 


Tyr 


Gly 




Gly 


Lys 


Thr 


Leu 


Thr 


Ai-p Val 






275 










280 










285 








Lys 


Tyr 


Tyr 


Ala 


Leu 


Val 


Ser 


Lys 


Trp Asn 


Phe 


Arg 


He 


Phe 


Asp 


Gin 




290 










29i> 










300 








Val 


Asn 


Leu 


Ph» 


Lfu 


Lys 


Gly 


Hen 


Tyr 


Glu 


Asn 


Gly 


Tyr 


Ala 


Pro 


Ala 


305 










310 










315 








320 


Gin 


Tyr 


Gly 


Glu 


Ser 


Ser 


His 


Thr Arg 


His 


Scr 


Tyr 


Gly 


Tyr 


Met 


Gly 










325 










330 








335 


Gly Val 


Glu 


Tyr 


Tyr 


Pro 


Thr 


Glu 


Thr 


Asn 


Phe 


Arg 


Leu 


Phe 


Val 


Thr 








340 










345 








350 






Tyr 


lie 


Gly 
355 


Arg 


His 


Tyr 


Arg 


Tyr 
360 


Ser 


Ala 


Thr 


Glu 


Thr 
365 


Glu 


Ser 


Thr 


Asn 


Ala 

370 


Leu 


Arg 


Ala 


Gly 


Leu 
375 


He 


Tyr 


Gin 


He 


Pro 
380 


Phe 


Leu 







{?.) INFORMATION FOR SEQ ID NO: 4 92 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
{ 0) TOPOLOGY: linear 

(il) MOLECULE TYPE : protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(AJ organ I SI I: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: mi sc_f eature 

(B) LOCATION 1. . .222 

(^1) SEQUENCE DESCRIPTION: SEQ ID 110:092 

llet Lys His Leu Phe Lys Ser Thr Lou Val Leu Leu Cys Ala Leu Ser 

1 S 10 i & 

Phe Ser Gly Thr Tyr Thr Phe Ala Gin Glu Asn Asn Thr Glu Lys Ser 

20 25 jo 

Arg Phe Asp Phe Ser Val Arg Leu Gly Gin Gly Tyr II- Ala Glv S-r 

35 40 45 

Thr Thr Asn Leu Met Tyr Gly Tyr Thr Ser Ala Asn Asp Arg Leu Leu 

50 55 60 

Ser Gly Ala He Tyr Leu Gly Leu Thr Pro Ser Lys Lys Glu Asn Ala 
65 70 75 80 

Thr Gly Val Ala Phe Arq Phe Leu Ser Pro Ser Pro Gly Tyr Tyr Val 

85 go 95 

Asp He Ser Gly Lys Glu Asn Thr Leu Asn Tyr Ala Phe Tyr Val Val 
100 105 HO 
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Gi y 


Ala 


Tyr 


Asn 


Arg 


He 


Ala 


He 


Pro 


He 


Arg 


Pro 


He 


Lys 


Asn 


Phe 






115 










120 










125 








Asn 


Phe 


He 


Phe 


Ser 


Thr 


Glu 


Val 


Gly 


Het 


Al a 


Trp 


Het 


Ser 


Arg 


His 




130 










135 










140 










Glu 


Gin 


He 


Tyr 


Asn 


Ser 


Thr 


Ser 


Gin 


Thr 


Trp 


Asp 


Lys 


Gin 


Arg 


Lys 


145 










150 










1S5 










ISO 


Ser 


Arq 


Ser 


Gly 


Leu 


Asp 


Phe 


Gly 


Leu 


Gly 


Met 


His 


Leu 


Gin 


Unk 


Hi a 










165 










170 










175 




He 


Asn 


Lys 


Thr 


Val 


Tyr 


Phe 


Met 


Ala 


Gly 


Inr 


Asp 


Leu 


Thr 


Ser 


Cys 








180 










185 










190 






Het 


Phe 


Gly 


Lys 


Arg 


He 


Asn 


Asp 


Tyr 


Gin 


Gin 


Lys 


Asp 


Arg 


Thr 


Phe 






195 










200 










205 








He 


Ala 


Leu 


He 


Asp 


Asn 


Ser 


He 


Gly 


He 


Gly 


Leu 


Asn 


Leu 








210 










215 










220 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO:4< 


3 3 

















U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 ammo acids 

(B) TYPE; amino acid 
t D) TOPOLOGY: linear 

Ui) MOLECULE TYPE; protein 

{ J i 1 ) HYPOTHETICAL: YES 

<vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonac gingival ir 

(i>:> FEATURE: 

(A) NAME/ KEY: misc_f eature 

( B) LOCATION 1. . .391 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: A 93 



Met 


He 


He 


Lys 


Lys 


Met 


Leu 


Lye 


Asn 


T.yp 


Leu 


Ala 


Pro 


Leu 


Ala 


He 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Phe 


Ala 


Pro 


Lys 


Ala 


Het 


Lys 


Ala 


Gin 


Glu 


Gin 


Leu 


Asn 








20 










2S 








30 






Val 


Val 


His 


Thr 


Ser 


Val 


Pro 


Ser 


Leu 




He 


Ser 


Pro 


Asp Ala 


Arg 






35 










40 










45 








Ala 


Ala 


Gly 


Met 


Gly Asp 


He 


Gly 


Val 


Ala 


Thr 


Thr 


Pro 


Asp 


Ala 


Tyr 




50 










55 










60 








Ser 


Gin 


Tyr 


Trp 


Asn 


Pro 


Scr 


Lys 


Tyr 


Ala 


Phe 


Met 


Asp 


Thr 


Lys 


Ala 


65 










70 










75 








SO 


Gil- 


He 


Ser 


Phe 


Ser 


Tyr 


Thr 


Pro 


Trp 


Leu 


Ser 


Lys 


Leu 


Val 


Asn 


Asp 










85 










90 










95 


lie 


Ala 


Leu 


Met 


Gin 


Met 


Thr 


Gly 


Phe 


Tyr 


Lys 


Leu 


Gly 


Thr 


Asp 


GlU 








100 










105 










110 






Asn 


Gin 


Ala 


He 


Ser 


Ala 


Ser 


Leu 


Arg 


Tyr 


Phe 


Thx 


Leu 


Gly 


Lys 


Leu 






115 










120 










125 








Glu 


Thr 


Phe 


Asp 


Glu 


Leu 


Gly 


Glu 


Ser 


Met 


Gly 


Glu 


Ala 


His 


Pro 


Asn 




130 










135 










140 










Glu 


Phe 


Ala 


Val 


Asp 


Leu 


Gly 


Tyr 


Ser 


Arg 


Gin 


Leu 


Scr 


Glu 


Asn 


Phe 


145 










150 










155 










160 


Ser 


Met 


Ala 


Val 


Ala 


Leu 


Arq 


Tyr 


He 


Arg 


Ser 


Asp 


Gin 


Ser 


Thr 


His 










165 










170 










175 




Asn 


Thr 


Gly 


Glu 


Asn 


Gin 


Ala 


Gly 


Asn 


Ala 


Phe 


Ala 


Ala 


Asp 


He 


Ala 








180 










195 










190 






Gly Tyr 


Leu 


Gin 


Lys 


Tyr 


Val 


Leu 


Leu 


Gly 


Asn 


Ala 


Glu 


Ser 


Leu 


Trp 






195 










200 










205 






SfJC 


Leu 


Cli- 


Phe 


Asn 


Val 


Lys 


Asn 


He 


Gly 


Thr 


Lys 


He 


Ser 


Tyr Asp 




210 










215 










220 










Gly 


Gly 


val 


Thr 


Ser 


Phe 


Phe 


He 


Pro 


Thr 


Ser 


Leu 


Asn 


Leu 


Gly Thi 


225 










230 










235 










240 


Gly 


Leu 


Leu 


Tyr 


Pro 


He 


Asp 


Asp 


Tyr 


Asn 


S~r 


He 


Asn 


Phe 


Asn 


Leu 










245 










250 










255 




Glu 


Leu 


Ser 


Lys 


Leu 


Leu 


Val 


Pro 


Thr 


Pro 


Pre 


He 


Met 


Asp 


Gin 


Asn 








260 










265 










270 






Asp 


G) n 


Ala 


Gly Tyr 


Glu 


Ala 


Ala 


Leu 


Lys 


Lys- 


Tyr" 


Gin 


Glu 


Thr 


Ser 






275 










280 








285 








Ser 


He 


Ser 


Gly 


He 


Phe 


Ser 


Ser 


Phe 


Gly 


Asp 


Ala 


Pro 


Gly Gly 


Leu 




290 










295 










300 










Lys 


Glu 


Glu 


Phe 


Arg 


Glu 


He 


Thr 


Trp 


Gly 


Leu 


Gly 


Ala 


Glu 


Tyr 


Ser 


30b 










310 










3lb 








320 
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Tyr Asp Aip Lys Phe Phe Val Arg 
325 

Thr Lys Gly Asn Leu Gin Tyr Phe 
340 

Asn lie Phe Arg lie Asp Ala Ser 
355 360 
Asn Pro Leu A*o Gin Thr Leu Arq 

370 * 375 

Gly Leu Arg Asn Leu Phe His 
385 390 



AJ a Gly Tyr Ser Tyr Leu Hi3 Pro 

330 335 
Thr Ala Gly Ala Gly Phe Lys Met 
US 350 
Tyr Lsu Leu Sor Thr lie Gin Ser 
365 

Phe Thr Leu Ala Phe Asp Met Asd 
380 



(2) INFORMATION FOR 3EQ ID HO: 4 94 

[i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 
{B> TYPF. : amino acid 
(D) TOPOLOGY: linear 

tii) IIOLECULK TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi] ORIGINAL SOURCE: 

(A J ORGANISM: Porphyromonas gingival is 

{ix] FEATURE: 

(AJ MAI IC/KEY: misc_reature 
(BJ LOCATION 1. . .446 

(xxj SEQUENCE DESCRIPTION: SEQ ID 110:494 

tlet Lys Thr Thr Val Gin Gin He He Leu Cys Leu Ala Leu Met Met 

IS io 15 

Ser Gly Val Leu Gly Gly Asn Ala Gin Ser Phe Trp Glu Glu He Ala 

20 25 30 

Pro Pro Phe lie Ser Asn Glu Pro Asn Val Lys Tyr He He Pro Asn 

35 40 45 

Met Gly He Asp Ser Lys Gly Thr He Tyr Val Thr Val Thr Lys Arg 

50 55 60 

He Gin Gin Gly Ala Asn Tyr Thr Ser Glu Gin Leu Gly Met Tvr Tvr 
65 70 7S ' ao 

Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp Lys Ills Asp Pro Tyr Phe 

85 90 95 

Asp Asp Lys T)e Val Ala Asp He Gin Thr Asp Ala Tyr Gly Arg Val 

100 105 no 

Tyr Val Cys Thr Thr Ser Ser Arg Asp CI n Glu Tyr Glr. Leu Tyr He 

115 120 125 

Asn Glu Gin Asn Glu Trp Arg Cys 11b Phe Lys Thr Ser Val Ser Thr 

130 135 140 

Tyx Glu Hi£ Gly Mot Ala Val The Arg Ser Ser Thr Gly Val Thr T-r 
US 150 i 5 5 ign 

He Gly Thr Arg His His He Phe Ala Ser Gly Val Asn Asp Phe Glu 

165 170 ivb 

Phe Asn Thr He Tyr Glu Asp Ser Thr Pro list Ser Cys Arg Phe Ala 

180 185 19Q 

Glu Ala Thr Asn Ser Gly Thr He Tyr Leu Ala Leu Met His Glu Thr 

195 200 205 

Thr Met Ser Thr Thr He Leu Thr Tyr Gin Asn Gly Glu Phe Val Asp 

210 215 220 

He Ser Glu Ser Glu Leu Ser Asn Ser He He Ala Ser Met Cys S~r 
225 230 235 2 4* 

Asn Lys Glu Gly Asp T3e He Ala Leu Val Thr Ser Tyr Thr Gly Phe 

24 5 250 255 

Met Ser Gly Thr Leu Ala He Arg Lys Ala Asp Glu Gly Trp Gin 

260 265 2~"-< 

Leu Val Gly Gly Asp He Gin Asn Ala He Val Gin Asn He Cys Me- 

275 280 2B5 

Met Asp Asp A5n Lys He Ala Cys Glu Val Phe Gly Thr Pre Asn Gl- 

290 295 300 

Val Asp Gly Arg Thr Arg Val Cys Val Ser Asp Ala Ser Val Fh~ Asp 
305 310 315 320 

Phe Glu Trp Tyr Glu Asp Glu Tie Tyr Gly Gly Leu He Phe Asp Thr 

325 330 335 

Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu Tyr Ala Lys Phe Gly Gl- 
340 345 350 
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He 


Uct 


I.CIU 

355 


Arg 


Ser 


Lys 


Glu 


360 


Phe 


He 


Thr 


Ser 


Phe 

365 


He 


Ser 


Pro 


Thr 


Val 


Val 


Gin 


Gly 


Val 


Atp 


Val 


Tyr 


Thr 


Leu 


Ala 


Gly 


Lys 


He 


Arg 




37 0 










37 b 










380 








lie 


G)u 


Ser 


Glu 


Thr 


Pro 


Val 


Ser 


Glu 


Val 


Leu 


Leu 


Phe 


Asp 


Leu 


Ala 


385 










390 










3 95 










400 


Gly Arg 


I Jet 


VaX 


Leu 


Arg 


Glu 


Thr 


He 


Asp 


Asn 


Lye 


He 


Tyr 


Ser 


Asp 










405 










410 










415 




He 


A.-p 


Thr 


Asn 
420 


Cly 


Leu 


Lys 


Arq 


Ser 
425 


Gly 


He 


Tyr 


Val 


Val 
430 


Ser 


Val 


Arg 


Leu 


Ser 
4 35 


Ser 


Gly 


Gin 


Val 


Phe 
440 


Ser 


His 


Lye 


Val 


Gin 
445 


Val 







(2) INFORMATION FOR SEQ ID NO: 4 95 

(i) SEQUENCE CHARACTERISTICS: 

[A) LENGTH: 300 amino acias 
{B> TYPE: amino acid 
!D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE : 

(A) ORGAN TSM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAJ IE/ KEY: rniscjeature 

(B) LOCATION I . . . 30B 



{xi] SEQUENCE DESCRIPTION: SEQ ID NO: 4 95 



Met 


He 


He 


Arg 


Cys 


Leu 


He 


Arg 


Arg 


Pro 


Arq 


Thr 


Val 


Leu 


Phe 


Gly 


1 








5 










10 










15 




Leu 


He 


Phe 


Val 


Val 


Gly 


Leu 


Phe 


Ser 


Ala 


Met 


Ala 


Gin 


Glu 


Lys 


Lys 








20 










25 










30 


Acp 


Ser 


Lou 


Ser 


Thr 


Val 


Gin 


Pro 


Val 


Pro 


Asn 


Ser 


Ser 


Met 


Val 


Glu 






35 










40 










45 








Gin 


Thr 


Pro 


Leu 


Leu 


Ser 


He 


Asp 


His 


Pro 


Val 


Leu 


Pro 


Ala 


Ser 


Phe 




50 










55 










60 










Gl n 


Asn 


Thr 


Arq 


Thr 


Leu 


Lys 


Arg 


Phe 


Arg 


Asp 


Lys 


His 


Leu 


Set 


Asp 


65 










70 










75 










80 


Ala 


Leu 


Leu 


Asn 


Gly 


Leu 


Lys 


Pro 


His 


Arg 


Ssr 


Ser 


Leu 


Gin 


Leu 


Asn 










85 










90 










95 




Glu 


Glu 


Leu 


Asn 


Phe 


Ala 


Ala 


Glu 


Arg 


Arg 


Asp 


Phe 


Val 


Ser 


Pro 


Leu 








100 










105 










110 






Leu 


Gin 


Thr 


Arg 


His 


Ala 


Ala 


Gly 


Val 


Leu 


Ser 


Trp 


Arg 


Pro 


Thr 


Asp 






115 










120 










125 








Arg 


tier 


HI j? 


Phe 


Tyr 


Thr 


Ser 


Gly 


Asn 


He 


Gly 


Leu 


Gly His 


Asp 


Leu 




130 










135 










140 










Leu 


Thr 


Gly 


Val 


Arg 


Lys 


Asp 


Phe 


Gly 


Trp 


Asn 


Ala 


Gly 


Ala 


Asp 


Phe 


145 










150 










155 










160 


Leu 


Leu 


Ser 


Gin 


Asn 


Leu 


Thr 


Ala 


His 


Val 


Gin 


Gly 


Gly 


Trp 


Gin 


Gin 










165 










170 










175 




Asn 


Phe 


Gly 


Phe 


He 


Pro 


Met 


Thr 


Ala 


Val 


Asn 


Gly 


Gin 


Leu 


Arg 


Trp 








180 










185 










1 90 






Gin 


Ala 


Thr 


Glu 


Arg 


Leu 


Ser 


Phe 


Thr 


Thr 


Gly 


He 


Asp 


Tyr Arg 


Gin 






195 










200 










205 








Val 


Gin 


Trp 


Asn 


Ala 


Phe 


Asp 


Asn 


Arg 


Thr 


Ph^ 


S^r 


Leu 


Lys 


Gly 


Ser 




210 










21S 










220 






Al a 


Arg 


Tyr 


Glu 


Val 


Met 


Asp 


Asn 


Val 


Phe 


Val 


Asn 


Gly 


Phe 


Gly 


Ser 


225 










230 










235 








240 


Tyr 


Pro 


Leu 


Tyr 


Ser 


Ser 


Thr 


Arg 


Ser 


Gly 


Leu 


Asn 


Met 


Ala 


Val 


Pro 










245 










250 










255 




Met 


His 


Gly 


Phe 


Gly 


Pro 


Gin 


Tyr 


Gly Gly 


Ser 


Leu 


Glu 


Leu 


Lys 


Vol 








260 










265 










270 




Ser 


Gla 


Arg 


Phe 


Gly 


Phe 


Ala 


Val 


Gly Met 


Glu 


Arg 


GlU 


Tyr Asn 


He 






275 










290 










285 








Trp 


Thr 


Arg 


Arg 


Trp 


Glu 


Thr 


His 


Tyr 


Phe 


Ala 


Tyr 


Pro 


Val 


Phe 


Tyr 



290 295 300 



Gly Asp Lys Lys 
305 

(2) INFORMATION FOR SEQ ID HO: 4 96 
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(1> SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 31S amino acids 
IB> TYPE: amino acid 
[D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

I A) ORGANISM: Porphyromonas gingival is 

tlx) FEATURE : 

(A) NAME/KEY : miscjeatur- 

(B) LOCATION 1. . .315 



(xi ) SEQUENCE DESCRIPTION: SilQ 1U MO: 4 96 



Met 


Lys 


Thr 


Am 


5 9 




Tyr 


Ala 


Phe 


Val 


Lsu 


Pro 


Leu 


Leu 


Leu Leu 


1 


















10 










15 


Thr 


Gly T.eu 


Leu 


Ala 


Trp 


Gly Gin 


Asp 


Ser 


Ssr 


His 


Gly Ser 


Asn Thr 








20 










25 










30 




Ala 


Phe 


Ala 


Thr 


Asp 


S-r 


Ssr 


Ser 


Arg 


Glu 


Lsu 


Pro 


Thr 


Glu 


Gin Ser 






35 










40 










45 






Ala 


Tyr 


Arg 


Tie 


His 


Ser 


Ala 


Tyr 


Met 


Val 


Gly 


Gly 


Gly 


Gly 


Ser lie 




50 










55 










60 




Thr 


Arg 


Asp 


Thr 


Tyr 


Leu 


Ser 


Pro 


Leu 


Arg 


Tyr 


Gly 


Gly 


Trp 


Thr L=u 


65 










70 










75 


80 


Asn 


Lou 


Leu 


G3y 


Glu 


Lys 


Thr 


Phe 


Pro 


Leu 


Lys 


Ala 


Ser 


Asp 


Ser Arg 










85 










90 








95 


Tr P 


Met 


He 


Arg 


Thr 


Gly 


His 


Glu 


Leu 


Asp 


Fhe 


Ala 


Leu 


Mot 


Asp Asn 








100 










105 










110 


Pro 


Ala 


Asn 


Asn 


Ala 


His 


Phe 


Tyr 


Ser 


Leu 


Leu 


Tyr 


Asn 


Gly 


Ser Ala 






115 










120 








125 




Ala 


Ala 


Leu 


Tyr 


Arg 


Leu 


Gly Ala 


Lys 


His 


Lou 


Arg 


Ala 


Ala 


Trp Met 




130 










135 










i jo 






Asp 


Asn 


Lsmi 


Arq 


Leu 


Ala 


Phe 


Gly 


Pro 


Gly 


Leu 


Glu 


lie 


Gly 


Leu Gly 


145 










150 








155 






160 


Gly 


He 


Tyr 


Ser 


Thr 


Arg 


Asn 


Gly Asn 


Asn 


Pro 


Ala 


Thr 


Leu 


Lys Leu 










165 










17 0 










175 


Tyr 


Thr 


Asn 


Ala 


He 


Ala 


Gin 


Ala 


Ser 


He 


Gly Tyr 


Tyr Val 


Pro Ser 








180 










185 










190 




Glu 


Thr 


Fhe 


Pro 


Leu 


Tyr 


Phe 


Arg 


Leu 


Leu 


Ser 


Gin 


He 


Asn 


Leu Phe 






195 










200 










205 






Gly 


He 


Ala 


Tyr 


Gly 


Asn 


Gly 


Phe 


Gly Glu 


Ser 


Tyr 


Tyr 


Glu 


Asn Phe 




210 










215 










220 






Leu 


Leu 


Am 


Asn 


Gly 


He 


Ala 


Gly 


Ser 


Leu 


His 


Phe 


Thr 


Tyr 


Pro Gly 


221? 










230 










235 






240 


Lys 


Phe 


Thr 


Arg 


Phe 


Thr 


Thr 


Leu 


He 


Thr 


Ala 


Asp 


He 


Pro 


He Arg 










245 










250 








255 


Asn 


Phe 


Cys 


Thr 


Leu 


Arg 


Val 


Gly 


Tyr Arg 


Tyr 


Ser 


His 


Leu 


Gly Ser 








260 










265 










270 


Ser 


Leu 


Asn 


Ala 


Leu 


Asp 


Thr 


Arg 


He 


His 


Ser 


His 


Thr 


Ala 


Phe Ho 






275 










260 










285 






Gly 


Phe 


Val 


Thr 


Glu 


Phe 


Tyr 


Arg 


Phe 


Arg 


Gly 


Arg 


Lys 


Ala 


Met Asn 




2 90 










295 








300 






Thr 


Gly 


Arg 


Arg 


Thr 


Ser 


Leu 


Tyr 


Tyr 


His 


Asp 










305 










310 










315 










(2) 


INFORMATION 


FOR 


SEQ 


ID HO: 497 















(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 
(E) TYPE: amino acid 
ID) TOPOLOGY: linear 



(il) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 
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(AJ NAME/ KEY; Rii5c_feature 
(B) LOCATION 1...2 85 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 4 97 



Met 


Val 


Val 


Thr 


Leu 


Leu 


Vdl 


He 


Val 


Gly 


He 


Val 


Val 


Val 


Val 


Arg 


1 


















10 










15 




Tyr 


Ser 


Levi 


Arg 


Val 


His 


Val 


His 


Lys 


Thr 


Gly 


Thr 


Val 


Val 


Ser 


Ala 








20 










25 










30 






Ala 


He 


Phe 


Gly 


Phe 


He 


Lou 


Leu 


Gly 


Ly? 


Thr 


Val 


Pro 


Cys 


Asp 


Thr 






35 










40 










45 








Arg 


Asn 


Phe 


Phe 


Ser 


Ser 


Glu 


Ser 


Asp 


Glu 


Pro 


Glu 


Sec 


Arg 


Val 


Ala 




50 










55 










60 








Thr 


Glu 


He 


Ala 


His 


Leu 


Cys 


Glu 


Tie 


Gly 


Phe 


Gin 


He 


IliS 


Ala 


Ser 


65 










10 










75 










'80 


Ser 


He 


His 


Val 


Ala 


Val 


Arg 


Thr 


Asp 


Phe 


Gly Gin Ala 


Gly 


He 


His 










85 










90 










95 




Cys 


Pro 


MeL 


Ala 


Thr 


Asp 


Ala 


Ser 


Ala 


Thr 


Glu 


Phe 


Asp 


Arg 


Ser 


Ala 








100 










105 










1 10 






Glu 


Cys 


Ala 


Glu 


Arg 


Thr 


Ser 


Ala 


Gin 


lie 


Asp 


Thr 


Ala 


He 


Arg 


Scr 






115 










120 










125 








Gin 


Ser 


Gin 


He 


He 


Arg 


Thr 


His 


He 


Asp 


Thr 


Cys 


Pro 


Lys 


Ser 


Ser 




130 










135 










140 








Gly 


Thr 


He 


Gly 


Gly 


Ser 


Thr 


His 


Thr 


Ser 


Leu 


His 


Leu 


Lys 


Val 


Phe 


145 










150 










155 








160 


Asp 


Gly 


Arg 


Gly 


Glu 


Val 


v31y 


His 


He 


His 


Pro 


Lyr 


Asp 


Gly 


Leu 


Arg 










165 










170 










175 




Phe 


Gly 


Val 


Val 


Glu 


Gly 


Tyr 


Ser 


He 


Gly 


Ser 


Tyr 


Val 


Asp 


Ala 


He 








180 










185 










190 






Gly 


11* 


Gly 


Ala 


Thr 


His 


Ala 


Lys 


Ala 


G3 y 


He 


Ser 


Asd 


Thr 


Arg 


Thr 






195 










200 










205 








Gly He 


Ala 


Gly 


Gly 


Tyr 


His 


Gly 


Gly 


Ser 


Gin 


His 


Gin 


Gin 


He 


Gly 




210 










215 










220 








Asp 


Val 


Thr 


Thr 


He 


He 


Gly 


Leu 


Gly 


Lys 


Phe 


Gly Leu 


Ala 


Asn 


Val 


225 










230 










235 










240 


Gly Val 


Ser 


Asp 


Arg 


Gly 


Phe 


Arg 


Arg 


Gly 


Ala 


Ser 


Ser 


Tyr 


Asp 


Leu 










245 










250 










255 




Tyr 


Gly 


Leu 


Glu 


Leu 


His 


He 


Ala 


Lys 


Thr 


He 


Tyr 


Leu 


Ala 


Val 


Gly 








260 










265 










270 




Asp 


Gly 


Cys 


Leu 


Cys 


Arg 


Gin 


Ala 


Glu 


Arg 


Gin 


Asp Gly 












275 










280 










285 









(2) INFORMATION P0R SEQ ID NO: 4 98 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 99 amino acids 
f B) TYPE: amino acid 
<D) TOPOLOGY: linear 



(11 ) MOLECULE TYPE: protein 

Uli) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(At ORGANISM: Porphyronvonas ginqivalis 

(tX> FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1. . .599 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 4 98 



Met 


Glu 


Asn 


Leu 


Lys 


Asn 


He 


Gin 


Pro 


Arg 


Glu 


Asp 


Phe 


Asn 


Trp 


Glu 


1 








5 










10 










15 




Glu 


Phe 


Glu 


Ala 


Gly 


Gly 


Val 


His 


Ala 


Ala 


Val 


Ser 


Arg 


Gin 


Glu 


Gin 








20 










25 








30 






Glu 


Ala 


Ala 


Tyr 


Arp 


Lys 


Thr 


L~u 


Asn 


Thr 


He 


Lys 


Glu 


Lys 


Glu 


Val 






35 










40 










45 








Val 


Met 


Gly 


Arg 


V«l 


Thr 


Ala 


He 


Asn 


Lys 


Arg 


Glu 


Val 


Val 


He 


Asn 




SO 










55 








60* 










Val 


Gly Tyr 


Lys 


Ser 


Glu 


Gly 


Val 


Val 


Pro 


Ala 


Thr 


Glu 


Phe 


Arg 


Tyr 


65 










70 










75 










80 


Asn 


Pro 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Glu 


Val 


Glu 


Val 


Tyr 


He 


Glu 


Asn 










8b 










90 










95 




Gin 


Glu 


Asd 


Lys 


Lys 


Gly 


Gin 


Leu 


Val 


Leu 


Scr 


HAp 


Arg 


Lys 


Gly 


Arq 
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100 105 no 

Ala Ala Arg Ser Tip Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu 

115 120 125 

He Val Lys Gly Tyr Val Lys cys Arg Thi Lys Gly Gly Met He Val 

130 135 140 

Asp Val Ph- Gly He Glu Ala Phe Leu Pro Gly Ser Gin II- Asp Val 
US 150 tss 160 

Arg Pro He Arg Asp 'iyr Asp Ala Phe Val Glu Lys Thr Met Glu Ph~ 

lps 170 1-75 

Lys II 9 Val Lys He Asn Gin Glu Tyr Lys Asn Val Val Val Ser His 

180 105 j 90 

Lys Val Leu He Glu Ala Glu Leu Glu Gin Gin Lys Lys Glu Ho Il« 

195 200 205 

Gly Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val Lys Asn II- 

210 215 220 

Thr Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val A.*p Gly Leu He 
225 230 235 2 <10 

His He Thr Asp Leu Ser Trp Gly Arg val Ala His Pro Glu Glu He 

2^5 250 255 

Val Gin Leu Asp Gin Lys He Asn Val Val He Leu Arp Phc Asp Glu 

260 265 270 

Asp Arg Lys Arg Ho Ala Leu Gly Leu Lys Gin Leu Met Pro Hi* Pro 

275 280 285 

Trp Asp Ala Leu Asp Ser Glu Leu Lys Vai Gly Asp Lys Val Lys Glv 

290 2?5 300 

Lys Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu He Ala Gin 
305 310 315 3 2 o 

Gly Val Glu Gly Leu He His Val Ser Glu IteL Ser Trp Thr Gin His 

325 330 335 

Leu Arg Ser Ala Gin Asp Phe Leu His Val Gly Asp Glu Val Glu Ala 

340 345 350 

Val He Leu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser Leu Gly Leu 

355 360 365 

Lys Gin Leu Lys Pro Asp pro Trp Ala Asp He Glu Thr Arg Phe Pro 

370 375 380 

Val Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr Asn Phe Glv 
385 390 395 400 

Val Phc Val Glu He Glu Glu Gly Val Asp Gly Leu He His He Ser 

405 410 4 i5 

Asp Leu Ser Trp Thr Lys Lys He Lys His Fro Ser Glu Phe Thr Glu 

420 425 430 

Val Gly Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn 

J 35 440 445 

Arg Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu Asn Pro Trp Asp 

450 455 460 

Val Phe Glu Thr Val Phe Thr Val Gly Ser He His Glu Gly Thr Val 
465 47 0 475 480 

He Glu Val Met Asp Lys Gly Ala Val Val Ser Leu Fro Tyr Gly Val 

405 490 495 

Glu Gly Phe Ala Thr Pro Lys His Met Val Lys Glu Asp Gly Ser Gin 

500 505 510 

Ala Val Leu Glu Glu Lys Leu Pro Phe Lys Val He Glu Phe Asn Lvs 

B15 520 525 

Asp Ala Lys Arg He He Val Ser His Ser Arg Val Phe Glu Asp Glu 

530 535 54 0 

Gin Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu Ala 
545 550 555 560 

Lys Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro Ala Gin 

565 57 0 sv b 

Ala Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu 

580 585 590 

Lys Glu Lys Leu Ser Glu Asn 
595 



(2) Itl FORMATION FOR SEQ ID NO: 4 99 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 550 ammo acids 
(&) TYPE: amino acid 
(D) TOPOLOGY: 11 near 

til) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



iL v ™ « Cij -Ji, 413 i„ H 413? *U % KJl 
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(V1) ORIGINAL, SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(IX) FEATURE: 
5 (A) MAHE/KETi : roisct eature 

(B) LOCATION 1 550 

(xi| SEQUENCE DESCRIPTION: SEQ ID NO: 4 99 

10 Met Gly Arg Val Thr Ala He Aan Ly^ Arg Glu Val Val lie Asn Val 
15 10 15 

Gly Tyr Lys Ser Glu Gly Val Val Pru Ala Thr Glu Phe Arg Tyr Asn 

20 25 30 

Pro Glu Leu Lys Val Gly Asp Glu Val Glu Val Tyr He Glu Asn Gin 
15 35 40 45 

Glu Asp Lys Lys Gly Gin Leu Val Leu Ser Hi& Arg Lys Gly Arg Ala 

50 55 GO 

Ala Arg Ser Trp Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu He 
65 70 75 80 

20 Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly HeL He Val Asp 
85 90 95 

Val Phe Gly Tie Glu Ala Phe Leu Pro Gly Ser Gin He Asp Val Arg 

100 105 HO 

Pro He Arg Asp Tyr Asp Ala Phe Val Glu Lys- Thr Met Glu Phe Lye 

us 120 125 

He Val Lys He Asn Gin Glu Tyr Lys Asn Val Val Val Ser His Lys 

130 135 140 

Val Leu He Glu Ala Glu Leu Glu Gin Gin Lys Lys Glu He He Gly 
„ n 146 150 155 160 

OU Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val Lys Asn He Thr 
165 170 175 

Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp Gly Leu He His 

100 185 190 

lie Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu He Val 
«*5 195 200 205 

Gin Leu Asp Gin Lys He Asn Val Val He Lsu Asp Phe Asp Glu Asp 

210 215 220 

Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met Pro His Pro Trp 
225 230 235 240 

40 Asp Ala Leu Asp Ser Glu Leu Lye Val Gly Asp Lys Val Lys Gly Lys 
245 250 255 

Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu He Ala Gin Gly 

260 265 270 

Val Glu Gly Leu Ho His Val Ser Glu Met Ser Trp Thr Gin His Leu 
45 2Vb 280 285 

Arg S-r Ala Gin Asp Phe Leu His Val Gly Asp Glu Val Glu Ala Val 

290 295 30O 

He Leu Thr Leu Asp Arg Glu Glu Arg Lys I let Ser Leu Gly Leu Lyr 
305 310 315 320 

DU Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr Arg Phe Pro Val 
325 330 335 

Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr Asn Phe Gly Val 

310 345 350 

Phe Val Glu He Glu Glu Gly Val Asp Gly Leu He His He Ser Asp 
55 355 360 365 

Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu Phe Thr Glu Val 

370 375 380 

Gly Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn Arg 
38 * 390 395 4 00 

Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu Asn Pro Trp Asp Val 

405 410 415 

Phe Glu Thr Val Phe Thr Val Gly Ser He His Glu Gly Thr Val He 

420 425 430 

Glu Val Met Asp Lys Gly Ala Val Val Ser Leu pro Tyr Gly Val Glu 
fc>5 435 440 445 

Gly Phe Ala Thr Pro Lys His Met Val Lys Glu Asp Gly Ser Gin Ala 

450 455 460 

Val Leu Glu Glu Lys Leu Pro Phe Lys Val He Glu Phe Asn Lys Asp 
- n 465 470 475 -- 400 

/U Ala Lys Arg He He Val Ser His Ser Arg Val Phe Glu Asp Glu Gin 
485 490 495 

Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu Ala Lys 

500 505 5io 

Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro Ala Gin Ala 
/5 515 520 525 



60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



451/490 



Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu 7*1 a Ala Leu L"t- 

530 535 540 

Glu Lys Leu Ser Glu Asn 
545 550 

(2 J IHFOR1IATIOII FOR SEQ ID NO: 500 

(11 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

tiij MOLECULE TYPE: pro L- in 

(ill ) HYPOTHETICAL: YES 

(vij OJUG1IIAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE ; 

(A) NAME /KEY : masc_f mature 
(DJ LOCATION 1. . .4 50 

Ixi) SEQUEUCE DESCRIPTION: SEQ ID WO: 500 

I1et He Val Asp Val Phe Gly He Glu Ala Phe Leu Pro Gly S~r Gin 

1 5 10 15 

He Asp Val Arg Pro II- Arg Asp Tyr Asp Ala Phs Val Glu Lys Thr 

20 25 30 

lief Glu Phe Lys He Val Lys He Asn Gin Glu Tyr Lys Asn Val Val 

35 40 45 

Val Ser His Lys Val Leu lis Glu Ala Glu Leu Glu Gin Gin Lvs L—» 

50 55 60 

Glu He He Gly Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val 
« 5 t0 75 80 

Lys Asn He Thr Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp 

85 go 95 

Gly Leu He His He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro 

100 105 no 

Glu Glu He Val Gin Leu Asp Gin Lys He Asn Val Val He Leu Asd 

115 120 125 

Phe Asp Glu Asp Arg Lys Arg He Ala Lsu Gly Leu Lys Gin Leu H«t 

130 135 140 

Pro His Pro Trp Asp Ala Leu Asp Ser Glu Leu Lys Val Gly Asp L** 
145 150 155 160 

Val Lys Gly Lys Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu 

165 ivo 175 

He Ala Gin Gly Val Glu Gly Leu He His Val Ser Glu Met Ser Trp 

180 185 190 

Thr Gin His Leu Arg Ser Ala Gin Asp Phe Leu His Val Gly Asp Glu 

195 200 205 

Val Glu Ala Val Tl» L*3U Thr Leu Asp Aro Glu Glu Arg Lys Met Ser 

210 215 220 

Leu Gly Leu Lys Gin Leu Lys Pro Asp Pro Trp Ala Asp 11° Glu Thr 
225 230 -35 240 

Arg Phe Pro Val Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr 

245 250 255 

Asn Phe Gly Val Phe Val Glu He Glu Glu Gly Val Asp Gly Leu II- 

260 265 270 

His He Ser Asp Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu 

275 280 2S5 

Phe Thr Glu Val Gly Ala Asp He Glu Val Gin Val Tie Glu II- Asp 

200 2?G 300 

Lys Glu Asn Arg Arq Leu Ser Leu Gly His Lys Gin Leu Glu Glu A-n 
305 310 31S 32r , 

Pro Trp Asp Val Phe Glu Thr Val Phe Thr Val Gly Ser II* His Glu 

32 - , 330 335 

Gly Thr Val He Glu Val Het Asp Lys Gly Ala Val Val Ser Leu Pro 

310 34 5 - 3$r. 

Tyr Gly Val Glu Gly Phe Ala Thr Pro Lys His Met Val Lys Glu Asp 

355 360 3<=S 

Gly Ser Gin Ala Vai Lsu Glu Glu Lys Leu Fro Phe Lys Val 11^ Glu 

3T0 375 380 

Phe Asn Lys Asp Ala Lys Arg He He Val Scr Hi:? Scr Ara Val Ph- 
385 3?0 305 ' 40 " 
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Glu Asp Glu Gin Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arq Lys 

405 410 415 

Ala Glu Ala Lys Ala Ala Gin Lys Glu Ala Ala Aid Glu Ala Ala Asn 

4?0 42b 430 

Pro Ala Gin Ala Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu 

43b 4 40 44 

Ala Ala Leu Lys Glu Lys Leu Ser Glu Asn 
■150 4 5b 

(2) IIIFOR11ATION FOR SEQ. JO HO: 501 

ID SEQUENCE CHARACTERISTICS; 

(A) T.ElfGTH: 426 ammo acids 
iB) TYPE : amino acid 
(DJ TOPOLOGY: linear 

tli) IIOI.RCUJ.E TYPE: protein 

(lii) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM; Forphyromonas gingivalis 

<i>:) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1...426 

Ixi) SEQUENCE DESCRIPTION: SEQ ID MO: 501 

Met Glu Phe Lys lis Val Lys lie Asn Gin Glu Tyr Lys Asn Val Val 

15 io x5 

Val Ser His Lys Val Leu lie Glu Ala Glu Leu Glu Gin Gin Lys Lvs 

20 2 b 30 

Glu He He Gly Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val 

35 40 45 

Lys Asn He Thr Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp 

SO 55 60 

Gly Leu lie His He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro 
65 70 75 80 

Glu Glu lie Val Gin Leu A^p Gin Lys- He Asn Val Val Ho Lou Asp 

85 so ?5 

Phe Asp Glu Asp Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met 

100 105 no 

Pro His Pro Trp Asp Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Lys 

115 120 12b 

Val Lye Gly Lys Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu 

130 135 140 

He Ala Gin Gly Val Glu Gly Leu lis His Val Ser Glu Met Ser Trp 
150 155 160 

Thr Gin His Leu Arg Ser Ala Gin Asp Phe Leu His Val Gly Asp Glu 

165 170 175 

Val Glu Ala Val He Leu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser 

180 18b 190 

Leu Gly Leu Lys Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr 

195 200 205 

Arg Phe Pro Val Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr 

210 215 220 

Asn Phe Gly Val Pne Val Glu He Glu Glu Gly Val Asp Gly Leu II* 
225 230 235 2 40 

His He Ser Asp Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu 

24 b 250 ?S5 

Phe Thr Glu Val Gly Ala Asp He Glu Val Gin Val He Glu He Asp 

260 265 270 

Lys Glu Asn Arg Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu A-n 

275 280 2 85 

Pro Trp Asp Val Phe Glu Thr Val Phe Thr Val Gly Ser II* His Glu 

250 295 300 

Gly Thr Val He Glu Val Met Asp Lys Gly Ala Val Val Ser Leu Pro 
JOb 310 315 320 

Tyr Gly Val Glu Gly Phe Ala Thr Pre Lys His Met Val L^s Glu Asp 

325 330 335 

Gly Ser Gin Ala Val Leu Glu Glu Lys Leu Pro Phe Lys Val 11° Glu 

34 0 34 5 350 

Phe Asn Lys Asp Ala Lys Arg He He Val Ser His Ser Ara Val Ph- 
355 360 3 6 5 
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Glu 


Affp 
37 0 


Glu 


Gin 


Lys 


Met 


Ala 
375 


Gin 


Arg 


Glu 


Ala 


Aid 


Glu 


Ala 


Lys 


Ala 


Ala 


Gin 


Lys 


Glu 


Al a 


Al a 


305 










390 








395 


Pro 


Ala 


Gl n 


Al a 


Val 


Glu 


Lys 


Ala 


Thr 


Leu 


Gly 










405 










410 


Ala 


Ala 


Leu 


Lys 
420 


Glu 


Ly^ 


Ltsu 


Ser 


Glu 
425 


Asn 





la Ala Asn 
400 



(2) INFORMATION FOR SEQ ID MO: 502 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 0 amino acids 

(B) TVPF. : amino acid 
(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(Vlj ORTGTNAL SOURCE : 

(A) ORGANISM: Porphyromonas ginqivalis 

I ix J FEATURE : 

(A) NAI IE/KEY: tniscjeature 

(B) LOCATION 1...240 

(xij SEQUENCE DESCRIPTION: SEQ ID HO: 502 

Hnt Lys Lys Ala He Leu Ser Gly Ala Ala Leu Leu Leu Gly L-u Cys 

15 10 !b 

Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Gly His Ser II- 

20 25 30 

Tyr A.-p Glu Leu Asp Gly Arg Pro Lys Leu Thr Thr Thr Val Glu Aeii 

35 4G 45 

Phe Thr Pro Asp Lys Trp Gly Ser Thr Phe Phe Phe He Asp Met Asp 

SO 55 60 

Tyr Thr Gly Lys Gly He Gin Ser Ala Tyr Trp Glu He Ser Arg Glu 
65 70 75 BO 

Leu Lye Phe Trp Gin Ala Pro Val Ser He Ills Leu Glu Tyr Asn Gly 

85 go os 

Gly Leu Ser Thr Ser Phe Thr Phe Gly His Asp Ala Leu He Gly Ala 

100 105 no 

Thr Tyr Thr Tyr Asn Asn Pro Ser Phe Thr Aro Gly Phe Thr He Thr 

115 120 " 125 

Pro Het Tyr Lys His Leu Gly Ala His Asp Phe His Thr' Tyr Gin He 

130 i3.s 14n 

Thr Gly Thr Trp Tyr Uet His Pho Leu Asp Gly Leu Leu Thr Phe A-n 
145 15y 1£5 160 

Gly Phe Leu Asp Leu Trp Gly Phe Pro Gin Glu Asn Pro II- Gly Gl*- 

165 170 17 5 

Pro Val Leu Lys Glu Gly Arp Lys Phe Val phe Leu Ser Glu Pro Gin 

190 185 190 

Phe Trp He Asn Leu Asn Arg He Lys Gly He Asp Lys Arp Phe Asn 

195 200 205 

Leu Ser He Gly Thr Glu Met Glu He Ser Arg Asn Phe Ala Arg Met 

210 215 220 

Asp Lys Phe Ser Cys He Pro Thr Leu Ala Val Lyr Trp Thr Phe Asn 
225 230 235 240 

12) INFORMATION FOR SEQ ID HO: 503 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 34 amino acids 
(B> TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii> MOLECULE TYPE: protean 

(iii) H/POTHETICAL: YES 

(VlJ ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

fix) FEATURE: 
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(A) IIAHE/KEY: miccf oat ur<= 
(B> LOCATION 1...4 34 

(xi) SEQUENCE DESCRIFTIOM: SEQ ID 110:503 



Met 


Tyr 


Lys 


Asp 


Tyr 


Lys 


Gly 


Leu 


Tyr 


Ala 


Set 


Leu 


Arg 


Trp 


Tyr 


Ala 


1 








5 










10 










15 




Leu 


lie 


lie 


Gly 


Leu 


Leu 


Phe 


Ala 


Ala 


Asp 


Gly 


lis 


Gin 


Ala 


Gin 


Asn 








20 










25 










30 






Asn 


Asn 


Phe 


Thr 


Glu 


Ser 


Pro 


Tyr 


Thr 


Arg 


Phe 


Gly 


Leu Gly Arg 


I.eu 






35 










40 










4b 








Gly 


Glu 


Arg 


Thr 


Thr 


He 


Ser 


Gly 


His 


Ssr 


Ilet 


Gly 


Gly 


Leu 


Gly 


Val 




50 










55 










60 










Gly 


Leu 


Arq 


Gin 


Gly 


Thr 


Tyr 


Val 


Asn 


Ala 


V-l 


Asn 


Pro 


Ala 


Ser 


Tyr 


65 










70 










7 b 










80 




Ala 


Val 


Asp 


Ser 


Het 


Thr 


Phe 


He 


Phe 


Asp 


Phe 


Gly 


Ala 


Ser 


Thr 










85 










90 










95 




Gly 


He 


Thr 


Trp 


Ty r 


Ala 


Glu 


Asn 


Gly 


Lys 


Lys 


A^p 


Asn 


Arg 


Lys 


Met 








100 










105 










110 






Gly 


Asn 


He 


GlU 


Tyr 


Phe 


Ala 


Met 


Leu 


Phe 


Pro 


He 


Ser 


Lys 


Ser 


He 






115 










120 










125 








Ala 


tfet 


Ser 


Ala 


Gly 


Val 


Leu 


Pro 


Tyr 


Sor 


Ala 


Ser 


Gly Tyr 


Gin 


Phe 




130 










135 










no 










Gly 


Ser 


Val 


Asp 


Gin 


Val 


Glu 


Gly 


Gly 


Ser 


Val 


Gin 


Tyr 


Thr 


Ary 


Lys 


145 










150 










155 










160 


Tyr 


Leu 


Gly 


Thr 


Gly 


Asn 


Leu 


Asn 


Asp 


Lc-j 


Tyr 


Val 


Cly 


He 


Gly 


Ala 










165 










no 










175 




Thr 


Pro 


Fhc 


T.-/s 


Asn 


Phe 


Ser 


He 


Gly 


Ala 


Asn 


Ala 


Ser 


Ser 


Leu 


Phe 








180 










185 










190 






Gly 


Arg 


Phe 


Thr 


His 


Ser 


Arg 


Gin 


Val 


He 


Phe 


Ser 


Thr 


Glu 


Ala 


Pro 






195 










200 










205 








Tyr 


Asn 


Pro 


Val 


His 


Leu 


Ser 


Thr 


Leu 


Tyr 


Leu 


Lys 


Ala 


Ala 


Lys 


Phe 




210 










215 










220 










Asp 


Phe 


Gly 


Met 


Gin 


Tyr 


His 


Leu 


Leu 


Leu 


Lys 


Ser 


Asp 


Arg 


Ser 


Leu 


225 










230 










23S 










2-10 


Val 


lie 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Arq 


Val 


Lys 


I1et 


His 


Ser 


GlU 


Leu 










245 










250 










255 




Thr 


Gin 


He 


Lys 


Asn 


Gin 


Val 


Gin 


Asn 


Gly 


Val 


Val 


Val 


Glu 


Ser 


Glu 








260 










2 65 










27 0 






Thr 


Gin 


Glu 


Tyr 


He 


Lys 


Gly 


Met 


Asp 


Tyr 


Tyr 


Thr 


Leu 


Pro 


His 


Thr 






27 5 










280 










285 








Leu 


Gly 


He 


Gly 


Phe 


Ser 


Tyr 


Glu 


Lys 


Lys 


A.3p 


Lys 


Leu 


Leu 


Leu 


Gly 




290 










295 










300 










Ala 


Asp 


Val 


Gl n 


Tyr 


Ser 


Lys 


Trp 


Lys 


Gly 


Glu 


Lys 


Phe 


Tyr 


Lys 


Ser 


305 










310 










315 










320 


Asp 


Cys 


Lys 


Phe 


Gin 


Asp 


Arg 


He 


Arg 


Val 


Ser 


Leu 


Gly Gly Glu 


lie 










325 










330 










335 




lie 


Pro 


Asp 


Tie 


Asn 


Ala 


Val 


Gly 


Het 


Trp 


Pro 


Lys 


Val 


Arg 


T} r 


Arg 








34 0 










345 










350 






Phe 


Gly 


Leu 


His 


Gly 


Glu 


Asn 


Ser 


Tyr 


Leu 


Lys 


Val 


Pro 


Thr 


Lys 


Gly 






355 










360 










365 




Gly 


Val 


Tyr 


Gin 


Gly 


Tyr 


His 


He 


Val 


Gly Ala 


Val 


Pile 


61 y 


He 


Gly 




370 










375 










300 






He 


Pro 


Leu 


Asn 


Asp 


Arg 


Arg 


Ser 


Phe 


Val 


Asn 


Val 


Ser 


Leu 


Glu 


Tyr 


365 










390 










395 










400 


Asp 


Arg 


Leu 


He 


Pro 


Lys 


Glu 


Gly 


Met 


lie 


Lys 


Glu 


Asn 


Ala 


Leu 


Lys 










405 










410 










415 




Leu 


Thr 


Phe 


Gly 


Leu 


Thr 


Phe 


Asn 


Glu 


Ser 


Trp 


Phe 


Lys 


Lys 


Leu 


Lys 



42C 425 430 



Leu Asn 



(2) INFORMATION FOR SEQ ID NO: 504 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 6 amino acids 
(BJ TYPE; amino acid 
(U) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

Uil) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE : 

I A) ORGANISM: Porphyromonas gingival if 
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{ ix t FEATURE : 

(A) NAME/KEY: mlsc_f eature 

(B) LOCATION 1 ... 926 



(>:i) SEQUENCE DESCRIPTION; SEQ ID NO: 504 



He L 


Aiu 




He 


Tyr 


Gin 




Leu 


Leu 


Scr 


lie 


Leu 


Leu 


Ala 


S a JC 


Leu 










5 










1 0 














Gl y 




Val 


Gly 


LRU 


Gill 


Ala 


Gin 


Gin 


Ala 


Gl*' 


Val 


Al a 


Gl*- 


Arg 


Val 








20 










PS 










3*^ 






eu 




Glu 


Glu 


Gly 


Asn 


Pro 


Met 


He 


Gin 


Al a 


A-^n 


Val 


Gin 


eu 


Val 






35 










4 0 










45 








Gin 


Ser 


Thr 


Gly 


Gin 


Val 


Ala 


Val 


Ala 


Ala 




Ala 


Thr 


Asn 




Lys 




£0 










55 










60 










Leu 


Phe 


Ser 


Leu 


Lys 


Thr 


Ser 


Gl n 


Glu 


Gl y 


Asp 


Tyr 


He 


Leu 


Arg 


6b 










IQ 










7 5 










80 


Val 




Tyr 


Val 


Gly Tyr 


Thr 


Thr 


His 


Asp 


Glu 


Lys 


He 


Ser 


Leu 


Arg 










85 










90 










95 






\j± y 


r\ 


Thr 


He 


Thr 




Lys 


A<t 


He 


Ser 


Met 


Asn 


Glu 


Asp 


Ala 








100 










1 05 










110 




Arg 


Leu 


i 


Gin 


Ser 


Val 


Thr 


Val 


Gin 


Ala 




Ala 


Ala 


Glu 


Val 


Val 
















120 










125 








Val 


Ara 




Asp 


Thr 


Leu 


Glu 


Phe 




Ala 


i*l y 


*' " 


Tyr 


Thr 


Val 


Ala 














135 










14 0 










Gin 


Gl" 


Ala 


Ser 


He 


Glu 


Glu 


Lfcu 


He 


Lys 


x-ys 


Leu 


Pro 


Gly 


Ala 


Glu 












150 




















1G0 


lie 




er 


Asp Gly 


Lys 


He 


r 




i ^n 


-i^ 
uii 




Asp 


lie 


Ser 


Lys 










165 




















175 




lie 


eu 


Val 


Asp 


Gly 


Lys 


Glu 


Phe 






Lys 


Asp 


Pro 




Vdl 


Ala 








180 










105 










1 qn 






He 


ye 


Asn 


Leu 


Pro 


Ala 


Asp 


Met 


Va 1 


sn 


T.ys 


Val 


Gin 




Leu 


Asn 






1 95 










200 










2 05 








Lyr 


Leu 


Ser 


Glu 


Leu 


Ser 


Arg 


Met: 


S~r 


Gl 

o y 


Phe 


Asp 


Asp 




Glu 


Glu 




210 










215 










220 










Glu 


Thr 


Val 


He 


Asn 


Leu 


Thr 


Val 


Lv^ 


Pro 


Glu 


T.vs 






Gly 


Leu 


22b 










230 










235 








240 


Phe 


Gly 


Thr 


Leu 


Gin 


Ala 


Gly 


ryr 




Thr 


A&p 


Gin 




Tyr 


Met 


Ala 










245 




















2S5 




Gl v 
la 4.1 


Gly 


Asn 


Val 


Asn 


Arg 


Phe 


"P 


vij.y 


Asn 


Lyj* 


Gin 


Trp 


Thr 


Leu 


He 








260 










265 










270 






Gly 


ser 


Ala 


Asn 


Asn 


Thr 


Asn 


Asn 


Met 


Gl v 


Phe 


Ser 


GlU 


Met 


Asp 


Ser 






275 










280 








205 








Glu 


Het 


Gly 


Scr 


Met 


Thr 


Phe 


Phe 


Ser 


Pro 


Gin 




Gly 


Gl 
yjJ - i 


Arg 


Arg 




290 










295 










300 










Gly 


Phe 


Clo- 


Asn 


Sei- 


Gly 


Gly 


Val 


Thr 


Ser 


Set 


Ser 


Met 


Leu 


Gly Gly 


305 










310 










315 










320 


Asn 


Phe 


ser 


Val 


Sln 


Phe 


Ser 


Ser 


Ala 


Leu 


Asn 


Thr 


Gly 


Gly 


Asp 


Ala 










325 










330 










335 




Arg 


Tyr 


Gly 


Tyr 


Asn 


Asp 


Lys 


Ala 


He 


Glu 


Thr 


Thr 


Lys 


Arg 


Val 


Glu 








340 










345 








350 






Asn 


He 


Leu 


Ala 


Glu 


Gly Asn 


Thr 


Tyr 


Met 


Asp 


Glu 


Asn 


He 


Leu 


Glu 






355 










360 










365 








Arg 


Ser 


Plie 


Ser 


His 


Asn 


Gly 


Gin 


Ala 


Arg 


Phe 


Arg 


Met 


Gin 


Trp 


Lys 




370 










375 










380 










Pro 


Ser 


Glu 


Arg 


Thr 


Glu 


Val 


Val 


Phe 


Glu 


Pro 


Asp 


Leu 


Ser 


He 


Ser 


385 










390 










395 










400 


Lys 


He 


Asp 


Gly 


Phe 


Phe 


Asn 


A?p 


Thr 


Tyr 


Glu 


Thr 


Lys 


Asp 


Ala 


Thr 










405 










410 








415 




Gly 


He 


Ser 


He 


Asn 


Lys 


Gly 


Ser 


He 


His 


Gin 


Thr 


Thr 


Gin 


Gly 


Asn 








420 










425 










430 






Asn 


Ph« 


Arg 


Leu 


Asn 


Gly Glu 


Leu 


Arp 


11- 


Ser 


Hla 


Lys 


T.eu 


Asn 


Asp 






435 










440 










445 






Glu 


Gly Arg 


Thr 


He 


Ser 


Ala 


Ser 


Val 


Ser 


Gly 


Gly 


Leu 


The 


Asp 


Glu 




450 










4 55 










460 








Asp 


Gly 


Asp 


Gly Ho 


Tyr Gin 


Ala 


Val 


Leu 


Gin 


Ser 


Val 


Glu 


Thr 


Asn 


465 










470 










475 










480 


Gin 


Lys 


Gin 


Phe 


Asn 


Asp 


Asn 


Ser 


Asn 


Leu 


Gin 


Tyr 


Arg 


Leu 


Arg 


Leu 










485 










4 90 














Ser 


Tyr 


Val 


Glu 


Pro 


Leu 


Gly 


Lys 


Asn 


Tyr 


Phs 


Ala 


Gl n 


Al 5 


Tie 


T.eu 








500 










£05 








510 






Asn 


Arg 


Arg 


Phe 


Ser 


Arg 


Arg 


Asn 


Ser 


Asp 


Arg 


Glu 


Val 


Tyr 


Arg 


Leu 






515 










520 










525 








Gly 


Asp 


Asp 


Gly 


Cln 


Tyr 


Ser 


He 


Leu 


Asp 


Ser 


Gin 


Tyr 


Gly 


Leu 


Ser 




530 










535 










54 0 
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Tyi Scr Arfii Glu Phe Thr Gin Tyr Arg II- Gly L-u Asn Leu Lys Ly- 
550 555 c * 0 

lie Ala Lys Thr Trp Asp Tyr Thr Val Gly Phe Asn Val Asp Pro As,, 

565 570 575 

Arg Thr Val Ser Tyr Arg Ser Val Ala Gly Val Glu Gin Asp Lys Leu 

580 585 590 

Ala Phe Asn Arg Val Asn Leu i'er Pro Met Leu Arg He Asn Tyr Lv- 
S95 600 605 

ffS Ar9 Thr Thr A£M Leu Ar ^ Val Af P T l' r Arg Gly Arg Thr Thr 

610 615 6 20 

Gin Pre Ser lie Asn Gin He Ala Pro Val Gin Asp He Thr Asn Pro 

630 635 &il0 

Leu Phe Vai Thr Glu Gly Am Pro Gly Leu Lys Pro Ser Tyr Ser Asn 

Asn Val Met Ala Met Phe Ser As P Phe Asp Aia Lys Ser Gin "g Ala 

6<5 ° 665 e70 

Phe Asn II* Val Phe Phe Gly Asn Tyr Thr Ph^ Asp Asp He Val Pro 

680 685 
Asn Thr JUs Tyr Asp Pro Ser Thr Gly He Arg Thr Thr Ara Tyr Glu 

695 700 
Asn Ala Ser Gly Thr Trp Gin Ala Asn Leu His Gly Thr Leu Ser Leu 
* Lb 710 715 720 

Pro Leu Lys Asn Arg Ala Phe Ser Phe Arg IleL Ser Leu Pae Asn Arg 

725 735 
Leu Ala Glu Gly Gin Sc*r Fhe He Asn Asp Asp Lys Asn Lys Ala Leu 

740 7 45 750 
Ser Fne Arg Thr Arg Glu Arg Leu Thr Leu Thr Tyr Arg Am Asn Trp 
755 760 * 



7 61 



11= Ac P Thr Ser He Gly Gly Asn lie Gly Phe Tyr Met Ala Asn Asn 



775 780 



< n *' 

785 790 ^ ASP Gly GlY ASn 

Tyr Gin Val Ala Leu Thr Leu p ro Tyr Gly Phe Arq He Asp s=r Asp 

805 810 81S 

Val Glu Tyr Asn Thr Asn Ser Gly Tyr Ser Gly Gly Phe Ser Leu Asp 

820 825 B30 

Glu Trp Leu Trp Asn Ala Ser Leu Ser Tyr Ser Phe Leu Arg Asp Ly- 

83 ° 840 645 

Ala Gly Thr Leu Arg Val Asn Gly Tyr As P He Leu Gly Gin Arg Ser 

850 655 060 

Ser He Sec Arg Ser Ala Ser Ala He Asn He Glu Glu Ser Met: s-r 
65 870 875 B80 

Asn Thr He Gly Arg Tyr Val Met V«a Asp Phe He Tyr Arg Ph~ A-n 

005 890 8 q 5 

Ala Phe Ser c.ly Gly Gly Ser Arg Ser Asp His Gin Arg Gly Asn Met 

900 905 9X0 

Asn Mi-g Pro Gly Pro Pro Phe Gly Gly Gly Arg Arg Pro Scr 
915 920 o 2 5 

{21 INFORMATION FOR SEQ TD HO: 505 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPF. : amino acid 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

{ili) HYPOTHETICAL: YES 

(vl ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

UX) FEATURE: 

(A) HAME/KEY: mlscfeature 

(B) LOCATION 1 4 00 

C:i) SEQUENCE DESCRIPTION: SEQ ID ?IO:505 

Met Val Met Lys Leu He Lys Arg Ser Leu Leu Leu" Leu Gly Ala Val 

10 X5 
Leu Leu tie Thr Leu Pro Ala Tyr Ser Gin hsn Asp Asp He Phe Glu 

Asp Asp He Tyr Thr Ser Ary Lys Glu Tie Arg Lys Gin Asn Gin Val 
35 40 45 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



457/490 



Ly3 


Asp 


Trp 


Gin 


Asn 


Gin 


Glu 


Asp 


Gly 


Tyr 


Gly 


Asp 


Asp 


Thr 


Glu 


Tyr 




50 










55 










60 








Thr 


Val 


Al a 


Ser 


Asp 


Arg 


Asp 


He 


Asp 


Ala 


Tyr 


Asn 


Arg 


Arg 


Asp Gly 


65 










7 0 










75 










80 


Gin 


Ser 


Tyr 


Asp 


Gly 


Lys 


Lys 


Leu 


Ser 


Lys 


Asp 


Lys 


Lys 


Arg 


Asp 


Ser 










85 










90 










95 




Thr 


Arg 


Ser 


Ser 


Val 


Pro 


Gly Axg 


Tyr 


Ser 


Arg 


Arg 


Leu 


Ala 


Arg 


Phe 








100 










105 










110 






Tyr 


Lyrr 


Pro 


Aen 


Thr 


He 


Val 


He 


Ser 


Gly Ala 


A^p 


Asn 


Val 


Tyr 


Val 






115 










1 20 










125 








Thr 


Asp 


Asp 


Gly 


Glu 


Tyr 


Phe 


Val 


Tyr 


Sly 


Asp 


Glu 


Tyr 


Tyr 


Asp 


Asp 




130 










135 










14 0 










Ala 


Ser 


Ser 


Vol 


Asn 


He 


Tyr 


He 


Asn 


Ser 


Pro 


Trp 


Cys 


Asp 


Pro 


Phe 


145 










150 










155 










160 


Pro 


Tyr 


Thr 


Ser 


Trp 


Tyr 


Pro 


Ser 


Pho 


Scr 


Gly 


Trp 


Tyr 


Asn 


Tyr 


Thr 










165 










170 










175 




Trp 


Asn 


Tyr 


Pro 


Trp 


Phe 


Tyr 


Tyr 


Gly 


Ser 


His 


He 


Gly Trp Gly 


Gly 








180 










165 










190 






Tyr 


Tyr 


Tro 


Gly Tyr 


A.«*n 


Trp 


Tyr 


Trp 


Ser 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Phe 






195 










200 










205 








Tyr 


Asn 


Pro 


Tyr 


Gly 


He 


Gly 


Hat 


Gly Trp 


Gly 


Tyr 


Pre 


Tyr 


Gly 


Trp 




210 










215 










220 










Gly 


Ser 


Tyr 


Tyr 


Gly 


Trp 


Gly 


Gly 


Tyr 


Pro 


Gly 


Val 


He 


His 


Ills 


Tyr 












230 










235 










240 


His 


His 


Tyr 


Pro 


Lys 


Lys 


Thr 


Tyr 


Ser 


Asn 


Gly 


Gin 


His 


Ser 


Gly 


Ala 










2*15 










250 










255 




Tyr 


Tyr 


Ser 


Tyr Gly 


Arg 


Pro 


Asn 


Arg 


Tie 


Lys 


GJy 


Gly 


Thr 


Ser 


Gly 








260 










265 






270 






Ala 


Lys 


Leu 


Gly Thr 


Gly Arg 


Tyr 


Asp 


Arg 


He 


Gin 


Asn 


Ser 


Ser 


Ser 
















200 










285 








Gin 


Lys 


Asn 


Lys 


Phe 


Gly 


Leu 


Gin 


Ser 


A;?n 


Lys 


Pro 


Asn 


Asn 


Asn 


Leu 




290 










295 








300 










Gin 


Asa 


Val 


I»V3 


ser 


Gly 


Arg 


Thr 


Gly 


Arg 


Ala 


Asn 


Arg 


Asp Arg Asn 


305 










310 










315 










32G 


He 


Glu 


Thr 


Val 


Thr 


Pro 


Asn 


Asn 


Gly 


Gin 


Lys 


Gin 


Asn 


Arg 


Pro 


Val 










325 










330 








335 




Phe 


Gin 


Gin 


Asn 


Gin 


ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


Gly 


Arg 


Asn 


He 








340 










34 5 










350 






Arg 


Ser 


Glu 


Arg 


Gin 


Gly 


Glu 


Acn 


Asn 


Asp 


Arg 


Thr 


Phe 


Ser 


Thr 


Pro 






355 










360 










365 








Ser 


Arg 


Ser 


Asn 


Ser 


Asn 


Gly Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 




370 










375 










360 








Ser 


Gly 


Ser 


net 


Scr 


Gly Gly Gly Gly Arg 


Ser 


Gly 


Arg 


Gly 


Arg 


Asn 


385 










390 










355 










400 



|2 J INFORMATION FOR SEQ ID NO: 50G 

<i) SEQUENCE CHARACTERISTICS: 

(A( LENGTH: 398 amino acids 
(B) TYPE: anu.no acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphynomonae gingival is 

( lx } FEATURF. : 

(A) NAT IE /KEY : mlsc_f eaturt; 
(B» LOCATION 1 . . .398 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



Met 


Lys 


Leu 


He 


Lys 


Arg 


Ser 


Leu 


Leu Leu 


Leu 


Gly 


Ala 


Val 


Leu 


Leu 


1 








5 








10 










lb 




He 


Thr 


Leu 


Pro 


Ala 


Tyr 


Ser 


G)n 


Ann Asp 


Asp 


He 


Phe 


Glu 


Asp 


Asp 








20 










25 








30 


He 


Tyr 


Thr 


Ser 


Arg 


Lys 


Glu 


He 


Arg Lyt- 


Gin 


Atf*n 


Gin 


Val 


Lys 


Asp 






35 










4 0 








45 






Trp 


Gin 

50 


Asn 


Gin 


Glu 


Asp 


Gly 
55 


Tyr 


Gly Asp 


Asp 


Thr 
60 


Glu 


Tyr 


Thr 


Val 


Aid 


Ser 




Arg 


Asp 


He. 


App 


Ala 


Tyr Asn 


Arg 


Acg 


Asp 


Gly 


Gin 


Ser 



65 7 0 7S 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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458/490 

Tyr Asp Gly Lys Lys Leu Ser Lys Asp Lys Lys Arg Asp Ser Thr Arg 



65 go 



95 



Ser Ser Val Pro Gly Arg Tyr Ser Arg Arg Lou Ala Aro Phe Tvr T.ys 

loo io5 "no 

E'ro Asn Thr He Val He Ser Gly Ala Asp Asn Val Tyr Val Thr Asp 

115 120 125 

Asp Gly Glu Tyr Phe Val Tyr Gly Asp Glu Tyr Tyr Asp Asp Ala Ser 

130 135 no 

Ser Val Asn He Tyr He Asn Ser Pro Trp Cys Asp Pro Phe Pro Tyr 
** s 150 155 160 

Thr Ser Trp Tyr Pro Ser Phe Ser Gly Trp Tyr Am Tyr Thr Trp Asn 

165 170 17S 

Tyr Pro Trp Phe Tyr Tyr Gly Ser His He Gly Trp Gly Gly Tyr Tyr 

180 i 8 5 190 

Pro Gly Tyr Asn Trp Tyr Trp Ser Tyr Tyr Tyr A^p Pro Phe Tyr Asn 

1?5 200 ?05 

Pro Tyr Gly He Gly Met Gly Trp Gly Tyr Pro Tyr Gly Trp Gly S-r 

210 215 220 

Tyr Tyr Gly Trp Gly Gly Tyr Pro Gly Val He His His Tyr His Hi- 
2:5 230 235 240 

Tyr Pro Lys Lys Thr Tyr Ser Asn Gl / Gin Hir Ser Gly Ala Tyr Tyr 

245 250 25b 

Ser Tyr Gly Arg Pro Asn Arg He Lys Gly Gly Thr Ser Gl*- Ala Lys 

260 265 270 

Leu Gly Thr Gly Arg Tyr Asp Arg He Gin Asn S*r Ser Ser Gin Lys 

2^5 2 S0 2t j5 

Asn Lys Phe Gly Leu Gin Ser A^n Lys Pro Asn Asn Asn Leu Gin Asn 

290 295 300 

Val Lys Ser Gly Arg Thr Gly Arg Ala Asn Arg Asp Arg Asn He Glu 
305 310 315 320 

Thr Val Thr Pro Asn Asn Gly Gin Lys Gin Asn Arg Pro Val Phe Gin 

325 330 335 

Gin Asn Gin Ser Gly Asn Asp Arg Pro Thr Gly Arg Asn He Arg Ser 

340 34i> 3b0 

Glu Arg Gin Gly Glu Asn Asn Asp Arg Thr Phe Ser Thr Pro Ser Axg 

355 360 365 

Ser Asn Ser Asn Gly Gly Phe Ser Thr Pro Ser Arg Ser Ser Ser Gl^ 

370 375 380 

Ser Met Scr Gly Gly Gly Gly Arg Ser Gly Arg Gly Arq Asn 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 507 

(i) SEQUENCE CHARACTERXSTICS : 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii! IIOLECULE TYPE; protein 

(Hi) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

[A) ORGANISM: Porphyromonae gingivalis 

(lx) FEATURE: 

{A> HAME/KEY: mi sc_f eat ar- 
te) LOCATION 1 . . .581 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:507 



t let 


He 


Arg 


Lys 


Leu 


He 


Leu 


Leu 


Leu 


Ala 


Leu 


Met 


Pro Val Ala 


Ser 


1 


Ala 






S 










10 






15 


Val 


Phfe 


Ala 


Val 


Pro 


Thr 


Asp 


Ser 


Thr 


Glu 


Ser 


Lys Asp Asn 


Arg 


He 


Leu 




20 










25 








30 


Thr 


Ser 


Met 


Gin 


Ser 


Ser 


Ser 


Leu 


Asn 


Arg 


Asp Asp Ala 
45 


Pro 


Asp 




35 










40 












Lys 
50 


Trp 


Gin 


Pro 


Ilet 


His 
55 


Ala 


Asn 


Phe 


Ser 


He 


Gin Ser Asp 


Met 


Leu 
65 


Leu 


Ser 


Thr 


Ala 


Gin 
70 


Lys 


Ser 


Lys 


Asn 


Thr 

75 


Trp 


Phe Gly Asn 


Ser 
80 


Tyr 


Tie 


Met 


Gly 


He 


He 


Lys 


Asn 


Asn 


Tyr 


Leu 


Giu 


Phe Gly Ala 


Arg 


Phe 


Glu 






85 










90 






9b 


Asp 


Leu 
100 


Tyr 


Lys 


Pro 


Leu 


Pro 
105 


Gly 


His 


Glu 


Pro Glu Met 
110 


Gly 
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Arg 


Gly 


Val 


Pro 


His 


Met 




Val 


Lys 


Gly 


Ser 


Tyr 


Hl^ 


Trp 


Ala 


Glu 






115 










120 










125 








Leu 


Thr 


Met 


Gly 


Asp 


Phe 


Tyr 


Asp 


Gin 


Phe 


Gly 


Ser 


Gly 


llet 


Val 


Phe 




130 










13S 










140 










Arg 


Thr 


Tyr 


Glu 


Glu 


Arg 


Asn 


Leu 


CMy 


Tl o 


Asp 


Am 


Ala 


Val 


Arg 


Gl v 


145 










150 










155 








160 


Gly 


Arg 


lie 


Val 


Leu 


Thr 


Pro 


Phe 


Asp 


Gly 


Val 


Arg 


Val 


Lys 


CI y 


He 










165 










170 










175 




Ala 


Gly 


Gin 


Gin 


Arg 


Am 


Tyr 


Phe 


Anp 


Arg 


The 


Gl y 


T.ys 


V^l 


Phe 


Asn 








180 










ies 








190 






Ser 


Gly 


Arg 


Gly 


Tyr 


Leu 


Leu 


Gly 


Ser 


Asp 


Leu 


Glu 


Leu 


Asn 


Val 


Glu 






195 










200 










205 








Arg 


Trp 


Ser 


Ser 


Ala 


Ilet 


Arg 


Asp 


Ann 


Asp 


Tyr 


Hi s 


Leu 


A] a 


lie 


Gl v 




210 










215 










220 








Gly 


Ser 


Phe 


Val 


Ser 


Lys 


His 


Glu 


Ala 


Asd 


Glu 


Asp 


He 


Phe 


Va 1 


Gl ,J 


225 










230 










235 










24 0 


Val 


Gly 


Clu 


Aso 


Arg 


Lys 


A 


Leu 


Asn 


Leu 


Pro 


Leu 


Asn 


Val 


Pro 


He 










245 










250 










255 




1 let 


Gly 


Leu 


Arg 


Thr 


Aim 


Phe 


Gin 


Lys 


Gly 


Gly 


Leu 


Ala 


Leu 


i i r 


Ala 








260 










2 6b 










270 




Glu 


Tyr 


Gly 


Tyr 


Lys 


Tyr 


Asn 


Asp 


Pro 


Ser 


Ala 


Asp 


Asn 


Asp 


1 yr 


11 ** 






275 










280 










205 






Tyr 


His 


Asp 


Gly 


Gin 


Ala 


Ala 


Leu 


Leu 


Ssr 


Ala 


Ser 


Tyr 


Ser 


Lys 






290 










295 










300 




Gly 


Met 


Ser 


lie 


Leu 


Leu 


Gin 


Ala 


Lys 


Arg 


Cys 


Glu 


Asn 


Phe 


Al a 


Phe 


305 










310 










31 5 










320 


Arg 


Ser 


Lys 


Arg 


S«r 


Ala 


Gin 


Leu 


Thr 


Pro 


Leu 


Het 


He 


Asn 


i yr 












325 










330 










335 




Pro 


Ala 


Phe 


Thr 


Gl n 


A1 a 


His 


Thr 


Tyr 


Thr 


Leu 


Ala 


Ala 


lie 


Tyr 


Pro 








340 










345 










350 




Tyr 


Ala 


Thr 


Gin 


Pro 


Gin 


Gly 


Glu 


Trp 


Ala 


Phe 


Gin 


Gly Glu 


Leu 


Arg 






355 










360 










365 






Tyr 


Asn 


Phe 


Ala 


Arg 


Arg 


Thr 


Ala 


Leu 


Gly 


Gl'.* 


Arg 


Tyr 


Gly 


Thr 


y 




370 










375 










380 






Leu 


Arg 


lie 


Asn 


Val 


Ser 


His 


Val 


Arg 


Gly 


Leu 


Asp 


Lys 


Lys 


Het 


Leu 


385 










390 










395 










4 00 


Lys 


Glu 


Asn 


Pro 


Asp 


Glu 


Leu 


Tie 


Gly 


Thr 


A.**p 


Gly 


Tyr 


Thr 


Val 


Ser 










405 










410 










415 




Phe 


Phe 


Gly 


Met 


Gly 


Asp 


Leu 


Tyr 


Tyr 


Ser 


Asp 


lie 


Asp 


Val 


Glu 


He 








420 










425 










430 






Thr 


Lys 


Lys 


Val 


Ser 


Pro 


Gly 


Phe 


Asn 


Phe 


Thr 


Leu 


Thr 


Tyr 


Leu 


Asn 






4 35 










440 










445 






Gin 


lie 


Tyr 


Asn 


Asn 


Lys 


Val 


Leu 


His 


Gly 


Ala 


Ala 


Gly Glu 


Lys 


Pro 




450 










455 










460 








Glu 


Lys 


lie 


Tyr 


Ala 


Asn 


lie 


Phe 


Val 


Tyr 


Asp 


Gly 


Lys 


Tyr 


Lys 


Leu 


4 65 










470 










475 








480 


Ser 


Asn 


Lyc 


Val 


Ale 


Lou 


Arg 


Thr 


Glu 


L^u 


Gin 


Tyr 


Leu 


iiis 


Thr 


Lys 










4 85 










490 










4 95 




Gin 


Asp 


Gin 


Gly 


Asp 


Trp 


lie 


Tyr 


Gly 


IleL 


Ala 


Glu 


Leu 


3ei 


He 


L€U 








5on 










505 










510 






Pro 


Ser 


Leu 


Met 


Leu 


Ser 


Leu 


Ser 


Glu 


Gin 


Tyr 


Asn 


He 


Cly 


Glu 


Thr 






515 










520 










525 






Lys 


Lys 


His 


Tyr 


Val 


Met 


Gly 


Ser 


Val 


Thr 


Tyr 


Thr 


His 


Gly 


Ala 


His 




530 










535 










540 










Arg 


Val 


Ala 


Pr- 


Ssr 


Ala 


Gly 


l.yc 


Thr 


Arg 


Ala 


Gly Het 


Asn 


Cys 


Ser 


545 










550 










555 








560 


Gly 


Gly 


Val 


eys 


Arg 


Val 


Val 


Pro 


Glu 


Thr 


Gin 


Gly 


Phe 


Tyr 


Leu 


Ser 










565 










S7 0 








57 5 




Tyr 


Ser 


Thr 


Asn 


Leu 

























560 



{ ?. J IMFORIIATIOII FOR SEQ ID NO: 500 

{i> SEQUENCE CHARACTERISTICS: 

(A) LE11GTH: 239 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

till) HYPOTHETICAL : YFS 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival Is 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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FEATURE : 

(A) NAME/ KEY : misefeature 

(B) LOCATION 1. . .23 9 

(Xl) SEQUENCE DESCRIPTION; SEQ ID IIO: 508 



Met 


Arg 


Ser 


Leu 


Phe 


Leu 


Ser 


Ala 


Lou 


Arg 


Ser 


Scr 


Scr 


Leu 


His 


Gly 


1 


Glu 






5 










10 










15 


Ser 


Arg 


Arg 


S~r 


Arg 


He 


Ser 


Ser 


Ser 


Val 


Val 


Met 


Ser 


He 


Arg 








20 










25 










30 




Gin 


Lys 


lie 


Arg 


Leu 


Phe 


His 


Leu 


Ser 


Val 


Cys 


Ala 


Gin 


Thr 


His 


Asp 






35 










40 










45 






His 


Leu 


He 


Glu 


lie 


His 


Leu 


Val 


Cys 


He 


Glu 


Phe 


Gly Ala 


He 


Asp 




50 










55 










60 








Thr 


Asp 


Glu 


Phe 


Artj 


Leu 


Ser 


Ser 


Hi - 


Al a 


Tyr 


Thr 


Thr 


Ser 


Pro 


Thr 


65 










70 










75 










BO 


His 


Thr 


Gly 


Al a 


He 


His 


Ills 


Asn 


Cys 


He 


Glu 


Arg 


Ser 


Tyr 


Gly Arg 










85 




















95 




Tyr 


Leu 


Val 


Thr 


Phe 


Gly 


Gin 


Glu 


Arg 


Asn 


Glu 


Leu 


His 


His 


His 


Sor 








100 










105 










110 






Arg 


Pro 


Asp 


Arg 


Asn 


Ala 


Glu 


Val 


Tyr 


Arg 


Phe 


Pro 


Phe 


Asp Asn 


Ala 






115 










120 










125 








Fhe 


His 


Ser 


lie 


Arg 


Tyr 


Glu 


Ala 


Phe 


Arg 


Pro 


He 


Arg 


Pro 


lie 


Val 




1J0 










135 










140 








Cy.- 




A-p 


Asp 


His 


Phe 


He 


Ala 


He 


Gly 


Sei 


His 


Leu 


Phe 


Phe 


Lyt 


145 




Gin 






150 










155 








160 


Asp 


Am 


He 


Phe 


Ser 


Ser 


Gly 


Ser 


G1 n 


Tyr 


Asp 


Asn 


Tyr 


Thr 


Val 


Ala 








165 










170 








175 




cy» 


Phe 


Val 


Glu 


Ser 


Leu 


His 


Asp 


Arg 


Glu 


Gin 


Arg 


Ser 


His 


Thr 








180 










185 








190 






His 


Thr 


Ala 


Sex 


Gly 


Thr 


Asn 


His 


Cys 


Ala 


Asp 


T.ou 


Phe 


Asp 


net 


Arg 


Thr 




195 










200 










205 




I.eu 


Scir 


Gin 


Arg 


Thr 


Tyr 


His 


He 


Arg 


Asp 


lie 


Val 


Ala 


Asp 


Phe 




210 










215 










220 








Glu 


Phe 


Gly 


Gin 


Phe 


Leu 


Gly 


Arg 


Phe 


Ala 


His 


Gly 


Leu 


Asn 


His 




225 










230 








235 








(2) 


III FORMAT IOH 


FOR 


SEQ 


ID NO: 509 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



Ui) IIOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

<Vi> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gi ngival is 

(ix) FEATURE: 

(A) NAME/ KEY: mi«c_f«aLure 

(B) LOCATION 1...211 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 



Ilet 


Scr 


He 


Arg 


Gin 


Lys 


He 


Arg 


Leu 


Phe 


His 


Leu 


Ser 


Val 


Cys 


Ala 


1 








5 










10 










15 




Gin 


Thr 


His 


Asp 


His 


Leu 


He 


Glu 


He 


His 


Leu 


Val 


Cys 


He 


Glu 


Phe 








20 










25 








30 




Gly Al a 


He 


Asp 


Thr 


Asp 


Glu 


Phe 


Arg 


Leu 


Ser 


Ser 


His 


Ala 


Tyr 


Thr 


Thr 




35 










40 










45 






Ser 
50 


Pro 


Thr 


His 


Thr 


Gly 
55 


Ala 


He 


ills 


Mis 


Asn 
60 


Cys 


He 


Glu 


Arg 


Ser 
65 


Tyr 


Gly 


Arg 


Tyr 


Leu 
70 


Val 


Thx 


Phe 


Giy 


Gin 

75 


Glu 


Arg 


Asn 


Glu 


Leu 
80 


His 


His 


His 


Ser 


Arg 


Pro 


Asp 


Arg 


Asn 


Ala 


Glu 


Val 


Tyr 


Arg 


Phe 


Pro 


Phe 








85 










90 








95 




Asp 


Asn 


Ala 


Phe 


His 


S-r 


He 


Arg 


Tyr 


Gl u 


Ala- 


Fh~ 


Arg 


Pro 


He 


Arg 






100 










105 










110 






Pro 


He 


v,-u 


Cys 


His 


Asp 


Asp 


His 


Phe 


He 


Ala 


He 


Gly Ser 


His 




Phe 
130 


115 










120 










125 








Leu 


Phe 


Lys 


Asp 


Asn 


Gin 
135 


He 


Pho 


Ser 


Ser 


Gly 
140 


Ser 


Gin 


Tyr 


Asp 
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Acn 


Tyr 


Thr 


Val 


Ala 


Cys 


The 


Val 


Glu 


Ser 


Leu 


His 


Asp 


Arg 


Glu 


Gin 


US 










ISO 










155 










160 


Arg 


Ser 


His 


Thr 


His 


Thr 


Ala 


Ser 


Gly 


Thr 


Asn 


His 


Cys 


Ala 


Asp 


Leu 










165 










170 










175 




Phe 


Asp 


Met 


Arg 


Thr 


Leu 


Ser 


Gin 


Arg 


Thr 


Tyr 


His 


lie 


Axg 


Asp 


He 








i no 










185 










190 






Val 


Ala 


Asp 


Phe 


Glu 


Phe 


Gly 


Gin 


Phe 


Leu 


Gly 


Arg 


Phe 


Ala 


His 


Gly 



195 200 205 



Leu Asn His 

210 

(2) INFORMATION FOR SEQ. ID WO: 510 

U) SEQUENCE CHARACTERISTICS : 

(A) LE11GTH: "?81 amino acidr 
[3( TYPE: amino acid 
( D> TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(ill) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE : 

(A> ORGANISM: Porphyromona; gingival as 

{i;:l FEATURE: 

(A) HAJ IE/KEY: raise feature 
{ B ) LOCATION* 1...7 81 



(xl) SEOUR11CE, DESCRIPTION: SEQ ID NO:510 



Het 


Asp 


Arg 


Pro 


Lys 


Pro 


Ser 


Tyr 


He 


Val 


Arg 


He 


Ala 


Ala 


He 


Leu 


1 








5 










10 










15 




Cys 


Leu 


Phe 


Val 


Gly 


Arg 


Pro 


Leu 


Phe 


Ala 


Gin 


Ser 


Tyr 


Val 


Asp 


Tyr 








20 










25 










30 




Vai 


Asp 


Pro 


Leu 


He 


Gly 


Thr 


Leu 


Ser 


Ser 


Phe 


Glu 


Leu 


Ser 


Ala 


ciy 






35 










•10 
















Asn 


Thr 


Tyr 


Pro 


Val 


He 


Gly 


Leu 


Pro 


Trp 


Gly 


Met 


Asn 


Ser 


Trp 


Thr 




50 










55 










60 










Pro 


Met 


Thr 


Gly 


Val 


Pro 


Gly 


Asp 


Gly 


Trp 


Gin 


Tyr 


Thr 


Tyr 


Ser 


Ala 


65 










70 










75 








80 


His 


Lys 


He 


Arg 


Gly 


Phe 


Lys 


Gin 


Thr 


His 


Gin 


Pro 


Ser 


Pro 


Trp 


lie 










85 










90 










95 




Asn 


Asp 


Tyr 


Gly 


Gin 


Phe 


Ser 


Leu 


Leu 


Pro 


Leu 


Thr 


Ala 


Pro 


Gin 


Lys 








100 










105 










110 




Pro 


Ser 


Ser 


Asn 


Asp 


Ser 


He 


Ala 


Leu 


Thr 


Lys 


Trp 


Cys 


Lys 


Gin 


Leu 






115 










120 










125 








Phe 


Sei 


Asp 


Glu 


Gin 


Thr 


Ser 


Trp 


Phe 


Ser 


m s 


J.ys 


Al a 


Glu 


Thr 


Ala 




130 










125 










1 10 










Thr 


Tro 


Tyr 


Tyr 


Tyr 


Ser 


Val 


Tyr 


Leu 


Ala 


Asp 


Tyr 


Asp 


Thr 


Arg 


Val 


145 










150 










155 










160 


Giu 


Met 


Ala 


Pro 


Thr 


Glu 


Arg 


Ala 


Ala 


Tie 


Phe 


Arg 


He 


Arg 


Tyr 


Ser 










165 










170 








175 




Gly 


Asn 


Thr 


Glu 


Ser 


Gly 


Ser 


Gly 


Arg 


Trp 


Leu 


Arg 


Leu 


Asp 


Ala 


Ph^ 








3 B0 










185 










190 






Thr 


Gly 


Gly 


Ser 


Glu 


He 


Ser 


He 


Val 


Asp 


Pro 


His 


Thr 


Val 


Val 


Gly 






19b 










200 










205 






He 


Ser 


Arg 


Lys 


Asn 


Ser 


Gly 


Gly 


Val 


Pro 


Ala 


Asn 


Phe 


Ala 


Cys 


Tyr 




210 










215 










220 






Phe 


He 


Leu 


Gin 


Ser 


Asp 


Thr 


Pro 


Mat 


Ala 


Asp 


Val 


Leu 


Leu 


Glu 


Thr 


225 










230 










235 










240 


A»rp 


Thr 


Gly 


I*Y* 


Ser 


Asp 


Glu 


Gly 


Thr 


Arg 


Ala 


Trp 


Ala 


Ala 


Cys 


Arg 










2J5 










250 










255 




Phe 


Asp 


Ser 


Gin 


Glu 


Vol 


Thr 


Val 


Arg 


Val 


Ala 


Ser 


Ser 


Phe 


He 


Ser 








260 










265 










27 0 






Val 


Glu 


Gin 


Ala 


Glu 


Arg 


Asn 


Leu 


Ala 


Glu 


Val 


Lys 


Gly 


Gin 


Ser 


Phe 






275 










280 










285 








Asp 


Arg 


He 


Arg 


Leu 


Ala 


Gly 


Arg 


Glu 


Ala 


Trp 


Asn 


Lys 


Val 


Leu 


Gly 




2*0 










2 05 










300 






Arg 


He 


His 


Val 


Glu 


sly 


Gly 


Thr 


Lys 


Asp 


Glu 


Arg 


Thr 


Thr 


Phe 


Tyr 


305 










310 










315 










320 


Ser 


Ala 


Leu 


Tyr 


Arg 


Cys 


Leu 


Leu 


Phe 


Pro 


Arg 


Arg 


Phe 


Tyr 


Glu 


Glu 










325 










330 










335 




Asp 


Ala 


Ser 


Gly 


Asn 


Pho 


Val 


His 


Tyr 


Ser 


Pro 


Tyr 


Asn 


Gly Glu 


Val 



3-10 345 350 
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Leu Pro Gly Tyr Leu Tyr Thr A.*?p Thr G]y Phe Trp Asp Thr Phe Arg 

355 360 365 

Ala Leu Phe Pro Leu Leu Asn Leu Leu Tyr Pro Asp Glu Asn lie Lys 

370 375 360 

Tie Gin Glu Gly Leu Leu Asn Val Tyr Arg Glu Ser Gly Phe Phe Pro 
385 390 395 400 

Glu Trp Ala Ser Pro Gly His Arg Asp Cys Net lie Gly Asn Asn Ser 

405 <|io 415 

Ala Ser Val Leu Ala Asp Ala Tyr Leu Lys Gly Val Aro Val Glu Asp 

420 425 "* 430 

Thr Arg Thr Leu Met Asn Gly Leu Leu His Ala Thr Lys Ala Val His 

435 440 445 

Pro Ly.T lie Ser Ser Thr Gly Arg Lys Gly Trp Glu Trp Tyr Asn Ser 

450 455 460 

Leu Gly Tyr Val Pro Ala Asp Ala Gly He Asp Glu Ser Ala Ala Arg 
465 470 475 4 GO 

Thr T.cu Glu Tyr Al i Tyr Asn Asp Trp Cys He Leu Arg Leu Gly Arg 

485 490 495 

Thr Leu Gly Trp Asp Arg Ala Ala Leu Asp Thr Leu Ala His Arg Ser 

500 SOS 510 

Mot Asn Tyr Arg His T.eu Phe Asp Pro Glu Thr Lys Leu Met Arg Gly 

515 520 525 

Arg Asn Gin Asp Gly Ser Phe Arg Thr Pro Phe Ssi Pro Phe Lys Trp 

530 535 540 

Gly Asp Vsl Phe Thr Glu GJ y Asn Ala Trp His Tyr Thr Trp Ser Val 
S45 550 555 S60 

Phe His Asp Val Gin Gly Leu He Asp Leu Met Gly Gly Asp Arg Pro 

565 570 575 

Phe Val Ser Met l.eu Asp Ser Val Phe Asn Thr Pro Pro Met Phe Asp 

580 585 590 

Glu Ser Tyr Tyr Gly Phe Val He His Glu He Arg Glu Met Gin He 

595 600 605 

Ala Asp Met Gly Arn Tyr Ala His Gly Asn Gin Pro He Gin His Met 

610 615 620 

He Tyr Leu Tyr Aen His Ala Gly His Pro Trp Lys Ala Gin Glu Arg 
S25 630 635 640 

Leu Arg Glu Val Met Gly Arg Leu Tyr Arg Pro Thr Pro Asp Gly Tyr 

645 650 ess 

Cys Gly Asp Glu Asp Asn Gly Gin Thr Ser Ala Trp Tyr Val Phe Ser 

660 665 670 

Ala Leu Gly Phe Tyr Pro Val Thr Pro Ala Thr Asp Gin Tyr Val Leu 

675 680 685 

Gly Ser Pro He Phe Ser Lyc Val Tie Leu Ser Phe Pro Asp Gly His 

690 695 700 

Lys Thr Val Leu His Ala Pro Ala Asn Ser Ala Asp Thr Pro Tyr ll~ 
705 710 715 720 

Arg Ser He Ser Val Glu Gly Lys Glu Trp Ser Cys Asn Tyr Leu Thr 

7 25 730 735 

His Glu Gin L-u Arq Ser Ser Ala Ser He Gin Trp Met Met Asp Thr 

740 745 750 

Lys Pro Ain Tyr Asn Arg Gly Met Lys Glu Ser Asp Arg Pro Tyr Ser 

7 55 760 765 

Phe Ser Thr Glu Gin Gin Arg Arg Ala Asn His Ser Asn 
7 70 775 730 

(2) INFORMATION FOR SEQ ID HO: 511 

11} SCQUCHCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

til) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

lv1 } ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

tlx) FEATURE: 

(A) NAME/ KEY: ml sc_f ea t ur c- 
{&) LOCATION 1. . ,271 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 
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Met Met Lys Ser Met Arg Ser Val Leu l.cu T.ru i. =u Phe Pro I.cu S-r 

1 5 10 15 

Leu He Thr Ala Leu Gly Cys Ser Asn A^n Lys Ala Ala Glu S-r Lvs 

20 25 30 

Ser Val Ser Phe Asp Ser Ala Tyr Leu Glu Arg Tyr Tie Pro L-u Arq 

35 40 45 

Ala Asp lie Asp Thr Pro Ser Leu His Val Met He Ser Tyr Val Tvr 

SO S5 60 

Pro Ser Gly Asp Asp Met Leu Thr Glu He Phe Asn Cly Leu Leu Phe 
65 7 0 75 80 

Gly Asp Ser Leu Met Asp Ser Ser Ser Pro Glu Asn Ala Met Glu Gly 

85 90 95 

Tyr Ala Gin Met Leu Gly Glu Asp Tyr Arg Ser Asn Asn Ala Glu Ala 

100 105 no 

Asn Leu Gin Gly Leu Pro Ser Asp Leu Leu Asp Tyr He Tyr Lys Gin 

US 120 125 

Glu Asn Thr He Ala Tyr Cys Asp Thr Gly Lciti Tic Ser Thr Arq Tin 

130 135 140 

Asn Thr Tyr Thr Tyr Glu Glj Gly Ala His: Thr Glu Asn Thr Val Arq 
145 150 155 iso 

Phe Ala Asn He T.eu Arg Thr Thr Gly Lys Val Leu Glu Glu Arg Asp 

165 170 175 

He Phe Lys He Asp Tyr Ala Glu Arg Leu Ser Ala Leu He lie Glv 

1B0 105 190 

Glu Leu Val His Asp Phe Gly T.yn Thr Thr Pro Ala Glu Leu A?p A la 

195 200 205 

He Gly Phe Phe Asn Ala Glu Glu He Gin Pro Asn Gly Asn Phe Met 

?10 215 220 

He Asp Asp Lys Gly Leu Thr Tyr Cys Phe Asn Glu Tyr Gin He Ala 
225 230 235 240 

Ala Tyr Ala Arg Gly Ala Val Tyr Val Arg Leu Gly Tyr Asp Val Leu 

245 250 255 

Ala Pro Leu Leu Arg Asp Asp Ser Pro Leu Lys Arq Tyr Leu Pro 
260 265 270 

(2) INFORMATION FOR SEQ ID HO: 512 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 
(B } TYPE: amino acid 
<D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyronwnas gingival is 

fix) FEATURE: 

(A) NAME/ KEY: mi sc_ feature 
{B> LOCATION 1...27 0 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 512 



Met 


Lys 


Ser 


Met 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 


Thr 






5 










10 










15 




He 


Ala 


Leu 
20 


Gly 


Cys 


Ser 


Asn 


Asn 
25 


Lys 


Ala 


Ala 


Glu 


Ser 
30 


Lys 


Ser 


Val 


Ser 


Phe 

35 


Asp 


Ser 


Ala 


Tyr 


Leu 
40 


Glu 


Arg 


Tyr 


Tie 


Pro 
45 


T.eu 


Arg 


Ala 


Asp 


He 


Asp 


Thr 


Pro 


Ser 


Leu 
55 


His 


Val 


Met 


He 


Ser 
60 


Tyr 


Val 


Tyr 


Pro 


Ser 
65 


Gly 


Asp 


Asp 


Met 


Leu 
70 


Thr 


Glu 


He 


Phe 


Asn 
75 


Gly 


Leu 


I.eu 


Phe 


Gly 
an 


Asp 


Ser 


Leu 


Ilet 


Asp 
85 


Ser 


Ser 


Ser 


Pro 


Glu 
90 


Asn 


Ala 


Met 


Glu 


Gly Tyr 


Ala 


Gin 


Met 


Leu 


Gly Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn 


Ala 


Glu 


Ala 


Asn 








100 










105 










110 




Leu 


Gin 


Gly 


Leu 


Pro 


Ser 


Asp 


Leu 


Leu 


Asp 


Tyi 


lie 


Tyr 


Lys 


Gin 


Glu 






\ 15 










120 










125 






Asn 


Thr 


He 


Ala 


Tyr 


Cys 


Asp 


Thr 


Gly 


Leu 


He 


Ser 


Thr 


Arg 


He 


Asn 




130 










135 










140 






Thr 
145 


Tyr 


Thr 


Tyr 


Glu 


Gly 
IbO 


Gly 


Ala 


His 


Thr 


Glu 
155 


A3T\ 


Thr 


Val 


Arg 


Phe 
160 
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Ala 


Asn 


lie 


Leu 


Arg 


Thr 


Thr 


Gly 


Lys 


Val 


Leu 


Glu 


Glu 


Arg 


Asp 


11* 










165 










170 










175 




Phc 


Lys 


Tie 


Asp 


Tyr 


Ala 


Glu 


*rg 


Leu 


Ser 


Ala 


Leu 


lie 


lie 


Gly 


Gin 








180 










1B5 










190 






Leu 


Val 


His 


Asp 


Phe 


Gly 


Lye 


Thr 


Thr 


Pro 


Ala 


Glu 


Leu 


Asp 


Alo 


lie 






195 










ZOO 


















Gly 


Phe 


Phe 


Am 


Ala 


Glu 


Glu 


lie 


Gin 


Pro 


A^n 


Gly 


Asn 


Phe 


tlet 


He 




210 










215 










220 










Asp 


Asp 


Lys 


Gly 


Leu 


The 


Tyr 


Cys 


Phe 


Asr, 


Glu 


Tyr 


Gin 


lie 


Ala 


Ala 


225 










230 










235 










240 


Tyr 


Ala 


Arg 


Gly 


Ala 


Val 


Tyr 


Vol 


Arg 


Leu 


Gly 


Tyr 


A^p 


Val 


Leu 


Ala 










245 










250 








255 




Pro 


Leu 


Leu 


Arg 


Asp 


Arp 




Pro 


Leu 


Lys 


Arg 


Tyr 


Leu 


Pro 












260 










265 










210 






(2] 


1H FORMATION 


FOR 


SEQ 


ID NO:513 

















111 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 
( 0 ) TYPE: amino acid 

UM TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

HYPOTHETICAL: YES 

[VI ) ORIGINAL SOURCE: 

I A) ORGANISM: Porphyromonaj gingival Is 

(1>:) FEATURE : 

(AJ NAME/ KEY: mlsc_feature 

(B) LOCATION 1. - .267 



(Xi) SEQUENCE DESCRIPTION: SEP ID NO: 01 J 



Met 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 


He 


Thr 


All 


1 








5 










10 










15 




Leu 


Gly uys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Lys 


Ser 


Val 


Ser 


Phe 








20 










25 










30 






Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


He 


Pro 


Leu 


Arg 


Ala 


Asp 


lie 


Asp 






35 










40 










45 








Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


He 


Ser 


Tyr 


V„l 


Tyr 


Pro 


Ser 


Gly 


Asp 




50 




















60 






Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


Gly 


Asp 


Ser 


Leu 


65 










70 










75 










80 


M-t 


A.-p 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly 


Tyr 


Ala 


Gin 


Met 










05 










90 










95 




Leu 


Gly 


Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn 


Ala 


Glu 


Ala 


Afn 


Leu 


Gin 


Civ- 








100 










105 










110 






Leu 


Pro 


Ser 


Asp 


Leu 


Leu 


Asp 


Tyr 


He 


Tyr 


Lys 


Gin 


Glu 


Asn 


Thr 


ile 






115 










120 










12S 








Ala 


Tyr 


Cys 


Arp 


Thr 


Gly 


Leu 


Tie 


Ser 


Thr 


Arg 


He 


Asn 


Thr 


Tyr 


Thr 




130 










135 










140 










Tyr 


Glu 


Gly 


Gly 


Ala 


His 


Thr 


Glu 


Asn 


Thr 


Val 


Arg 


Phe 


Ala 


Asn 


He 


145 










150 










155 








160 


Leu 


Arg 


Thr 


Thr 


Gly 


Lys 


Val 


Leu 


Glu 


Glu 


Arg 


Asp 


lie 


Phe 


Lys 


He 










165 










170 










175 




Asp 


Tyr 


Ala 


Glu 


Arg 


Leu 


Ser 


Ala 


Leu 


lis 


He 


Gly 


Gin 


Leu 


Val 


His 








180 










18b 










190 






Asp 


Phs 


Gly 


T.ys 


Thr 


Thr 


Pro 


Ala 


Glu 


Leu 


Asp 


Ala 


lie 


Gly 


Phe 


Phe 






195 










200 










205 






Asn 


Ala 


Glu 


Glu 


He 


Gin 


Pro 


Asn 


Gly 


Asn 


Pne 


Met 


Tie 


Asp 


Asp 


Lys 




210 










21S 










220 




Gly 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asn 


Glu 


Tyr 


Gin 


He 


Ala 


Ala 


Tyr 


Ala 


Arg 


225 










230 










235 










2-10 


Gly 


Ala 


Val 


Tyr 


Val 


Arg 


Leu 


Gly 


Tyr 


Asp 


Val 


Leu 


Ala 


Pro 


Leu 


Leu 










245 










250 










255 




Arg 


Asp 


Asp 


Ser 


Pro 


Leu 


Lys 


Arg 


Tyc 


Leu 


Pro 













260 265 
(2) INFORMATION FOR SEO ID HO: 51 4 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
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(D) TorOLOGY: linear 

f ii ) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Poiphyromonas gingivalxs 

Ux) FEATURE: 

(A) NAME/ KEY: ml sc_f cat ure 

(B) LOCATION 1...25 9 

(xi ) SEQUENCE DESCRI PTIOII : SEQ ID NO: 514 

Met Lye Lys Thr Thr Leu Thr Gly Ser lie Cys Ala Leu Leu Leu Phe 

15 10 is 

Leu Gly Leu Ser Ala Asn Ala Gin Scr Lys Leu Lyc lie Lys S-r II- 

20 25 30 

Glu Ma Ala Thr Thr Phe Ser Ser Ala Thr Ala Gly Asn Gly Phe Gl- 

35 4 0 45 

Gly Asn He Phe Gly Met A.-p Met Ser He Arg Met Arg Val Hi- Hi - 

50 5£ 6r , 

Ser He Leu Pro Glu Gly Leu Aep Phe Ser Val Gly He His Glu An 
65 70 75 

Arg Ala His Trp Glu Glu Ala Gly Ser Pro Lys Leu Met Tyr Thr Asn 



85 on 



95 



Val Pro Ser He He Gly He Val Glu Lye Val He Val Pho Glu Ai>p 

100 105 no 

Ala Glu Asp Phe Phe Asp Lys Lys Ala Leu Gly Arg Phe Leu lie Ser 

115 120 i2s 

Leu Gly He Ser Tyr Thr Lys His Leu Gly Ala Tyr Trp Gly Trp Thr 

130 135 i<io 

Asn Asp Al* His He Leu Phe Ser Pro He Pro Lys Ser Lys Val His 
14S 150 155 igo 

Tyr Asp Thr Tyr Thr Arg Ala Gly Ser Asp Leu Val Leu Gin Ser Glu 

165 no 175 

Asp Val Ala Thr Val Ser Asn Gly Phe Ser fro Gly He Gly Leu Lys 

180 185 igo 

Ser Ser He Trp Trp Lys Met Pro He Lys Ser Lye Tyr Asp Phe Arg 

195 200 205 

Leu Gly Phe Ser Leu Gly Tyr Glu Tyr Leu Asn Leu Leu Tyr Pro Tvr 

210 215 220 

Arg Asn Phe Lys Leu Asp Gly Asn Lys Pro L=u Ser Ala Leu Ser Pro 
225 230 235 240 

Arg .Met Asn His Tic Gly His Val Gly Phe Asn Phe Thr Val Gly Leu 
2 45 250 255 

Trp Thr Asn 

(2) INFORMATION FOR SEQ ID NO: 515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1266 amino acids 
IB) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

[iii> HYPOTHETICAL: YES 

(vl> ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonai yj.ngj.vslis 

(ix> FEATURE: 

(A) NAME/KEY: miscfeature 
<B) LOCATION 1...1266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

Met G3y Lys Tyr Lys Arg Ala Lys Tyr Arg Tyr Trp Leu Phe Pro Phe 

5 10 is 

Cys Set Asp Tyr Tyr Thr Phe Glu Gly Val Thr Phe Leu Cys Ala S-r 

20 25 30 

Asp Asp Met Thr Thr Lys L~ s Pro Gin Al a He l.au A* P Leu Glu Ly- 
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35 40 45 



Al a 


Tyr 




ri e 


Glu 


He 




Asp 


Leu 


Ser 


Ser 




Glu 


Gly 


He 


Ser 




50 










55 
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Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro Gin Thr Thr Asp Asp 

645 650 655 

Thr 11- lie 11- Gin Thr His Ala Asp Glu Thr Gly Ala Lys Gin Gin 

660 665 670 

Thr Leu Gly Cys Ala Ala Glu Asn Gly Val L«u Glu Glu I)« Tyr Val 

6/5 680 $85 

Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His Ala Leu Asn Tyr Leu 

6?0 695 70D 

Asn Glu Arg Val Arg Glu Val Val Ala fi-r Arg Scr Lys Srr XI- Gin 
7 °5 7 10 715 720 

He Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro Thr Il«* Ala 

725 730 73S 

Gly Asp Asn Lys His He Pro Ho Ser Leu Glu A) n Lou Ala Ala Gin 

740 745 750 

Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr He Glu Tyr L-u 

755 760 765 

Gin Thr Glu Leu Asn Gin Leu Ser Leu Arg Gly Glu Val Leu Tyr T"r 

770 775 780 

Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp Leu Asp Pro Ala Ala 
785 7 90 79S 00 0 

Pho Val Gin Met He Hip Gly Glu He Leu Gin Lys Asp Asn He Asn 

805 bio B1S 

Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys Lys Leu His Asn L-u 

825 830 
S~r Ser Gly S-r He Phe Glu Glu Asp Gly Gin Asn Gly Asn Met n- 

835 840 845 

Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp Lvs Asd Cv* 

850 855 860 

Tyr val 71- Pro Gly Tyr Leu Pro Leu His Ser Asp Asp Glu Ala Tyr 
870 875 8 flO 

Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asn Phe Val Leu Lys Phc 

00S 890 895 

Glu Arg Phe He Pro Phe Gly Leu He Asn Gin He He Ala Tyr Tyr 

900 90S gin 

Gly Arg Glu Glu Gly Ala Leu Lys Arg Tyr Trp Arg Asp Gin Val Il« 

915 920 925 

Phe Thr Ala Gly Arg Glu Met Asp Arg Gin Thr Leu Glu Gin Glu Glu 

930 935 940 

Glu Lys Glu Gly Leu Pro Lys Thr Asn Ala Glu Asp Tyr Gin He Trp 
94S 9SO 955 96 Q 

He Lys Leu A.-p Phe Thr Asp Leu Ala He Ser Val Phe He Lys Glu 

965 970 075 

Gin Arg Lys Thr Sex Ala Lys Asp Met Gin Arg Lys Glu Ala Thr He 

900 985 990 

Leu Ser Asp Met Leu Asp Met Tyr Trp Asn Asn He Pro Pro Arg Glu 

1000 1005 
Gin He Gly Asp Lys Asp Thr Glu Gin Thr Axg Ser Thr He Arg Glu 

1010 1015 1020 

Thr Asn Arg l.yr Ly* Ar<] Pro He Gin Asp Leu Tyr Leu S~r Cys Ala 
1025 1030 1035 10^0 

Gin Ala Asp Lys Asp Leu Thr Glu Ser His Tyr He His Leu Gly Thr 

1045 1050 1055 

Leu Asp Asp Glu Ser Lys Thr Thr Ala Arg He Ala Ala Tyr Pro Leu 

1060 1065 1070 

Lys Asn Gly Val He Ar>p Lys Glu Arg Val Arq Glu Val Ser Thr Arq 

1075 1080 1085 

Pro Tyr Lys H1e Leu Ser Val Asn Lys Asn Leu Ala Thr Ala Lys Gin 

1090 1095 HOO 

Tl- Fhe He Ser Tyr Ser Lys Glu Asp Gin Thr Glu Leu Glu Thr Cys 
1105 mo 1115 y U 20 

Leu Gin Phe Phe Lys Pro Leu Glu Lye Asn Gly Gin He Glu Tie Tyr 

H25 H30 U35 

Tyr Asp Lys Leu Thr Ly.- Phe Glu Thr Pro He His Pro Glu He Arg 

1140 1145 iiso 

Lys Arg He Val Glu Ala Asp Cys He H a Ala Leu He Ser Gin Arq 

11^5 H60 H65 

Tyr Leu Ala Thr Asp Tyr He Leu Asp His Glu Leu Pro Val Ph~ Ara 

H 7 0 1175 neo 

Glu Tyr A^n Lys Thr He Val Pro He Leu He Lys. Pro Cys Thr Phc 
118S iigo 119b 12Q0 

blu A^P Glu Leu Arg Glu Lys Tyr Phe Ala Gin Lys Ala Gin 

1205 1210 1215 

He He Asn Leu Gly Lys Glu Gly Lyc Thr He Lys Ala Tyr Asp S=r 

1220 1225 1230 

He Thr Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala Val Val Arn 
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10 



15 



20 



25 



30 



■ j.^iu 1245 

Glu Fhe Lys Glu Lys He Leu Arg He Thr Lys Gin Glu Val Asn Thr 

12S0 12S5 1260 

Asp Glu 
1265 

(2) IMFOkllATlOII FOR SEQ ID NO: 51 6 

(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 1232 amino acids 
tB) TYfE: amino acid 
(D) TOPOLOGY: linear 

(il» MOLECULE TYPE: protein 

Uii» HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Parphyromonds gingivalis- 

( 1 >: ) FEATURE : 

(A) NAME/ KEY: mtsc_f eature 
{B> LOCATION I... 1232 

(xJ ) SEQUENCE DCSCRIFTIOIf: SEQ ID NO: 516 

Net Tnr Thr Lys Lys Pro Gin Ala II o Leu Asp Leu Glu Lye Ala Tyr 
1 b 10 15 

Asn He Glu He Pro Asp Leu Ser Ser Gin Glu Gly He Ser Trp S-r 

20 25 3n 

Val Asn Arg Tyr Phe Lys Gin Asp Scr Ser Gly Ala Val Val Glu Leu 

3S 40 4S 

Cys Leu Arg Glu Cys Gin He Glu Ser Met Thr Trp Leu lie Asp Phe 

50 55 so 

Pro Ala Leu Lys Lys Leu Asp Leu Ser Tyr Asn Gin He Ser Lys Leu 
65 70 75 80 

Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys Leu Arg Leu Arg Ser 

85 90 as 

Asn Gin He Arg Lys Leu Glu Gly Leu Asp Ser Leu Thr Ser Leu Thr 

100 105 no 

Lys Leu Ser Leu Ser Asp Asn Gin He Ser Lys Leu Glu Gly Leu Glu 

115 120 125 

Arg Leu Thr Scr Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin He S~r 
_ T iJ0 135 140 

Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu 
I 45 . 150 155 160 

Ser ^ly Asn Gin He Arg Lyc Leu Glu Gly Leu Glu Arg Leu Thr Ser 

165 170 175 

Leu Ala Thr Leu Glu Leu Ser Gly Asn Gin He Arg Lys Leu Glu Gly 
OU 180 X85 190 

Leu Glu Axg Leu Thr Ser Leu Thr Lys I.eu Arg Leu Arg Ser Asn Gin 

195 20O 205 

He Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu 

210 215 220 

Glu Leu Ser Gly Asn Gin He Arg Lys Leu Glu Gly Leu Clu Arg L-u 
i? 5 230 ?35 ?4 0 

Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gin He Ser Lys Leu 

245 250 255 

Glu Gly Leu Glu Arg Leu Ser Ser Leu Thr Lys Leu Arg Leu Ara S~r 

260 265 270 

Asn Gin II- Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr S~r L°u Thr 

275 280 285 

Lys Leu S~r Leu Ser Asp Asn Gin He Ser Lys Leu Glu Gly Leu Glu 

2?0 295 300 

Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin He Arg 
r f . 310 315 320 

Lys Leu Gin Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys Leu Arg Leu 

325 330 335 

Arg Ser Asn Gin He Ser Lys Leu Glu Gly Leu Asp Ser Leu Thr Ser 

— 310 345 350 

Leu Ihr Lys Leu Ser Leu Ser Asp Asn Gin He Ser Lys Leu Glu Gly 

355 360 365 

Leu Glu Arg Lou Thr Ser Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin 
3/0 375 3fiQ 

I-» Arg Lys Leu Glu Gly Leu Asp Gly Leu Als Ser Leu Thr Arg T.eu 



40 



45 



55 



60 



65 



70 



75 
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385 390 395 400 

Ser Leu Arg Arg Asn Gin lie Ser Lys Leu Glu Gly Leu Asp Arg Leu 

4 05 410 415 

Lys Vol Leu Arg Lye Leu Asp Val Ser Gly Asn A^p n« Gin b'er 11- 

420 425 430 

Asp Asp lie Lys Leu Leu Ala Pro lie Leu Glu Gin Thr Leu Glu Lys 

435 440 445 

Leu Arg lie His Asp Asn Pro Phe Val Ala Ser Ser Gly Leu lie Leu 

450 455 460 

Ser Pro Tyr Asp Asn His Leu Pro Glu lie Lys Ala Leu Leu Glu Lys 
4 65 47 0 47b 4 80 

Glu Lys Glu Lys Gin Lys Lys Thr Ser Val Glu Tyr Ills Pro Phe Cys 

485 490 495 

Lys Val Met Leu Leu Gly Asn His Ser Ser Gly Lys Thr Thr Pha Leu 

500 505 510 

Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr Gin Lys Asn Thr His Val Leu 

515 520 525 

Ser lie His Arg Ser Asn Asn Pro Asn Ala lie Phe Tyr Asp Phe Gly 

530 535 5,10 

Gly Gin Asp Tyr Tyr His Gly lie Tyr Gin Ala Phe Phe Thr Thr Gin 
545 550 555 560 

Ser L%u Tyr Leu Leu Phe Trp Asp Ala I.ys T.ys Asp Arg A.^n Fhe Val 

565 570 575 

Ser Val Asp Asp Lys Glu Tyr Gin Thr Leu Asn Phe Asn Arg Pro Tyr 

580 565 590 

Trp Leu Gly Gin lie Aid Tyr Ala Cys Asn Arg Cys I let Ser Val Gly 

595 600 60S 

Gly Asn Pre Asp Gly Lys Asp Thr Pro Gin Thr Thr Asp Asp Thr II- 

610 615 620 

He He Gin Thr His Ala Asp Glu Thr Gly Ala Lys Gin Gin Thr Leu 
625 630 635 640 

Gly Cys Ala Ala Glu Asn Gly Val Leu Glu Glu He Tyr Val Ser Leu 

645 650 655 

Glu Pro Lys Ala Asn Ser Aid Val His Ala Leu Asn Tyr Leu Asn Glu 

660 665 670 

Arg Val Arg Glu Val Val Ala Ser Arg Ser Lys Ssr He Gin He Thr 

675 680 685 

Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro Thr He Ala Gly A^p 

690 695 700 

Asn Lyft His He Pro He Ser Leu Glu Ala Leu Ala Ala Gin Leu Asn 
705 710 715 720 

Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr Tie Glu Tyr Leu Gin Thr 

725 730 735 

Glu Leu Asn Gin Leu Ser Leu Arg Gly Glu Val Leu Tyr Tyr Arg Glu 

740 74S 750 

Asn Glu Lys Leu Asn Atn Tyr Val Trp Leu Asp Pro Ala Ala Phe Val 

755 760 765 

Gin Met He His Gly Glu He Leu Gin Lys Asp Asn He Asn Arq Glv 

770 775 730 

Thr Val Pro Lys Asp lis Phe Glu Cys Lys Leu His Asn Leu Ser 3-r 
785 790 795 e00 

Gly Ser He the Glu Glu Asp Gly Gin Asn Gly Asn Met He Leu Gin 

005 910 815 

Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp Lys Asp Cys Tyr Val 

820 825 830 

He Pro Gly Tyr Leu Pro Leu His Ser Asp Asp Glu Ala Tyr Lys Trp 

835 840 845 

Leu Thr Leu Gly Phe Glu Arg Pro Asn Phe Val Leu Lys Ph» Glu Arg 

850 855 8 60 

Fhe He Pro Phe Gly Leu He Asn Gin He lie Ala Tyr Tyr Gly Ara 
865 870 875 880 

Glu Glu Gly A3 z L»n Lys Arg Tyr Trp Arg Asp Gin Val Ha Phe Thr 

985 flqo 895 

Ala Gly Arg Glu llet Asp Arg Gin Thr Leu Glu Gin Glu Glu Glu Ly? 

900 905 910 

Glu Gly Leu Pro Lys Thr Asn Ala Glu Asp Tyr Gin He Trp He Lys 

915 920 925 

Leu Asp Phe Thr Asp Leu Ala He Ser Val Phe He Lys Glu Gin Ara 

930 935 940 

Lye Thr Sor Ala Lys Asp Met Gin Arq Lys Glu Ala' Thr He Leu Ser 
945 950 955 9&O 

Asp llet Leu Afp Met Tyr Trp Asn Asn He Pro Pro Arg Glu Gin He 

965 970 975 

Gly Asp Lys Asp Thr Glu Gin Thr Arg Ser Thr Tie Arg Glu Thr Asn 
980 985 Q90 
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Arg Lys L^s Arg Pro He Gin Asp Lau Tyr Leu Sex Cys Al<i Gin Ala 

?95 1000 1005 

P&p l.ys Af P I.en Thr Glu Ser His Tyr He Mis Leu Gly Thr L~u Asp 

1010 1015 1020 

Asp Glu Ser Lys Thr Thr Ala Arg lie Ala Ala Tyr Pro Leu Lys Asn 
1025 1030 10 35 1040 

Gly Val Tie Asp r.ya? Glu Arg Val Arg Glu Val Ser Thr Arg Pro Tyr 

1045 1050 1055 

Lys His Leu Ser Val Asa Lys Asn Leu Ala Thr Ala Lys Gin lie Phe 

1060 1065 1070 

He Ser Tyr Ser Lys Glu Asp Gin Thr Glu Leu Glu Thr Cys Leu Gin 

1075 10SO 1085 

Phe Phe Lys Pro Leu Glu Lys Asn GJ y Gin Tie Glu He Tyr Tyr Asp 

1090 1095 HOO 

Lys Leu Thr Lys Phe Glu Thr Pro He His Pro Glu He Arq Lvs Ara 

H05 mo lils g y g 112Q 

He Val Glu Ala Asp Cys He Tic Aln Leu Tie Ser Gin Arg Tyr Lou 

1125 H30 H35 

Ala Thr Asp Tyr He Leu Asp His Glu Leu Fro Val Phe Arg Glu Tyx 

11-10 H45 11S0 

Asn Lis Thr He Val Pro He Leu Ho Lys Pro Cye Thr Phe Glu Asp 

1X55 1160 H65 

Asp Glu Phe Leu Arg Glu Lys Tyr Phe Ala Gin Lys Ala Gin II- II ~ 

1175 H80 
Asn Leu Gly Lys Glu Gly Lys Thr II- Lvf Air. Tyr Asp Scr Ho Thr 
1185 H90 ^ H95 1200 

Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala Val Val Arg Glu Phe 

1205 1210 1215 

Lys Glu Lyc He Leu Arg Tie Thr Lys Gin Glu Val A.-=n Thr Asp Glu 
1220 1225 1230 

(2> INFORMATION FOR SEQ ID NO: 517 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1175 amino acids 
<P) TYPE: amino acid 
<D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival Is 

Uk) FEATURE: 

(A) NAME/ KEY: miec_featurc 
<B> LOCATION 1. . .1175 

Ui) SEQUENCE DESCRI PTTON : SEQ ID NO: 51.7 
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Ser 
65 
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55 
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Leu 


Glu 


Gly 


Leu 
70 


Glu 


Arg 


Leu 


Thr 


Ser 
75 


Leu 


Ala 


Glu 


Leu 
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30 


Leu 


Leu 


Asp 
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Ser 
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Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 
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Leu 
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Leu 
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Leu 
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Asn 


Gin 


He 


Arg 
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Leu 
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Gly Asn Gin He Ser Lys I.cu Glu Gly Leu Glu Arg Leu Ser Ser Leu 

195 200 205 

Thr Lys Leu Arg Leu Arg Ser Asn Gin He Ser Lys Leu Glu Gly Leu 

210 215 220 

Glu Arg Leu Thr Ser Leu Thr Lys T.cu Ser Lou Ser Asp A^n Gin II o 
225 230 235 240 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 

245 250 255 

I.eu Leu Aep Afn Gin Ho Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr 

260 265 270 

Ser Leu Thr Lys Leu Arg Leu Arg Ser Aim Gin He Ser Lys Leu Glu 

275 280 285 

Gly J.cmj Affp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn 

220 295 3O0 

Gin lie Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu 
305 310 315 320 

Leu Tyr L°u Leu Asp Asn Gin He Arg Lys Leu Glu Gly Leu Asp Gly 

325 330 335 

Leu Ala Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gin He Ser Lys 

340 345 350 

Leu Glu Gly Leu Asp Arg Leu Lys Val Leu Arg Ly? Leu Asp Val Ser 

355 360 365 

Gly Asn Asp He Gin Ser He Asp Asp He Lys Leu Leu Ala Pro He 

370 375 380 

I.ou Glu nln Thr Leu Glu Ly3 Leu Arq lie Mis Asp Asn Pro Phe Val 
385 390 395 400 

Ala Ser Ser Gly Leu He L^u Ser Pro T^r Asp Asn His Leu Pro Glu 

4 05 410 415 

He Lys Ala Leu Leu Glu Lys Glu Lys Glu Lys Gin Lys Lys Thr Ser 

420 425 430 

Val Glu Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Aen His Ser 

435 440 445 

Ser Gly Lys Thr Thr Phe Leu Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr 

450 455 460 

Gin Lys Asn Thr His Val Leu Ser lie His Arg Ser Asn Asn Pro Asn 
465 470 475 480 

Ala lis Phe Tyr Asp Phe Gly Gly Gin Asp Tyr Tyr His Gly He Tyr 

485 490 495 

Gin Ala Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Tip Asp Ala 

500 505 510 

Lys Lys Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr 

515 520 525 

Leu Asn Phe Asn Arg Pro Tyr Trp Leu Gly Gin lie Ala Tyr Ala Cys 

530 535 54 0 

Asn Arg Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro 
54 5 550 555 560 

Gin Thr Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr 

565 57C 575 

Gly Ala Lys Ciln Gin Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu 

580 - 585 590 

Glu Glu He Tyr Val Ser Lou Glu Pro Lys Ala Asn Ser Ala Val His 

595 600 605 

Ala Leu Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg 

610 615 620 

Ser Lys Ser He Gin He Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala 
G25 630 635 64 0 

Leu Pro Thr He Ala Gly Asp Asn Lys His He Pro He Ser Leu Glu 

645 650 655 

Ala Leu Ala Ala Gin Leu Am Lys Gly Arg Ala Glu Asn Asp Leu Tyr 

660 665 " 670 

Thr He Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ser Leu Arg Gly 

675 600 6H5 

Glu Val Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp 

690 695 700 

Leu Asp Pro Ala Ala Phe Val Gin Met He His Gly Glu He Leu Gin 
710 715 720 

Lys Asp Asn He Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys 

725 730 735 

Lys Leu His Asn Leu Ser Ser Gly Ser He Phe Glu Glu Asp Gly Gin 

740 745 ** 750 

Asn Gly Asn Ket He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr 

755 760 765 

Glu Asp L^s Asp Cys Tyr Val He Pro Gly Tyr Leu Pro Leu His Ser 

770 775 7flC 

Asp Asp Glu Ala Tyr Lys Trp Leu The Leu Gly Phe Glu Arg Pro Asn 
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785 










7 90 










795 










800 


Phe 


Val 




Lys 


Phe 


Glu 


Arg 


Phe 


He 


Pro 


Phe 


Gly 


Ltu 


11* 


Asn 


Gin 










80S 










810 










81b 




He 


He 


Ala 


Tyr 


Tyr 


Gly 


Arg 


Glu 


Glu 


Gly 


Ala 


Leu 


Lys 


Arg 


Tyr 


Trp 








820 










825 
















Arg 


Asp 


Gin 


Val 


lie 


Phe 


Thr 


Ala 


Gly 


Arg 


Glu 


(let 


Asp 


Arg 


Gin 


Thr 






83S 










840 










845 








Leu 


Glu 


Gin 


Glu 


Glu 


GlU 


Lys 


GlU 


Gly 


Leu 


Pro 


Lys 


Thr 


Asn 


Ala 


Glu 




850 










855 










860 










Asp 


Tyr 


Gin 


He 


Trp 


He 


Lys 


Leu 


Asp 


Phe 


Thir 


Asp 


Leu 


Ala 


He 


Scr 


865 










870 










875 










880 


Val 


Phc 


Tic 


I>ys 


Glu 


Gin 


Arg 


Lys 


Thr 


Her 


Ala 


Lys 


Asp 


Met 


Gin 


Arg 










885 










8 90 










895 




I,ys 


Gl u 


Ala 


Thr 


11*2 


Leu 


Ser 


Asp 


Met 


Leu 


Asp 


Het 


Tyr 


Trp 


Asn 


Asn 








900 










905 










910 






He 


Tro 


Pro 


Arg 


Glu 


Gin 


He 


Gly 


Asp 


Lys 


Asp 


Thr 


Glu 


Gin 


Thr 


Arg 






915 










920 










925 








Ser 


Thr 


He 


Arg 


Glu 


Thr 


Asn 


Arg 


Lys 


Lys 


Arg 


Pro 


He 


Gin 


Asp 


Leu 




930 










935 










9-JO 










Tyr 


Leu 


Ser 


Cys 


Ala 


Gin 


Ala 


Asp 


Lys 


Asp 


Lsu 


Thr 


Glu 


Ser 


His 


Tyr 


9-Jo 










950 










955 










960 


He 


His 


Leu 


Gly 


Thr 


Leu 


Asp 


Asp 


Glu 


Sir 


Lye 


Thr 


Thr 


Ala 


Arg 


He 










965 










970 










975 




Ala 


AID 


Tyr 


Pro 


Leu 


Lys 


Ash 


Gly 


val 


lie 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 








990 










985 










990 






Gl u 


Val 


Ser 


Thr 


Arg 


Pro 


Tyr 


Lys 


His 


Leu 


Ser 


Val 


Asn 


Lys 


Asn 


Leu 






995 










1000 








1005 






Ala 


Thr 


Ala 


Lys 


Gin 


Tic 


Phe 


lis 


Ser 


Tyr 


Ser 


Lys 


Glu 


Asp 


Gin 


Thr 




1010 








1015 








1070 








Glu 


Leu 


Glu 


Thr 


Cys 


Leu 


Gin 


Phe 


Phe 




Pro 


Lsu 


Glu 


Lys 


Asn 


Gly 


1025 








1030 








1035 








1040 


Gl n 


Tie 


Glu 


He 


Tyr 


Tyr 


Asp 


Lys 


Leu 


Thr 


Lys 


the 


Glu 


Thr 


Pro 


He 










1045 








1050 








1055 


Hle 


Pro 


Glu 


He 


Axg 


Lyt 


Arg 


lie 


Val 


Glu 


Ala 


Asp 


Cys 


Tie 


He 


Ala 








1060 








1065 








1070 




Leu 


He 


Ser 


Gin 


Arg 


Tyr 


Leu 


Ala 


Thr 


Asp 


Tyr 


He 


Leu 


Asp 


His 


Glu 






1075 








1080 








1085 






Lsu 


Pro 


Val 


Phe 


Axg 


Glu 


Tyr 


Asn 


Lys 


Thr 


He 


Val 


Pro 


He 


Leu 


He 



1090 1095 1100 



Lys Pro Cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe 

3105 1110 1115 1120 

Aid Gin Lys Ala Gin He He Asn Leu Gly Lys Glu Gly Lys Thr He 

1125 1J30 1135 

Lys Ala Tyr Asp Ser He Thr Ala Ser Al** His Arg Asp Glu Asn' Trp 

1140 1145 1150 

Val Ala Val Val Arg Glu Phe T.y.<* Glu Lys He Leu Aro He Thr Lys 

1155 H60 1165 

Gin Glu Val Asn Thr Asp Glu 

1170 1175 

U) INFORMATION FOR SEQ ID HO: 519 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
tD} TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

ORIGINAL SOURCE: 

(A) ORGANISM: Porphyrons na# gingivalis 

(lxl FEATURE: 

{A) NAME/KEY: misc_fsature 

( B) LOCATION 1. . .229 

<xi) SEQUENCE DESCRIPTION: SEQ ID U0:5^8 

Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu Phe Set 

15 10 is 

Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg Ser Tyr 

20 25 30 

Phe Thr Gly Arg He Glu Lys Val Ser Lsu A»n Leu Gly Val Pro Pro 
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35 










40 










4b 






Val 


Ser 

50 


Thr 


Glu 


Val 


Trp 


Gly 
55 


[let 


Thr 


His 


Asp 


Ala 
60 


Asn 


Gly Leu 


Pro 


Fhe 


Glu 


lie 


Pro 


lie 


Sec 


Phe 


Ser 


Arg 


Phe 


Am 


Ser 


Gin 


Gly Asp 


lie 


65 










70 










75 






80 


Ala 


Thr 


Thr 


Tyr 


Tyr 


He 


Ala 


Asn 


Ser 


Glu 


Ala 


Thr 


Leu 


Asn Glu 


Trp 




















90 








95 


Cys 


Asp 


Tyr 


Ala 
100 


His 


Pro 


Gly 


Gl y 


Tie 
105 


Val 


Arg 


Val 


Glu 


G3y Arg 
110 


Phe 


Trp 


Ly* 


t ler 


Thr 


Tyr 


A.<rn 


lis 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


Cys Thr 


Arg 






115 










120 










125 


lie 


Thr 
130 


Phe 


Glu 


Asn 


Gin 


Glu 
135 


He 


Glu 


Gly 


Thr 


He 
140 


Val 


Leu He 


Pro 


Lys 


Pro 


Lys 


Val 


Ser 


Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


Pro Cys 


He 


145 










150 










155 






160 


Arg 


Thr 


Glu 


Ala 


Gly 


Arg 


Glu 


Phe 


He 


Leu 


Cys 


Glu 


Glu 


Asp Asp 


Thr 










165 










170 






175 




Phe 


Val 


Ser 


His 


Asp 


Gly 


Asn 


Glu 


Val 


Thr 


lie 


Gly 


Gly 


Lys Pro 


Phe 








ISO 










185 








190 




Leu 


Leu 


Asn 


Thr 


Asn 


Val 


Lye 


Ho 


Vol 


Gil- 


Asp 


Val 


Ser 


Gin Lys 


Tyr 






195 










200 






205 




Ala 


Val 


Gly 


Val 


Gly 


Glu 


lie 


Arg 


Phe 


Leu 


Gin 


He 


Cys 


Ala Gin 


Thr 




21 0 










215 










220 






Val 


Ser 


Gin 


Gin 


Lys 























225 



(2 J IHFOR11ATJOII FOR SEQ ID IIO:5lo 

(i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acid- 
(R) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(iii> HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

r ix ) FEATURE : 

(A) NAME/ KEY: misc_feature 
(D) LOCATION 1...228 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 51 9 



llet 


Lys 


Lys 


Ala 


Phe 


Val 


Phe 


V*l 


Leu 


Leu 


Val 


Cys 


Leu 


Pie 


Ser 


Ser 


1 








5 










10 








15 




Phs 


Sec 


Ser 


Ser 


Ala 


Gin 


Thr 


Thr 


Thr 


Asn 


Ser 


Ser 


Arg 


Ser 


Tyc 


Phe 








20 










25 








30 




Thr 


Gly 


Arq 


He 


Glu 


Lys 


Val 


Ser 


Leu 


Asn 


Leu 


Gly 


Val 


Pro 


Pro 


Val 






35 










40 








45 








Ssr 


Thr 

SO 


Glu 


Val 


Trp 


Gly 


HeL 
£5 


Thr 


His 


Asp 


Ala 


Asn 
60 


Gly 


Lau 


Pro 


Phe 


Glu 


Tie 


Pro 


He 


Ser 


Phe 


Ser 


Arg 


Phe 


Asn 


Ser 


Gin 


Gly 


Asp 


He 


Ala 


65 










70 










75 






60 


Thr 


Tnr 


Tyr 


Tyr 


He 
85 


Ala 


Afn 


Ser 


Glu 


Ala 
90 


Thr 


Leu 


Asn 


Glu 


Trp 
95 


Cys 


Asp 


Tyr 


Ala 


His 


Pro 


Gly 


Gly 


He 


Val 


Axg 


Val 


Glu 


Gly 


Arg 


Phe Trp 








100 










105 










HO 






Lys 


Met 


Thr 


Tyr 


Asn 


Tl e 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


Cys 


Thr 


Arg 


He 






115 










1 20 










125 






Thr 


Phe 


Glu 


Asn 


Gin 


Glu 


He 


Glu 


Gly 


Thr 


He 


Val 


Leu 


lie 


Pro 


Lys 




13C 




















140 








Pro 


Lys 


Val 


Ser 


Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


ero 




He 


Arg 


145 










150 










155 










160 


Thr 


Glu 


Ala 


ciy 


Arg 


Glu 


Fhe 


He 




Cys 


Glu 


Glu 


J\fp Asp 


Thr 


Phe 










165 










170 










175 




Val 


S-r 


His 


Asp 


Gly Asn 


Glu 


Val 


Thr 


He 


Gly 


Gly 


Lys 


Pro 


Phe 


Leu 








180 










185 










1?C 






Leu 


Asn 


Thr 
195 


Asn 


V„l 


Ly t 


He 


Val 
200 


Gly 


Asp 


Val 


Ser 


Gin 

205 


Lys 


Tyr 


Al n 


Val 


Gly Val 


Gly 


Glu 


He 


Arg 


Phe 


Leu 


Gin 


He 


Cys 


Ala 


Gin 


Thr 


Val 




210 










215 










220 










Ser 


Gl n 


Gin 


Lys 
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{2 1 IMFORI1ATION FOR SEQ ID MO: 520 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 0 amino acids 

( B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(il) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonaa qinqivalis 

(ix) FEATURE: 

(A) MAI IE/ KEY: misc_f eature 

(B) LOCATION 1 . . .54 0 





{xi ) 


SEQUENCE DESCRIPTION: SEQ ID NO: 520 












Lys 


Thr 




Val 


L**u 


Ar 


Y- 


Phe 


Val 


Val 


Al a 


Ala 


Phe 


Ala 


\al 


1 








5 










10 










15 




Ala 


Thr 


Leu 


Cy.T 


Pro 


Leu 


Ala 


Gin 


Ala 




Thr 


[ i e t 


Gly 


Gly 


AiFp 


Asp 








20 










25 










30 




Val 


Lys 


Val 


Val 


Gin 


Tyr 


Asn 


Gin 


Glu 


Lys 


Leu 


Val 


Gl't 


Thr 


Arg 


Met 






35 










40 










45 








S-r 


Val 


Ala 


Asp 


Asn 


Gly 


Trp 


lie 


Tyr 


Val 


Uet 


Thr 


Hie. 


Ser 


Gly 


Tyr 




50 










55 










60 










Asp Thr 


Gly 


Asn 


Ser 


Asn 


Val 


Lys 


lie 


Phe 


Arg 


Ser 


Lys 


Asp 


Gin 


Gly 


65 










70 










75 










80 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Arg 


Asp 


Trp 


Asp 


Pro 


Ser 


Asp 


Asp 


Tyr 












85 










90 










95 




Phe 


Gin 


Asp 


Phi 


Asp 


He 


Val 


Val 


Thr 


<jly 


Lys 


Asn 


Glu 


Ser 


Asp 


He 






100 










105 










110 






Lys 


lie 


Trp 


Ser 


Val 


Glu 


Leu 


Met 


Asn 


Lys 


Pro 


Gly 


Gly Tyr 


i.ys 


Ser 






115 










120 










125 








Arg 


val 


Ala 


Val 


Phe 


Ser 


Arg 


Asp 


Ala 


Asn 


Ala 


Gin 


Asn 


Ala 


Lys 


Leu 




130 










135 










140 










Val 


Tyr 


Lys 


Glu 


Asp 


Phe 


Ser 


Asn 


Val 


Gin 


Leu 


Tyr 


Asp 


Val 


Asp 


lie 


145 










150 










155 










160 


Ala 


ser 


A?n 


Tyr 


Arg 


Ser 


Pro 


Ser 


Ser 


Leu 


Asn 


Asn 


Gly Gly Asn 


Pro 










165 










170 










175 




Phe 


Ala 


Leu 


Ala 


Phe 


Ala 


Tyr 


Thr 


Gly 


Phe 


Asn 


Asn 


Thr 


Hi.-* 


Lys 


He 








100 










185 










190 






Sor 


Phe 


Vol 


Asp 


Tyr 


Val 


Phe 


Ser 


Leu 


Asn 


Gly 


Gly 


Gin 


Asn 


Phe 


Asn 






195 










200 










205 








Lys 


Asn 


Leu 


Leu 


Phe 


Ser 


Gin 


Asp 


Gly 


Glu 


L;-s 


Lys 


He 


Asp 


Lys 


Val 




210 










215 










220 










Asp 


Leu 


Ser 


lieu 


Gly 


Ser 


Thr 


Ser 


Glu 


Ser 


tlet 


Gly 


His 


Asn 


Ala 


Trp 


225 










230 










235 










240 


Pro 


Leu 


Met 


Gly 


Val 


Val 


Phe 


Glu 


Met 


Asn 


Lys 


Gin 


Gly 


Gl y 


Lys 


S-r 










245 










250 










255 




Asp 


He 


Gly 


Phe 


Leu 


Ser 


Asn 


Phe 


Val 


Asp 


Asn 


Asp 


Pro 


Glu 


Phe 


Gin 








250 










265 










270 






Trp 


Ser 


Gly 


Pro 


He 


Lys 


Val 


Ser 


Glu 


Ser 


Asp 


Met 


Ser 


Phe 


Ser 


Pro 






275 










280 








285 








Lys 


He 


Gin 


llet 


Leu 


Leu 


Asp 


Glu 


Asp 


Asn 


Asn 


Thr 


He 


Asn 


Gly 


Glu 




290 










295 










300 










Ser 


Cys 


His 


Asn 


Phe 


I1et 


He 


Thr 


Tyr 


Ser 


Asp 


Tyr 


Asp 


Ser 


Glu 


Tyr 


305 










310 










31 5 










320 


Ser 


Asp Trp 


Asp 


He 


Arg 


Tyr 


Val 


Tyr 


Pro 


Lys 


L> s 


Ser 


Phe 


Lys 


Tyr 










325 










330 










335 




Glu 


Lys 


Gly 


Lys 


Thr 


Pro 


Thr 


Met 


Asp 


Asp 


Leu 


Val 


Glu 


Ala 


Phe 


Leu 








34 0 










34 5 










350 






Thr 


Ala 


Ser 


Tyr 


Gin 


Ser 


Glu 


Thr 


Asn 


Sec 


Gly 


Leu 


Gly Tyr 


Asp 


Lys 






355 










360 










365 






Asn 


Aia 


Asn 


tt\ n 


Tyr 


Leu 


He 


Thr 


Tyr 


Ala 


Lys 


Lys 


Glu 


Glu 


Asn 


Gly 




370 










37 5 










380' 








Thr 


Asn 


Thr 


Leu 


Lys 


Tyr 


Arg 


Trp 


Ala 


Asn 


Tyr 


Asp 


Lys 


He 


His 


Asn 


365 










390 










325 










400 


Lys 


Asp 


Leu 


Trp 


Ser 


Asp 


Thr 


Phe 


Thr 


Tyr 


Thr 


Ser 


Ser 


Ala 


Asn 


Ala 










4 05 










410 










415 




Leu 


Tjr 


Thr 


Pro 


Gin 


Val 


Asp 


He 


Asn 


Pro 


Thr 


Lys 


Gly 


Leu 


V«»l 
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420 










425 










430 






Trp 


Ser 


Trp 
435 


Val 


Glu 


Tyr 


Leu 


Pro 
440 


Cli- 


Lyj 


Arg 


lie 


Val 
445 


Trp 


Ser 


Asp 


Thr 


Gin 
4 50 


Trp 


Thr 


His 


Ala 


Am 
455 


Gly 


val 


Glu 


A-p 


Tie 
460 


Val 


Met 


Gin 


Glu 


Gly 


Ser 


list 


LV* 


Leu 


Tyr 


Pro 


Asn 


Pro 


Ala 


Gin 


Glu 


Tyr 


Ala 


Val 


lie 


465 










470 










475 










400 


Scr 


Leu 


Pro 


Thr 


Ala 
485 


Ala 


Asn 


Cys 


Lys 


Ala 
490 


Val 


Val 


Tyr 


A.^p 


Met. 
4 95 


Gin 


Gly 


Arg 


Val 


Val 
500 


Ala 


Glu 


Ala 


Ser 


Phe 
505 


Ser 


Gly 


Asn 


Glu 


Tyr 
510 


Arg 


Leu 


Asn 


Val 


Gin 


His 


Leu 


Ala 


Lys 


Gly 


Thr 


Tyr 


He 


T.eu 


Lys 


Val 


Val 








515 








520 










525 








Asp 


Thr 

530 


Glu 


Arg 


Phe 


Val 


Glu 
535 


Lys 


Leu 




Val 


Glu 
540 











(21 IMEOR11ATIOH FOR. SEQ ID HO: 521 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 771 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona*: gingivalir 

(ix) FEATURE: 

(A) NAI IE/KEY: mi sc f eature 

(B) LOCATION 1. , .771 



(j:i) SEQUENCE DESCRIPTION: SEQ ID HO: 521 



Met 


Leu 


Thr 


11c 


Arg 


Asn 


Phe 


Leu 


Leu 


Phe 


Cys 


Cys 


Leu 


Ser 


Leu 


lie 


1 








5 










10 










15 




Ala 


Phe 


Ala 


Ala 


Asp 


Ala 


Gin 


Ser 


Ser 


Val 


Ser 


S~r 


Gly 


Arg 


Arg 


Leu 








20 










25 










30 






Thr 


Glu 


Tyr 


Val 


Asn 


Pro 


Phe 


lie 


Gly 


Thr 


Ala 


Acn 


Tyr 


Gly 


Thr 


Thr 






35 










40 










45 








Asn 


Pro 


Gly 


Ala 


Val 


Leu 


Pro 


Asn 


Gly 


Leu 


Met 


Ser 


Val 


Thr 


Pro 


Phe 




50 










55 










60 










Asn 


Val 


Ser 


Gly 


Ser 


Thr 


Glu 


Aen 


Arg 


Pho 


Arp 


Lys 


Asp 


Ser 


Arg 


Trp 


65 










70 










75 










80 


Trp 


s*r 


Ala 


Pro 


Tyr 


Ser 


Ala 


Asp 


Asn 


Ser 


Tyr 


cys 


He 


Gly 


Phe 


Ser 










85 




















95 




His 


Val 


Asn 


Leu 


Ser 


Gly 


Val 


Gly 


Cys 


Pro 


Glu 


Leu 


Ser 


Gly 


He 


Leu 








100 










105 










110 






Leu 


Het 


Ala 


Thr 


Ser 


Gly 


Thr 


Phe 


Asp 


Pro 


Asp 


Tyr 


Cys 


Cys 


Tyr 


Gly 






115 










120 










125 








Ser 


Ser 


Leu 


Ser 


Arg 


Glu 


Tyr 


Ala 


Arg 


Pro 


Gly 


Glu 


Tyr 


Lys 


Ala 


Val 




130 










135 










140 






Leu 


f\sp 


Lys 


Tyr 


Gly 


lie 


Asp 


Ala 


Ala 


Val 


Thr 


Val 


Thr 


Glu 


Arg 


Thr 


145 










150 










155 










160 


Ala 


Leu 


Thr 


Glu 


Phe 


Ala 


Phe 


Pro 


Glu 


Gly 


Glu 


Gly 


His 


lie 


Leu 


Leu 










165 










17 0 










175 




Am 


Leu 


Jly 


Gin 


Ala 


Leu 


Ser 


Asn 


Glu 


Ser 


Gly 


Ala 


Ser 


V^l 


Arg 


Phe 








100 










185 










190 






Leu 


Asn 


A£p 


Ser 


Thr 


Val 


Val 


Gly 


Ser 


Arg 


Leu 


Het 


Gly 


Thr 


The 


Cys 






195 










200 










205 








Tyr 


Asn 


Pro 


Gin 


Ala 


Val 


Phe 


Arg 


Gin 


Tyr 


Pne 


Val 


Leu 


Gin 


Val 


Ser 




210 










215 










220 










Arg 


Arg 


Pro 


lie. 


Ser 


Ala 


Gly 


Tyr 


Trp 


Lys 


Lys 


Gin 


Pro 


Pro 


Met 


Thr 


225 










230 










235 










240 


Val 


Glu 


Ala 


Gin 


Trp 


Atrp 


Sec 


Thr 


Ala 


Gly 


Lys 


Tyr 


I.ys 


Gin 


Tyr 


Asp 










245 










250 










255 


Gly Tyr 


Lys 


Axq 


Glu 


Het 


Ser 


Gly 


Asp 


Asp 


He 


Gly. Val 


Arg 


Phe 


Ser 








?60 










265 










270 






Phe 


Asn 


Cys 


Arp 


Gin 


Gly 


Glu 


Lys 


lie 


Tyr 


Val 


Arg 


Ser 


Ala 


Val 


Ser 






27 5 










280 










285 








Phe 


Val 


Ser 


Glu 


Ala 


Asn 


Ala 


Leu 


Tyr 


Acn 


Leu 


Glu 


Ala 


Glu 


Gin 


Gl ti 




290 










295 








300 










Glu 


Val 


Phe 




Ser 


Val 


Gly 


Gly 


Asn 


Pre 


Ala 


Lys 


Ala 


Phe 


Ser 


Aid 
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305 










310 










315 












lie 


Arg 


Ser 


Arg 




He 


Gl u 


t\cq 


rrp 


Glu 


Glu 


Ala 


Leu 


Gly 


Thr 


Val 










325 




















335 




Glu 
U 


Val 


Glu 


Gly 


Gly 


Thr 


Pro 


At p 


Gl u 


Lys 


Thr 


He 


Phe 


Tyr 


Thr 


Al a 








340 










34 b 










350 






Leu 


Tyr 


His 


Leu 


I ~u 

,o\ 


Hp 


s 




Asn 


He 


Leu 


Gin 


Asp 


Ala 




Gi V 






355 










360 










365 






Glu 


Tyr 


Pro 


Met 


f let 


G) v 
«a y 


Set 




T • - v 


The 


Gly 


Asn 


Thr 


Ala 


His 






370 










375 










380 








Arg 


Tyr 


Thr 


Val 


Phe 


Sec 


Leu 


<|<rn 

l rp 


rt-p 


Thr 


Tyr 


Arg 


Asn 


Val 


His 


Pro 


385 










390 










3 95 












Leu 


Leu 


Cys 


Le 


Leu 


* 1 r 


Pro 


Gl 


Lys 


Gin 


Leu 


Asp 


Met 


Val 


Arg 


Th° 
T r 










4 05 










410 










415 




Leu 


He Asp 


net 




Ar 


Gl u 


er 


oj.y 


Trp 


Leu 


Pro Arg 


Trp 


Glu 


Leu 








420 










4 25 










430 






Tyr 


Gly 


Gin 


Glu 


Thr 


Leu 


Tn r- 


He t 


(~1 ii 

oX TJ 


Gly I\cp 


Pro 


Scr 


Leu 


He 


Val 






435 




















445 








7 1 *"* 


Asn 


Asp 


Th r 


Trp 


Cln 


Arg 


t»l y 


Leu 


Arg 


Ala 


Phe 


Asp 


Thr 


Ala 


Thr 




450 










4 55 










460 










Ala 


Tyr 


Glu 






Lys 


Lys 




Al a 


Ser 


Sor 


Ala 


Gly Ma 


Th r 


His 


4 65 










47 0 










475 










4 80 


Pro 


lie 


Axg 




Asp 


Asn 


A.»p 


sp 


Tyr 


Leu 


Thr 


Leu 


Gly 


Phe 




Pro 










4 85 










4 90 










1 




eu 


Arq 


Glu 


Gin 


Tyr 






Ser 


Val 


Set 




Ala 


Leu 


Glu 


Tyr 


A ,J r 








500 










SOS 










510 






Leu 


Ala 


Asp 


Trp 


Asn 




Ser 


Arg 


Phe 


Ala 


His 


Ala 


Leu 


Gly 


His 


Lys 






515 










52 0 










£25 










Asp 


Ala 




Leu 




Gly 


Lys 


Arg 


Ser 


Lou 


Gly Tyr 


Arg 


Hi f 


Tyr 




530 




















54 0 










Tyr 


Asn 


Lys 


G1 u 


T-^r 


Gly 


et 


Leu 


Cys 


Pro 


Leu 


Leu 


Pro 


Asp 


Gly 


Ser 


545 










550 










555 










560 


Phe 


Ley 


Thr 


Pro 


Phe 


Asp 


Pro 


Lys 


Gl n 


Gly 


GlU 


Asn 


Phe 


Glu 


Pro 


Asn 










565 










57Q 










57 5 




Pro 


Gly 


Phe 


His 


Glu 


Gly 


Ser 


Ala 


Tyr 


Asn 


Tyr 


Ala 


Phe 


Phe 


Va 1 


Pro 








580 










58S 










590 






Hi S 


Asp 


He 


Gin 


Gly 


Leu 


Ala 


Arg 


Leu 


Met 


Gly 


Gly 


Ala 


Lys 


Val 


Phe 






595 










600 










605 








Ser 


Glu 


Arg 


Leu 


Gin 


Lys 


Val 


Phe 


A^p 


Glu 


Gly 


Tyr 


Tyr 


Asp 


Pro 


Thr 




610 










615 










620 








As ii 


Glu 


Pro 


Asp 


lie 


Ala 


Tyr 


Pro 


Tyr 


Leu 


Phe 


Ser 


Tyr 


Phe 


Pro 


Lys 


625 










630 










635 










640 


Glu 


Ala 


Trp 


Arg 


TUr 


Gin 


Lys 


Leu 


Thr 


Arg 


Glu 


Leu 


He 


Asp 


Lys 


His 










645 










650 










655 




Phe 


Cys 


Asn 


Ala 


Pro 


Asn 


Gly 


Leu 


Pro 


Gly 


Asn 


Asp 


Asp 


Ala 


Gly 


Thr 








660 










665 










670 






Met 


Ser 


Ala 


Trp 


Leu 


Val 


Tyr 


Ser 


Met 


Leu 


Gly 


Phe 


Tyr 


Pro 


Asp 


C\ s 






675 










690 










605 








Pro 


Gly 


Ser 


Pro 


Thr 


Tyr 


Thr 


Lfu 


Thr 


Ser 


Pro 


Val 


Phe 


Pro 


Arg 


Val 




690 










655 










700 










Arg 


11- 


Arg 


Leu 


Am 


Pro 


Gin 


Tyr 


Tyr 


Pro 


Gin 


Gly 


Glu 


Leu 


He 


11- 


705 










710 










715 










720 


Thr 


Th r 


Asn 


Thr 


Glu 


Asn 


Gin 


Pro 


Thr 


Asp 


Ser 


He 


Tyr 


He 


His 


Thr 










725 










730 










735 




Val 


Gee 


Leu 


Gly Asn 


Lys 


Thr 


Leu 


Pro 


His 


Gly 


Thr 


Arg 


His 


He 


Ser 








740 










745 






750 






His 


Ala 


Asp 


Leu 


Val 


Arg 


Cys 


Gly 


His 


Leu 


Arg 


Tyr 


Glu 


Leu 


Ser 


Asn 






755 










760 










765 








Arg 


Pro Arg 






























770 































12) II I FORMATION FOR SEQ ID MO: 522 

(i) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 77 6 amino acids 
(B ) TYPE: amino acid 
ID) TOPOLOGY : linear 

(il) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

O'i) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas yingivalis 

tlx) FEATURE : 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/0I023 



477/ 490 

(A) UAHE/KEY: miscfeature 

(B) LOCATION 1. . .776 

SEQUF.tiCE DESCRIPTION: SEP ID HO:522 

Met Cys Lys lie Arg Phe Ser Leu T.~u 11 n Ala Leu Val Val Cys Leu 

1 5 10 is 

T.cu Phe Thr Ser The Ser Leu Gin Ala Gin Glu Glu Gly He Trp Asn 

20 25 30 

Thr Leu Leu Ala He His Lys Thr Glu Lys Ala Val Glu Thr Pro Ly« 

35 40 45 

Lys Val Phe Ala Val Ala Asn Gly Val Leu T,r Ser Val Gly Lvs Glu 

SO 55 60 

Ala Pro His Glu Ala Lis He Phe Ap P Arg He Ser Gly Leu S-r Asp 
65 70 75 8 n 

Thr Ser Vol Sec Ser He Ala Tyr Ser Glu Glu Leu Lys Ser Leu Val 

B5 ?0 9b 

He Tyr Tyr Ala Ser Gly Asn He Asp He Leu Asp Glu Ala Gly Arg 

100 105 110 

Val Thr Acn Val Pro Ala Leu Lys Asp Asn He Asp Leu He Asp Lys 

115 120 125 

Thr Leu Asn Arg Leu L*u He Val Gly Asn Arg Ala Tyr Leu Ala Glv 

130 135 ]40 

Gly Phe Gly Leu Ser Val Leu Asp Val Ala Glu Ala Arg He Pro Ala 
145 ISO 155 160 

Thr T^r Ala Lys Gly Thr T.ys Val Thr Asp Val Ala Lys Leu Asp Atn 

165 1.-70 175 

A«p Arg Leu Leu Met Leu Lys Glu Gly Gin Leu Phe He Gly I.ys Glu 

180 185 ISO 

Thr Asp Asn L«u Gin Asp Pro Ala Ala Trp Thr Ala Leu Ser Leu Asn 

195 200 205 

Leu Pro llet Gly Ser Val Thr Gly Leu Gly 11= Val Gly Glu Arp He 

210 215 220 

Cys Phe Leu Leu Ala Asp Gly Arg Val Tyr Val Ala Ala Asn Gin Ser 
225 230 235 240 

Phe Glu Pro Glu Leu Leu Leu Ser Ser Ser Ala Asp Ser Arg Leu Tyr 

245 250 255 

Val Thr Asp Axg Gly Leu Phe He Cys Ala Glu Asn Arg He Tyr Ph~ 

260 265 270 

He Glu Lys Gly Arg Lys Thr Thr Gin Phe Pro He Ala Asp Val L-u 

275 280 285 

Gly Val Gly Ala Met Asn Glu Ser Asn Thr Ala Tyr He Ala Leu Gly 

290 295 3 00 

Glu Glu Gly Leu Ala Ser Leu Leu Leu Ala Glu Gly Ser Thr Ala Glu 
305 310 315 320 

Ala Met Pro Val Ala Phe Asp Gly Pro Gly Asp Asn Asp Phe Tyr Glu 

325 330 335 

Met Arg Phe Ser His Gly Aru Leu Tyr Ala Ala Sir Gly Leu Trp Gly 

340 3cJ5 3qQ 

Thr Asn Leu Met Gly His Ala Gly Met Vax Lys Leu Tyr Asp Gly Asn 

355 360 365 

Arg Trp Thr Asn Phe Asp Lys Lys Thr Val Gin Glu Gin Leu Glv Gly 

370 375 3eo 

Gly Phe Ser Phe Asn Asp Ala lie Asp He Ala Val Ser Asn Gly Aep 
38S 390 395 400 

Pro Asp His Phe Phe Val Gly Thr Trp Gly Asn Gly Leu Phe Glu Fh- 

405 410 415 

Lye Asp Gly Lys Ala He Ala Arg Tyr Ser Gly Asn Glu Thr Ala Tlf* 

*»2Q 425 430 

Ala Glu Cys Asn Pro Gly Asp Ala Arg Val Lys Ala He Ala Phe Asp 

435 440 44S 

Asn Lys Gly Asn Leu Trp Gly Thr Leu Gly Ala Val Gly Lys Asn He 

450 455 460 

Phe Met Tyr Asp Pro Gin Ser Ser Thr Trp His Ser Phe Ser Tyr Pro 
465 470 4"5 4B0 

Asp Val Ala Asn Leu Ala Ser Phe Gly Acn llet Tie He Leu Pro Asn 

4 90 495 
Gly Asp Lys Trp Val Asn Ilf> Leu His Arg Ser Gly Gly Ser Thr Arg 

500 505 510 

Lys Gly Val Leu He Phe Asn Asp Axg Gly Thr Pro Glu Thr Thr S-r 

515 520 525 

Asp Asp Ser His Leu Tyr Val Glu Gin Phe Val Asn Arg Leu Gly Ala 

530 53S 540 
Ala He Gly His Lys Thr He Tyr Ala Met Ala Val Asp His Asn Gl~ 
545 550 



555 560 
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Ser 


Val 


Trp 




Gly 


Ser 


Asp 


He 


Gly 


He 


Phe 


Gly, 


Val 


Tyr 


Asn 












565 










57 0 










575 




Ala 




Va 1 




Ser 


Ser 


Thr 


Ser 


Thr 


Pro 


He 


Al a 


Val 


Arg 


Pro 


Val 








500 










585 










590 






Gly 


Gly 


Glvi 


Glu 


Pro 


Asn 


Leu 


Tyr 


Tyr 


Val 


Leu 


A*p 


Lys 


Val 


Thr 


Val 






595 










600 










605 








Thr 


Asp 


He 


Val 


Val 


Asp 


Lys 


Leu 


Asr: 


His 


Lys 


Trp 


Val 


Ala 


Thr 


Gin 




610 










615 










620 










Gly 


Thr 


Gly 


Leu 


Tyr 


Lau 


Leu 


ser 


Glu 


Asp 


Cys 


Ser 


Lys 


He 


Leu 


Ala 


625 










630 










635 










64 0 


Gin 


Phe 


Thr 


Val 


Glu 


Asn 


Ser 


Pro 


Leu 


Leu 


Ser 


Asn 


Asn 


He 


Leu 


Scr 










64S 










650 










655 




Leu 


Ala 


Leu 


Aen 


Asp 


Asp 


Asn 


Gly 


Leu 


Leu 


Tyr 


He 


Gly Thr Ala 


Asp 








660 










665 










670 






Gly 


Leu 


Met 


Thr 


Phe 


Gin 


Thr 


Gly 


Thr 


Gly Ser 


Gly 


Ser 


Ala 


Ser 


Glu 






675 










680 










605 








Leu 


Asp 


Gly 


Val 


Tyr 


Val 


Tyr 


Pro 


Asn 


Pro 


Leu 


Arg 


Pro 


Glu 


Tyr 


Pro 




690 










695 










700 










Asp 


Gly 


Val 


Thr 


He 


Ala 


Gly 


Leu 


Gin 


Ala 


Gly 


Cys 


Ser 


Val 


Lys 


He 


705 










710 










715 










720 


Thr 


Asp 


Thr 


Thr 


Gly Arq 


Leu 


Leu 


Tyr 


Gin 


Thr 


Glu 


Ser 


Val 


Thr 


Thr 










7?S 










730 










735 




Glu 


Val 


Lys 


Trp 


Asn 


Ala 


Arg 


Gly Ala 


Asp 


Gly 


Asn 


Arq 


Val 


Ala 


Ser 








140 










7-1S 










750 






Gly 


Val 


Tyr 


Ala 


Val 


Ala 


Val 


Tyr 


Asp 


Pro 


Val 


Ser 


Lys 


Lys 


Ser 


Lyc 






755 










760 










7 65 








Leu 


He 


Arg 


Phe 


Ala 


Val 


He 


Arg 



















77 0 77 5 



(2) 1 M FORI tAT I OH FOR SEQ ID MO t 523 

{1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1150 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

tlx) FEATURE: 

(AJ MAI IE/KEY: mis cofeature 
(B> LOCATION 1...H58 

UiJ SEQUENCE DESCRIPTION: SEQ ID NO: 523 



Met 


Lye 


Arg 


He 


Leu 


Pro 


He 


Val 


Ala 


Phe 


Leu 


Ser 


Leu 


Phe 


Leu 


Ala 


1 








S 










10 










15 




Leu 


Ala 


Leu 


Pro 


Ala 


Lys 


Ala 


Gin 


Arg 


Ala 


Met 


Gly 


Lys Thr Ala 


Asp 








20 










2b 










30 






Arg 


Ser 


Leu 
35 


Met 


Ala 


Ser 


Gly 


His 
40 


Trp 


Val 


Lys 


He 


Arg 
45 


Val 


Asp 


Ala 


Ser 


Gly 
50 


Val 


Tyr 


Arg 


Leu 


Thr 
55 


Asp 


Glu 


Gin 


Leu 


Arg 
60 


Ala 


Asn 


Gly 


Phe 


Ser 


Asp 


Pro 


Ser 


Lys 


Val 


Gly 


Val 


Phe 


Gly 


Tyr 


Gly 


Gly 


Gly 


Val 


L=u 


65 










70 










75 










ao 


Pro 


Glu 


Asp 


Leu 


Ser 
85 


Arg 


Tie 


Thr 


Thr 


Asp 
90 


Asp 


Leu 


Pro 


Pro 


Val 

95 


Pro 


Val 


Leu 


Arq 


Gin 

100 


Gly 


Asn 


Ala 


Leu 


Tyr 
105 


eh* 


Tyr 


Ala 


Val 


Gly 
110 


Pro 


Val 


Thi 


Trp 


Fhe 
115 


Tyr 


Asn 


Pro 


Ala 


Lys 
120 


Thr 


Thr 


Met 


Glu 


His 
125 


Thr 


val 


Asn 


Thr 


Tyr 


Ser 


Thr 


Hxs 


Gly 


Tyr 


Tyr 


Phe 


Leu 


Sec 


A*p 


Aid 


Ala 


Gly Ala 




130 










135 










140 










Pro 


Leu 


Gin 


Met 


S«r 


Gin 


Tyr 


Thr 


Gly 


Gly 


Gly 


Ala 


Ser 


Ala 


Glu 


Ala 


145 










150 










155 










160 


Leu 


He 


Asp 


Tyr 


Tyr 
165 


Asp 


Glu 


Leu 


Met 


Leu 
170 


His 


Glu 


Gin 


Glu 


Lftit 
175 


Tyr 


Ser 


Pro 


Lys 


Glu 
180 


Ser 


Gly 


Arg 


Asp 


Leu 
185 


Tyr 


Gly 


Glu 


Ser 


Phe 
1?0 


Ser 


Ala 


Val 


Asn 


Thr 


Arg 


Thr 


Val 


Lys 


Phe 


Pro 


Leu 


Arg 


Gly 


Asn 


Tin- 


Arg 


Ser 



195 200 205 
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Ser Gly Glu Leu Gly Thr Val Phe Ser Tyr lie Ala Lys Ala Arg Ser 

210 215 2? o 

A] a Gly Gly Gly Arg Glu Met Ser Leu Ser Ala Asn Gly lie Leu lie 
225 230 23b 240 

Phe Ser Asp Pro Phe Ser H«t Thr Ser Asn Glu Val Sec Asn Ser Tyr 

245 250 2SS 

Leu Ala Gly Lys Lys Arg Arg Leu Tyr His Ser Thr Pro Met Asn Ser 

260 265 270 

Leu Val Asn Glu Leu Arg Leu Asp Ala Asn Tyr 3er Met Thr Gly Asp 

2/b 280 285 

Ala Val Asn Leu Asp Fhe He Glu Val Ala Thr Gin Aen Aso Leu Arq 

290 2 95 300 

Tyr Asp Gly Ala Pro Met His lie Arg Arg Phe Ser Asn Leu Pro Val 
305 310 315 320 

Leu Gly Gly Glu Ser Cys Arg Phe Val lis Ser Glu Val Pro Glu Ser 

325 330 335 

Leu Val Val Leu Gin Ala Asn Ser Ser Leu Thr Ala Ser Leu Val Pro 

340 345 350 

Val Lys Thr Val Gly Asp Lys Thr lie Glu Phe Val Ala Pro Pro Lys 

3S5 360 365 

Gly Gin Asp Arg Arg Thr lie Asn Thr Phe Tyr Ala Val Asp Leu Ser 

3V0 375 380 

Gin Ala Ser Ala Pro Glu lie Leu Gly Ala Val Pro Asn Gin Asn Leu 
385 390 395 400 

Hie Gly olu Glu Tit- Pro Asp Leu He He Val Ser Thr Gin Ala Leu 

405 410 415 

Leu Leu Glu Ala Asp Arg Leu Ala Thr Tyr Arg Arg Glu Lys Asn Gly 

420 425 430 

Leu Lys Val T.eu Val Val Leu Gin Glu Gin Val Fhe Asn Glu Phe Ser 

435 440 445 

Gly Gly Thr Pro Asp Ala Thr Ala Tyr Arg Leu Phe Ala Lys Met Phe 

450 455 450 

Tyr Asp Arg Trp Lys Ala Asn Ala Pro Val Gly Glu Thr Phe Pro H«*L 
465 470 475 480 

Gin Met Leu Leu Phe Gly Asp Gly Ala His Asp Asn Arg Lys Val Ser 

405 490 495 

Val Ala Trp Gin T.ys Pro Tyr Leu Gin Gin Thr Glu Phe Leu Leu Thr 

500 505 510 

Phe Gin Ala Val Asn Ser Thr Asn Val Asn Ser Tyr Val Thr Asp A/*p 

S15 520 525 

Tyr Phe Gly Leu Leu Asp Asp Gin Pro Ala Ser Val Asn He Gly Trp 

530 535 540 

Arg Asn Tyr Asn Met Ala Val Gly Arg Phe Fro Val Arg Thr Pro Ala 
545 550 555 560 

Glu Ala Arg He Ala Val Asp Lys Thr He Arg Tyr Glu Glu Asp Arg 

565 570 b/5 

Glu Ser Gly Ala Trp Arg lie Arg Ala Cys Phe Ala Ala Asp Asn Gly 

580 585 590 

Aep Lys His Ala Thr Glu Thr Ser Arg Leu He Asp Thr Val Lys Arg 

595 600 605 

Tyr Ala Pro Ala He Met Pro Val Arg Ala Phe Gin Asp Val Tyr Pro 

610 615 620 

His Val He Glu Asn Gly Leu His Ser He Pro Gly Ala Lys Lys Lys 
625 630 635 640 

Met Leu Glu Thr Leu Gin Ser Gly He He Leu Leu Asn Tyt Ala Gly 

645 650 655 

His Gly Gly Pro Ala Gly Trp Ser Asp Glu His Lou Leu Thr Leu Asn 

660 665 670 

Asp He Hit- Lys Phe Asn Tyr Lys His Met Pro He Trp He Thr Ala 

675 680 685 

Thr Cys Asp Phe Ala Asn Tyr Asp Ser Gin' Thr Thr Ser Air. Gly Glu 

690 695 700 

Glu Val Phe Leu His Glu T.ys Ser Gly Thr Pro He Met Phe Ser Thr 
70S 710 715 720 

Thr Arg Val Val Tyr Asn Thr Gin Asn Glu Lys Tie Asn Gly Phe H-t 

7 25 730 735 

Leu Arg Arg Mot Fhe Clu Lys Ala Lys Asp Gly Aro Tyr Arg Thr Met 

740 745 750 

Gly Glu He He Arg Ser Ala Lye Gin Gly Met Leu S-r Thr Val Phe 

755 760 765 

Pro Asp Ser Tic Asn Gin Leu Ser Phe Phe Leu Met Gly Asp Pro Ser 

770 775 780 

Val Arg [let Asn Leu Pro Thr His Lys Val Gin Leu Thr Ala He Asn 
785 790 795 e0Q 

Gly Gin Asp Pro Glu Gly Gin Tyr Gly Thr He Met Leu Lys Ser Leu 
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480/ 490 











805 










BIO 










015 




Glu 


A 


Val 


Ala 




Lvs 


Gly 


Lye 


Val 


Thr 


Asd 


Gl u 


Lys 


Glv 


Thr 










820 










825 










830 






Asd 


Gl u 


Thr 


Phe 


Ser 


Gl V 


Lys 


Val 


Phe 


Leu 


Thr 


Val 


Phe 


Asp 


Gly 


Arg 






835 










840 










845 








- 


Lys 


(let 


The 


Ala 


Leu 


Glu 


Glu 


Glu 


Glv 


Asn 


Asp 


Leu 


Ser 




VAl 




850 










B5b 










660 










Tyr 


Tyr 


A.-p 


Tyr 


Pro 


Asn 


Val 


Met 


Tyr 


Al a 


Gly 


He 


Ala 


Glu 


Val 


Lys 


865 










870 










81 S 










000 


Asp 


Gly 


Leu 


Phe 


Glu 


Thr 


Ser 


Phe 


He 


Val 


Pro 


Lys 


Asp 


Val 


Asn 


Tyr 










885 










8 90 










895 




Ser 


Glu 


His 


Glu 


Gly 


Arg 


He 


Asn 


Leu 


Tyr 


Ala 


Tyr 


Asn 


Glu 


Ser 


Thr 








900 










905 










910 






Lys 


Aid 


Glu 


Ala 


Met 


Gly 


Val 


Asp 


Phe 


Sec 


He 


Arg 


Val 


Gin 


Pro 


Gly 






915 










220 










925 








lie 


Pro 


Asp 


Glu 


Val 


Thr 


Glu 


Asp 


Asn 


Thr 


Pro 


Pro 


Glu 


He 


He 


Ser 




930 










935 










94 0 










Cys 


Phe 


Leu 


Asn 


Asp 


Ser 


Thr 


Phe 


Arg 


Ser 


Gly 


Asp 


Glu 


Val 


Asn 


Pro 


945 










950 










955 










960 


Thr 


Pro 


Leu 


Phe 


Met 


Ala 


Glu 


Val 


Phe 


Asp 


Leu 


Asn 


Gly 


He 


Asn 


He 










96S 










9?0 










97 5 




Thr 


Gly 


Scr 


Gly 


Val 


Gly 


His 


Asp 


He 


Thr 


Leu 


Cys 


He 


Asp 


Gly Arg 








980 










985 










990 






Ala 


A^p 


Leu 


Thr 


Tyr 


Asn 


Leu 


Asn 


Ala 


Tyr 


Phe 


Thr 


Ser 


Ser 


Ma 


Thr 
















1000 








1005 






Asp 


Ala 


Gly 


Val 


Gly 


Thr 


11 «= 


Leu 


Phe 


tlet 


He 


Pro 


Ala 


Leu 


Ala 


Glu 




1010 








1015 








1020 








Gly 


Asp 


His 


Thr 


Ala 


Arg 


Leu 


Thr 


Val 


Trp 


Atfp 


He 


Phe 


Asn 


Asn 


Ala 


1025 








1030 








1035 










Val 


his 


His 


Asp 


Phe 


Scr 


Phe 


Arg 


Val 


Val 


Asp 


Giy 


He 


Ala 


Pro 


Asp 










1045 








1050 








1055 


Val 


Ala 


Asp 


Val 


He 


Leu 


Phe 


Pro 


Asn 


Pro 


Val 


Arg 


Glu 


Ser 


Ala 


Thr 








1060 








1065 








1070 




Phe 


Arg 


lie 


Phe 


His 


Asn 


Arg 


Pro 


Gly 


Ser 


Asp 


Leu 


Asn 


Val 


Ala 


Val 






1075 








1080 








1085 






Glu 


lie 


Tyr 


Asp 


Phe 


Thr 


Gly 


Arg 


Leu 


Val 


Asn 


Ser 


Leu 


Pro 


Val 


Lys 




1090 








1095 








1100 








Thr 


TyL 


Ser 


Ser 


Scr 


Tyr 


Gly 


Glu 


Pro 


He 


Glu 


He 


Lys 


Trp Asp 


Leu 


11 OS 








1110 








1115 










Thr 


Ser 


Lys 


Tyr Gly Val 


Lys 


He 


Gly 


Asn 


Gly 


Phe 


Tyr 


Leu 


Tyr 


Arg 










1125 








1130 








113b 


Cys 


Val 


V-l 


Asn 


Scr 


Pro 


Gly Gly Gin 


Thr 


Ala 


Ser 


Met 


Ala 


Lys 


Lys 



1040 



1110 1145 1150 

Met He Val Val Gly Gin 
1155 

(2) IIIFORI'tATION FOR SEQ ID NO: 52 4 

<i) SEQUENCE CHARACTERISTICS: 

tA) LEHCTH: 961 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pcrphyromonas gingiva I is 

llr.) FEATURE : 

(A) NAME/ KEY* : miso_f eature 
(BJ LOCATION 1. . . 961 

(xl) SEQUENCE DESCRIPTION: SEQ ID HO: 52 4 

Iter Lyy Lys Leu Phe Pro Leu Lfiii Leu Leu He Leu Ser He Leu Val 

15 10 15 

Cly Cys Gly Lys Lys Glu Lys His Ser Val Thr Glu He Ala Arg Glu 

20 25 " 30 

Lyt L^s Arg Tin Thr Ala Leu Leu Tyr Glu Lys Glu Leu Pro Thr Asp 

35 40 45 

Ser Val Lys Gin L**u Tyr Glu Asn Ser Val Gin Asn Lys Asn Leu Val 

50 55 60 

Gly Gin tiet Leu Phe Ala He Glu Vsl Gly Lys Arg Met Arg Asn Met 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



481/490 

65 70 75 SO 

Ser Gin Tyr Thr Asp Ala Met I.eu Tyr His Gin Lys Gly Leu Asn Ala 

85 90 95 

Ala Leu Asn Leu Arg Asp Thr He Val Ala Ala Gin Ala Trp Asn Hi^ 

lOO 105 no 

Leu Gly Thr Asp Sor Arg Arg Tie Gly Ala Leu Ala Glu Ala Ser Asp 

115 120 125 

Tyr His Tyr Lys Ala Leu Ser Leu He Glu Ser Phe Ser Gly Asn Gin 
ISO 135 140 

Asn Arg Pro Ala He Lys Ala Arg Ser Ala Ala Leu Asn Gly lie Gl- 
" 5 150 155 16 0 

Asn He Asn Leu Glu Leu Gl> Tyr His Asp Glu Ala Glu Lys Asn Fh- 

165 170 175 

Leu Lys Ala Leu Gin Gly Glu Lys Glu Leu Asp Ser Pro Leu Gly Gin 

180 los 190 

Ala He Asn Tyr Ala Asn Leu Gly Arg Tic Tyr Arg Gin Arg Lys Glu 

195 ■ 200 205 

Tyr Asp Lys Ala Arg Thr Tyr Phe Leu Leu Ser Leu Glu Gin Asn Asn 

210 215 220 

Het Ala Glu Asn Leu net Gly He Gly Leu Cys Ser He Asn Leu Glv 
225 230 235 jmO 

Glu Val Asp Glu Glu Lys Gly Asp Tyr Gin Lys Ala Leu Gin Glu T'r 

245 250 255 

Ala Thr Ala Tyr Li s Leu Met Glu Gin Leu Ser Asn Arg Trp His Trp 

250 265 " 270 

Lou Arn Ser Cys He Pro Met Ala Arg He Asn Leu Lys Gin Glv Asn 

2?o 280 285 

Glu Arg Leu Tyr Gin His Phe He Ser Leu Ala Glu Gly Thr Ala Ly 

290 295 300 

Glu He Asn Ser Thr Ser His Leu He Glu He Tyr Asn Leu Gin T"i 
305 310 315 320 

Glu Asn Leu Glu Arg Lys Lys Glu Tyr Lys Gin Ala Leu Glu Ala Phe 

325 330 335 

Cys Leu Ser Lys Thr Leu Ser Asp Ser Met Ser He Ala His Lys Val 

340 345 350 

Ser S^r He Gin Glu Thr Arg Phe Asn Tyr Glu Arg Asn Lys S~r Gin 

355 360 3 6 5 

Lys Glu Leu Glu Glu He Gin Gin Val Ser Lys Ala Lys Gin Glu Lys 

3 ? 0 375 380 

Ser Lys Phe He Leu Leu Scr Thr Leu Fhe Ala Leu Phe II- Ser He 
385 390 395 400 

Leu Leu He Ser Val Leu Thr Tyr Ala Tyr Arg Gin Gly Lys Lys His 

405 410 415 

Asn Lys Leu He Lys Glu Thr Asp Lys Leu Arg Ser Gly Phe Phe Thr 

420 425 430 

Gly He Thr His Glu Phe Arg Thr Pro He Thr Val He Gin Gly Leu 

435 440 445 

Asn Glu Lys Met Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Arc Thr 

450 455 460 

Glu Leu His Lys He He Asp Arg Gin Ser Scr His Met Leu Asn Leu 
465 470 475 480 

Val Asn Gin Leu Leu Asp He Cys Lys He Arg Ser Gly Val Ser Thr 

4B5 490 495 

Pro Glu Trp Arg Asn Gly Asp He Val Ser Phe Val Gin II- L-u lis 

500 505 510 

Asp Ser Phe Ala Pro Tyr Ala Gin Ala Gin Asp He Thr L*u Glu Leu 

515 520 525 

Gin Pro Glu Sec Lys Pro He Val Val Asp Phs Val Pro Ser T V r L-u 

530 S35 540 

Gin Lys Tie Tie Ser Asn Leu Leu Sec Asn Ala lie Lye Tyr Ser Leu 
545 550 555 Son 

Ala Gly Gly Arg Val Val lit* Ser Leu Ala Lys Thr Lys Asn Glu Lys 

565 5J 0 575 

Asn Leu He He Arg Vi.1 Ala Asp Asn Gly He Gly He Asp Lys Thr 

580 585 590 

Asp Gin Ala His He Phe Asp He Phe Tyr Arg Gly Gin Ser Ala Thr 

555 600 605 

Glu Lys His Gly Ser Gly Val Gly Leu Ser Phe Thr Asn He Leu Val 

610 615 6 20 

Glu Asn Leu Arg Gly Thr He Lye Val Glu Ser Gin Pro Gly Lys Glv 
675 "0 €3S fil|0 

Ser Ala Phe Thr He Ser He Pro Thr Gin Asn Gin Ser Ser S«r Ala 

650 655 
Glu He Leu Pro Trp Leu Fro Ser Ser Asp Asp He Val Met Pro Val 

665 6 7 o 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



482/ 490 



His 


He 


Aid 


Pro 


Asp 


Asp 


Ser 


Pro 


Thr 


Ser 


Pro 


Met 


Val 


Ala 


Ala 


Leu 






675 










680 










685 








Arn 


His 


Arq 


Phe 


Glu 


Asp 


Glu 


Arg 


Pro 


Thr 


Tie 


L-n 


Leu 


Val 


Glu 


Asp 




690 










695 










700 










Asn 




Asp 


He 


Asn 


Leu 


Leu 


Val 


Lys 


Leu 


Leu 


Leu 


Cys 


Asp 


Arg 


Tyr 


7 05 










710 










715 










720 


Asn 


Val 


Lau 


Ser 


Ala 


Ala 


As ii 


Gly 


Lys 


Glu 


Gly 


Il- 


Ala 


Leu 


Ala 


Thr 










725 








730 










735 




Glu 


His 


He 


Pro 


Asp 


He 


Ti- 


He 


Thr 


Asp 


He 


l-let 


Met 


Pro 


He 


Met 








74 0 










745 










7S0 






Asp 


Gl'i- 


He 


Glu 


llet 


Thr 


lls 


Arg 


net 


i-y- 


Gin 


Ser 


Pro 


Leu 


Leu 


Cys 






755 










7 60 










7 65 








His 


Ile 


Pro 


He 


Val 


Ala 


Leu 


Thr 


Ala 


Ly& 


Ser 


Thr 


Glu 


Gin 


Asp 


Arg 




770 










775 










780 










Leu 


Glu 


Gly 


He 


Lys 


Ser 


Gly 


Val 


Val 


Ser 


Tyr 


Leu 


Cys 


Lys 


Pro 


Phe 


7 65 










790 










795 










800 


Scr 


Pro 


Glu 


Glu 


Leu 


Leu 


Met 


Arg 


He 


Glu 


Gin 


Leu 


Leu 


Lys 


Ag P 


Arg 










8 0S 










810 










015 




Glu 


Leu 


Leu 


Lys 


Lys 


Phe 


Tyr 


Met 


Gin 


Lys 


Leu 


Het 


Leu 


Asp 


Arg 


Lys 








820 










825 










830 






Pro 


Glu 


Glu 


Glu 


Pro 


Gin 


Pro 


He 


Asp 


Asp 


Ser 


Ser 


Het 


Gin 


Phe 


Leu 






835 










840 










845 








Leu 


Ala 


Ala 


Lys 


Asp 


Ala 


Val 


Ser 


Gly 


Gly 


He 


Lys 


Gin 


Asn 


Pro 


Asp 




850 










855 










860 










Phe 


Ser 


Ala 


Gin 


Asp 


Leu 


Ala 


Glu 


Lys 


Het 


Cyc 


Met 


Scr 


Pro 


Ser 


Gin 


865 










870 










87 5 










880 


Leu 


Asn 


Arg 


Ly*; 


Leu 


Thr 


Ser 


Val 


Val 


Gly 


Cys 


Ser 


Thr 


He 


Gly 


Tyr 










885 










aqo 










095 




He 


'31 n 


Gin 


He 


Lys 


He 


Lys 


Leu 


Ala 


Cys 


Lys 


Leu 


Leu 


Ala 


Asp 


Glu 








900 










906 










910 






Ser 


Lys 


Asn 


He 


Ser 


Asp 


He 


Ser 


11- 


Glu 


Ala 


Gly 


Phe 


Ser 


Asp 


Pro 






915 










920 










925 








Ala 


Tyr 


Phe 


Ser 


Arg 


Thr 


Phe 


Lys 


Arg 


Tyr 


llet 


Asn 


Cys 


Ser 


Pro 


Ser 




930 










935 










940 










Gin 


Tyr 


Arg 


Gin 


Lys 


Leu 


Leu 


Ala 


Het 


Pro 


Gly 


Ser 


Asp 


Lys 


Glu 


Thr 



945 950 955 960 



(21 I M FORI tAT ION fc'OR SEQ ID 110:525 

U* SEQUEUCK CHARACTERISTICS: 

(A) LENGTH: 556 amino acids 

( B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(Vij ORIGINAL SOURCE: 

(A| ORGANISM: Porphyromonas gingival is 

[ix> FEATURE: 

t A) MAMF./KEY: misc_£eature 
(B> LOCATION 1- . -556 



SEQUENCE DESCRIPTION : SEQ ID HO: 525 



Met 


Lys 


Lys 


Leu 


Leu 


Gin 


Ala 


Lys 


Ala 


Leu 


He 


Leu 


Ala 


Leu Gly 


Leu 


1 








5 










1 0 










15 




Phe 


Gin 


Leu 


Pro 
20 


Ala 


He 


Ala 


Gin 


Thr 

25 


Gin 


Mot 


Gin 


Ala 


Asp 
30 


Arg 


Thr 


Asn 


Gly Glu 


Phe 


Ala 


Thr 


Glu 


Glu 


Het 


Gin 


Arg 


Ala 


Phe 


Gin 


Glu 


Thr 






35 










40 








45 








Asn 


Pro 
50 


Pro 


Ala 


Gly 


Pro 


Val 
55 


Arg 


Ala 


He 


Ala 


Glu 
60 


Tyr 


Glu 


Arg 


Ser 


Ala 


Ala 


V„l 


Leu 


Val 


Arg 


Tyr 


Pro 


Phe 


Gly 


He 


Pro 


Het 


Glu 


Leu 


He 


65 










70 










75 










80 


Lye 


Glu 


T.eu 


Ala 


Lys 


Asn 


Asp 


Lys 


Val 


lie 

90 


Thr 


lie 


Val 


Ala 


Ser 

95 


Glu 


Ser 


Gin 


Lys 


Asn 


Thr 


Val 


Il- 


Thr 


Gin 


Tyr 


Thr 


Gin 


Ser 


Gly 


Val 


Asn 








100 










105 








no 






Leu 


Ser 


Asn 
115 


Cys 


Asp 


Phe 


Ile 


He 
120 


Ala 


Ly t 


Thr 


Acp 


Ser 
125 


Tyr 


Trp 


Thr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



483/490 



Arg Asp Tyr Thr Gly Trp Phe Ala Met Tyr Asp Thr Asn Lys Val Gly 

130 135 140 

Leu Val Asp Phe Hr Tyr Asn Arg Pro Arg Pro Asn Asp Asp Glu Ph- 
145 150 155 J 60 

Pro Lys Tyr Glu Ala Gin Tyr Leu Gly He Glu MeL Phe Gly Met Lys 

165 no 175 

Leu Lye GJ n Thr Gly Gly Asn Tyr Met Thr Asp Gly Tyr Gly Ser Ala 

180 185 190 

Val Gin Ser His He Ala Tyr Thr Glu Asn Ser Ser Leu Ser Gin Ala 

195 200 205 

Gin Val Asn Gin Lys Met Lys Asp Tyr Leu Gly He Thr His Hi** Asp 

210 215 220 

Val Val Gin Asp Pro Asn Gly Glu Tyr lis Asn His Val Asp Cys Trp 
225 230 235 240 

Gly Lys Tyr Leu Ala Pro Asn Lys lie Leu 11a Arg Lys Val Pro A*p 

245 250 255 

Asn His Pro Gin His Gin Ala Leu Glu Asp Net Ala Ala Tyr Phe Ala 

260 265 270 

Ala Gin Thr Cys Ala Trp Gly Thr Lys Tyr Glu Val Tyr Aro Ala Leu 

275 280 285 

Ala Thr Asn Glu Gin Pro Tyr Thr Acn Ser Leu He Leu Asn Asn Ara 

290 295 300 

Val Phe Val Pro Val Asn Gly Pro Ala Ser val Asp Asn Asp Ala L-u 
305 310 315 320 

Asn Val Tyr Lys Thir Ala Net Pro Gly Tyr Glu He Tie Gly Vr,l Lv 

325 330 335 

Gly Ala Ser Gly Thr Pro Trp Leu Gly Thr Asp Ala Leu His Cys Arq 

340 345 35C 

Thr His Glu Val Ala Asp Lys Gly Tyr Leu Tyr He Lys His Tyr Pro 

355 360 365 

He Leu Gly Glu Gin Ala Gly Pro Asp Tyr Lys He Glu Ala Asp Val 

370 375 380 

Vdl Ser Cys Ala Asn Ala Thr He Ser Pro Val Gin Cys Tyr Tvr Ara 
305 390 395 400 

He Asn Gly Ser Gly Ser Phe Lys Ala Ala Asp Met Thr Met Glu Ser 

405 410 415 

Thr Gly His Tyr Thr Tyr Ser Phe Thr Gly Leu Asn Lys Asn Asp Lys 

420 425 430 

Val Glu Tyr Tyr He Ser Ala Ala Asp Asn Ser Gly Arg Lys Glu Thr 

435 440 445 

Tyr Pro Phe He Gly Glu Pro Asp Pro Phe Lys Phe Thr Cys Met Asn 

450 455 460 

Glu Thr Asn Thr Cys Thr Val Thr Gly Ala Ala Lys Ala Leu Arg Ala 
465 470 475 480 

Trp Phe Asn Ala Gly Arq Ser Glu Leu Ala Val Ser Val Ser Leu Asn 

4 85 4 90 495 

He Ala Gly Thr Tyr Arg He Lys Leu Tyr Asn Thr Ala Gly Glu Glu 

500 505 510 

Val Ala Ala Met Thr Lys Glu Lsu Val Ala Gly Thr Ser Val Phe S°r 

515 570 525 

Met Asp Val Tyr Ser Gin Ala Pro Gly Thr Tyr Val Leu Val Val Glu 

530 535 540 

Gly Asn Gly He Arg Glu Thr Met Lys He Leu Lys 
545 550 555 

(2) III FORMATION fe'OR SEQ ID HO: 52 6 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 28 amino acids 

(B) TYPE: amino acid 
(PI) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) IIAME/KEY: misc_featurs 

(B) LOCATION 1. . .428 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 526 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



484/ 490 



net 


Lys 


Leu 


Ser 


Ser 


Lys 


Lys 


He 


Leu 


Ala 


lie 


lie 


Ala 


Leu 


Leu 


Thr 


1 








5 










10 










15 




(let 


Gly 


His 


Ala 


Val 


Gin 


Ala 


Gin 


Phe 


Val 


Pro 


Ala 


Pro 


Thr 


Thr 


Gly 








20 










25 










30 






lie 


Arg 


Met 


Ser 


Val 


Thr 


Thr 


Thr 


Lys 


Ala 


Val 


Gly 


Glu 


Lys 


He 


GlU 






35 










40 








45 








Leu 


Leu 


Val 


His 


Ser 


lie 


Glu 


Lys 


Lys 


Gly 


lie 


Trp 


He 


Asp 


Leu 


Asm 




SO 










55 










60 










Gly 


Asp 


Ala 


Thr 


Tyr 


Gin 


Gin 


Gly 


Glu 


Glu 


lie 


Thr 


Val 


Phe 


Asp 


Glu 


65 










70 










75 










80 


Ala 


Tyr 


His 


Glu 


Tyr 


Thr 


He 


Gly Thr 


Gin 


Thr 


Leu 


Thr 


He 


Tyr 


Gly 










85 










90 










95 




A^n 


Thr 


Thr 


Arg 


Leu 


Gly 


Cys 


Arg 


Ser 


Thr 


Gly 


Ala 


Thr 


Ala 


Val 


Asp 








100 










105 










110 






Val 


Thr 


Lys 


Asn 


Pro 


Asn 


Leu 


Thr 


Tyr 


Leu 


Ala 


Cys 


Pre 


Lye 


Asn 


Asn 






115 










120 










125 








Leu 


Lye 


Ser 


Leu 


Asp 


Lpu 


Thr 


Gin 


Asn 


Pro 


Lys 


Leu 


Leu 


Arg 


Val 


Trp 




130 










135 










140 










Cys 


Asp 


Ser 


Asn 


Glu 


He 


Glu 


Ser 


Leu 


Asp 


Leu 


Ser 


Gly 


Asn 


Pro 


Ala 


145 










150 










155 










160 


Leu 


lie 


lie 


Lsu 


Gly 


Cys 


Acp 


Arg 


Asn 


Lys 


Leu 


Thr 


Glu 


Leu 


Lys 


Thr 










165 










170 










175 




Asp 


Asn 


Asn 


Pro 


Lys 


Leu 


Ala 


Ser 


Leu 


Trp 


cys 


Ser 


Asp 


Asn 


Asn 


Leu 








180 










185 










190 






Thr 


Glu 


Leu 


Glu 


Lc<i 


Ser 


Ala 


Asn 


Pro 


Arg 


Leu 


Asn 


Asp 


Leu 


Trp 


Cys 






195 










200 










205 








Phe 


Gly 


Asn 


Arg 


He 


Thr 


Lys 


Leu 


Asp 


Leu 


Ser 


AID 


Am 


Pro 


Leu 


Leu 




210 










215 










220 










Val 


Thr 


Leu 


Trp 




Ser 


Asp 


Asn 


Glu 


Leu 


Ser 


Thr 


Leu 


Asp 


Leu 


Ser 


22b 










230 










235 










240 


Ly3 


Asn 


Ser 


Asp 


Val 


Ala 


Tyr 


Leu 


Trp 


Cys 


Ser 


Ser 


Asn 


Lyr 


Leu 


Thr 










245 










250 










255 




Ser 


Leu 


Am 


Lsu 


Ser 


Gly 


Val 


Lys 


Gly Leu 


Ser 


Val 


Leu 


Val 


Cys 


His 








260 










265 










27 0 






Ser 


Asn 


Gin 


He 


Ala 


Gly 


Glu 


Glu 


Met 


Thr 


Lye 


Val 


Val 


Asn 


Ala 


Leu 






27 5 










280 










285 








Pro 


Thr 


Leu 


Ser 


Pro 


Gly 


Ala 


Gly Ala 


Gin 


Ser 


Lys 


Phe 


Val 


Val 


Val 




290 










295 










300 










Asp 


Leu 


Lys 


Asp 


Thr Asp 


Glu 


Lys 


Acn 


He 


C" T C 


Thr 


Val 


Lya 


Asp 


Val 


305 










310 










315 










320 


Glu 


Lys 


Ala 


Lys 


Ser 


Lys 


Asn 


Trp 


Arg 


Val 


Phe 


Asp 


Phe 


Asn 


Gly 


Asp 










325 










330 










335 




Ser 


Asp 


Asn 


llet 


Leu 


Pro 


Tyr 


Glu 


Gly 


Ser 


Pro 


Thr 


Ser 


Asn 


Leu 


Ala 








340 










345 










350 






Val 


Afp 


Ala 


Pro 


Thr 


Val 


Arg 


He 


Tyr 


Pro 


Asn 


Pro 


Val 


Gly 


Arg 


Tyr 






355 










360 










365 








Ala 


Leu 


Val 


Glu 


He 


Pro 


Glu 


Ser 


Leu 


Leu 


Gly 


Gin 


Glu 


Ala 


Ala 


Leu 




310 










37 5 










380 










Tyr 


Asp 


I let 


Asn 


Gly 


Val 


Lys 


Val 


Tyr 


Ser 


Pn* 


Ala 


Val 


Glu 


Ser 


Leu 


385 










390 










395 










4 00 


Arg 


Gin 


Asn 


He 


Asp 


Leu 


Thr 


His 


Leu 


Pro 


Asp 


Gly 


Thr 


Tyr 


Phe 


Phe 










4 05 










410 










415 




Arg 


Leu 


Asp 


Asn 


Tyr 


Thr 


Thr 


Lys 


Leu 


He 


Lys 


Gin 











420 425 



(2) INFORI1ATION FOR SEQ ID MO: 527 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B ) TYPE: amino acid 
(0) TOPOLOGi*: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE: 

<AJ ORGANISM; Porphyromonas gingiva lis 

Ux> FEATURE: 

(A) MAI IE/ KEY: nu.se feature 
<B> LOCATION 1. . .310 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO:5?7 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



WO 99/29870 



PCT/AU98/01023 



485/490 



Met 


Arg 


.Lys 


ThL 


He 


He 


Phf 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Gly Cys 


1 








5 










10 










15 




Ser 


Trp 


Ala 


Gin 


Glu 


Arg 


Val 


Asp 


Glu 


Lys 


Val 


Phe 


Ser 


Ala 


Gly Thr 








20 










25 










30 






Ser 


He 


Phe 


Arg 


Gly 


He 


Leu 


Glu 


Lyr 


Val 


Lys 


Ala 


Pro 


Leu 


Ket 


Tyr 






35 










40 










45 








Gly 


Asp 


Arg 


Glu 


Val 


Trp 


Gly 


Met 


Ala 


Arg 


Ala 


Ser 


Glu Asp 


Phe 


Phe 




50 










55 










60 










Phe 


He 


Leu 


Pro 


Val 


Thr 


Asp 


Asp 


Leu 


Thr 


Pro 


Val 


Leu 


Phe 


Tyr 


Asn 


65 










70 










75 








eo 


Arg 


Leu 


Thr 


Asn 


Glu 


Pro 


Cys 


Phe 


Val 


Ser 


Asp 


Gin 


Gly 


tie 


Thr 


GlU 










85 










90 










95 




Tyr 


Phe 


Lys 


Phe 


Ala 


Gin 


Glu 


Gly 


Asp 


Tyr 


He 


Glu 


Val 


Glu 


Gly 


Ser 








100 










105 










110 






Ssr 


Val 


Phe 


Hat 


Ala 


Asn 


Leu 


Leu 


Tyr 


Tyr 


Arg 


Phe 


Phe 


Pro 


Thr 


Arg 






115 










120 










125 








lie 


Thi 


Ser 


Tyr 


Asn 


Ala 


Pro 


lie 


Glu 


Gly 


Val 


Val 


Ser 


Lys 


Thr 


Gly 




130 










135 










140 










Asm 


Pro 


Ala 


Phe 


Thr 


He 


Pro 


Met 


Leu 


Pro 


Gly 


Val 


Ser 


A*p 


Cys 


He 


145 










150 










155 










160 


Glu 


lie 


Ser 


Asn 


Aen 


Arg 


Lyr 


Val 


Phe 


Leu 


Thr 


Asn 


Gin 


Leu 


Gly 


Val 










16b 










1 70 










175 




Val 


Asa 


Tic 


Thr 


Asp 


Gly 


Met 


Glu 


Pxo 


Pro 


He 


He 


Ala 


Gly 


Val 


Ser 








180 










105 










190 






Ala 


Ser 


Tyr 


Gly 


Ser 


Ser 


Val 


A eg 


Val 


Tyr 


Gly 


His 


Val 


Ser 


Gin 


Arg 






195 










200 










205 








Tcp 


Asp 


He 


Tie 


Gly 


His 


Cys 


Tyr 


Leu 


Asp 


He 


Tyr 


Pre 


Thr 


Asn 


Cys 




210 










215 










220 










Tyr 


Pro 


Leu 


Ser 


Thr 


Lys 


Pro 


Val 


Ala 


Gly Asp 


Asp 


Glu 


Val 


Phe 


Val 


225 










230 










235 










240 


Lys 


Gin 


Gin 


Gly 


Arg 


Gin 


He 


Glu 


He 


Asp 


Ser 


Asn 


Sex 


Pro 


He 


Val 










245 










250 










255 




Gin 


Val 


Val 


Val 


Tyr 


Asp 


Leu 


Glu 


Gly Lys 


Ser 


Val 


Phe 


Arg 


Lys 


Arg 








260 










26S 










2/0 






Net 


Thr 


Glu 


Asm 


Ala 


Tyr 


Thr 


Leu 


Ser 


Phe. 


Arg 


Ala 


Pro 


Met 


Leu 


Gly 






275 










280 










285 








Phe 


[let 


Thr 


He 


Met 


lie. 


Glu 


Thr 


Gin 


Asn 


Ser 


lie 


He 


Asn 


Lys 


Lys 




290 










295 










300 






Lou 


Rsn 


Val 


Thr 


Gin 


Leu 























305 310 



{?.) 1H FORMATION FOR SEQ ID HO: 52 Q 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 05 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A> ORGANISM: Porphyromonas gingival is 

tlx) FEATURE: 

{A> MAI IE/KEY: misc feature 
{BJ LOCATION 1...4 05 



txi) SEQUENCE DESCRIPTION: S EQ ID NO:528 



Met 


Lys 


Lys 


Thr 


Thr 


He 


He 


Sec 


Leu 


He 


Val 


Phe 


Gly Ala 


Phe 


Phe 


1 








6 










10 










15 




Ala 


Ala 


Val 


Gly 
20 


Gin 


Thr 


Lys 


Asp 


Asn 
25 


Ser 


Ser 


Tyr 


Lys 


Pro 
30 


Phe 


S&r 


Lys 


Glu 


Asp 


lie 


Ala 


Gly 


Gly 


Val 


Tyr 


Ser 


Leu 


Pro 


Thr 


Gin 


Asn 


Arg 






35 










40 










45 






Ala 


Gin 

50 


Lys 


Asp 


Asn 


Ala 


Glu 

55 


Trp 


Leu 


Leu 


Thr 


Ala 
60 _ 


Thr 


Val 


Ser 


Thr 


Asn 


Gin 


Ser 


Ala 


Asp 


Thr 


His 


Phe 


He 


Phe 


Asp 


Glu 


Asn 


Asn 


Arg 


Tyr 


65 










70 










75 








30 


He 


Ala 


Arg 


Asp 


He 
85 


Lys 


Ala 


Asn 


Gly 


Val 

90 


Arg 


Lys 


Ser 


Thr 


Asp 
95 


Ser 


He 


Tyr 


Tyr 


Asp 
100 


Ala 


Asn 


Gly 


Arg 


He 
105 


Ser 


His 


Val 


Asp 


Leu 
110 


Tyr 


He 
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Ser 


Phe 


Scr 


Gly 


Gly 


G) u 


Pro 


Ala 


I.eu 


Asp 


Thr 


Arg 


Phe 


Lys 


Tyr 


Thr 






115 










120 










125 








Tyr 


Asp 


Asp 


Glu 


Gly 


Lys 


Met 


Thr 


Val 


Arg 


Glu 


Val 


Phe 


Met 


I.cu 


Val 




130 










135 










140 










Met 


Asp 


Pro 


Asn 


Thr 


Pro 


11* 


Ser 


Arq 


Leu 


Glu 


Tyr 


His 


Tyr 


Asp 


Ala 


145 










ISO 










155 










160 


Gin 


Gly 


Arg 


Leu 


Thr 


His 


Trp 


He 


Ser 


Phe 


Ala 


Phe 


Gly 


Ala 


Glu 


Ser 










165 










170 










175 




Gin 


Lys 


Rsn 


Thr 


Tyr 


His 


Tyr 


Asn 


Glu 


Lys 


Gly 


Leu 


Leu 


Val 


Ser 


Glu 








160 










185 










190 






Val 


Leu 


Ser 


Asn 


Ala 


Met 


Gly 


Thr 


Thr 


Tyr 


Ser 


Asp 


Thr 


Gly 


Lys 


Thr 






195 










200 










205 








Glu 


Tyr 


Ser 


Tyr 


Asd 


Asp 


Ala 


Asp 


Asn 


net 


Val 


Lys 


Ala 


Glu 


Tyr 


Phe 




210 










215 










220 










Val 


Val 


Gin 


Gin 


Gly 


Lys 


Al a 


Trp 


Gin 


Val 


Leu 


Lys 


Arg 


Glu 


Glu 


Tyr 


225 










230 










235 










240 


Thr 


Tyr 


Glu 


Asp 


Asn 


lie 


Cys 


He 


Gin 


Tyr 


Leu 


Ala 


He 


Asn 


Gly 


Thr 










245 










250 










255 




Asp 


Thr 


Lys 


Val 


Tyr 


Lys 


Arg 


Asp 


lie 


Glu 


Ser 


Asp 


Lys 


Ser 


He 


Ser 








260 










265 










270 






Ala 


Asn 


Val 


lis 


Asp 


11-2 


Pro 


Ser 


Met 


Pro 


Glu 


Gin 


Thr 


Trp 


Pro 


Asn 






275 










2B0 










285 








Met 


Tyr 


Gly 


Phe 


Asn 


Ala 


Lys 


Arg 


Leu 


Lys 


Glu 


Thr 


Tyr 


Ser 


Ser 


Tyr 




290 










295 










300 










Glu Gly 


Arp 


Val 


Al a 


Thr 


Pro 


He 


Phe 


Asp 


Tyr 


He 


Tyr 


Thr 


Tyr 


Lys 


305 










310 










31S 










320 


Ala 


Leu 


Thr 


Ser 


Net 


Ala 


Thr 


Pro 


Sec 


Thr 


Glu 


Ala 


Gin 


Val 


Ala 


Val 










325 










330 










335 




Tyr 


Leu 


Asn 


Pro 


Set- 


Thr 


Asp 


Arq 


Leu 


Val 


He 


Leu 


Ala 


Ai?n 


Gly 


He 








340 










345 










350 






Thr 


Hxs 


Leu 


Ser 


Met 


Tyr 


Asp 


Leu 


Gin 


Gly 


Ly- 


Leu 


He 


Arg 


Asp 


Cys 






355 










360 










365 








Ala 


T.eu 


Ser 


Gly 


Asp 


Lys 


Val 


Glu 


Met 


Gly 


Val 


Gly 


Ser 


Leu 


Thr 


Lys 




37 0 










375 










380 










Gly Thr 


Tyr 


Leu 


Leu 


Lys 


Val 


Asn 


Thr Asp 


Cln 


Gly 


Ala 


Phe 


Val 


Arg 


385 










390 










395 










400 


Lys 


Val 


Val 


He 


Arg 
































405 
























<2) 


IUFORUATION 


FOR 


SEQ 


ID NO:529 

















(1) SEQUENCE CHARACTERISTICS '. 

(A) LENGTH: 2037 base pairs 

(B) TYPE: nucleic acid 
(C> STRAIN DEDNESS: double 
(D( TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

<iii) HYPOTHETICAL: MO 

(iv) ANTI -SENSE: NO 

(vi ) ORIGINAL SOURCE: 

JA) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE : 

(A) HA1 IE/KEY: misc feature 
(BJ LOCATION H..2037 

Ui) SEQUENCE DESCRI PTIOt) : SF.Q ID NO: 52 9 

GTAATGGACT ACAAACTCAC TTCTCGATTC AAGCCCACGG GCGACCAGCC GGAAGCCATT 

CGCCAACTCG T ACAGGG CAT CAACGAAGGG ATGCCGGCTC A'JACGCTGCT CGGCGTAACG 

GGTTCGGGCA AAACTTTTAC GGTGGCTAAC GTGGTGGCGG CGGTCAATCG TCCGACCCTT 

GTCCTGAGTC ACAACAAGAC CTTGGCAGCA CAGCTATACG GAGAGTTCAA AGCTTTCTTC 

CCCGAGAATG CGGTGGAGTA TTTCGTCAGC TACTACGACT ACTATCAGCC CGAGGCCTAC 

CTCCCCGTCA CAGACACCTA TATCGAAAAG GACATGGCCA TCAACGCGGA GATCGAAAAA 

CTGCGATTGA GGGCCACGGC TTCGCTCCTG TCAGGGCGGA AAGATGTGCT TGTGGTCAGC 

TCCGTATCCT GTCTCTACGG TATGGCCAAT CCTGAAGCTT TTTCCGAAAA GGTGATCAGC 

CTGCACACGG GACAAAGGGC AGACAGGGAT CATTTTATCC GCCTGCTGGT AGAGAGCTAC 

TACACGAACA ATAAAGTAGA GTTCGAGAGC GGCAACTTCC GTGTCAAAGG CGACAGCGTG 

GACATATTCC CCGCCGTAGA AGGTTATGAC GGCGTGGCAT ACAGGGTGGA GTTTTGGGAT 

GGAGAGGTCG AGCGGCTGAG TACCTTCGAT CCGCGAACGG GACGGGAATA CGGCCTGCTG 

TCGGAGCTGA AGATATATCC GGCCAATCTC TTCGTGACGA CTAAGGA3CA GGTGGAT CGG 
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GCAGTAGGGA AAATCGATGT GCATCTGGGC 
AAACCATATG AAGCCAAACG CTTGTATGAG 
GAGTTGGGTT ATTGTTCCGG TATAGAGAAC 
GGCGAACGTC CTTTCTGTCT GTTGGATTAT 
5 GAAAGCCATG TAACGATACC GCAGATACGT 
GAGAATCTGG TCGAATACGG ATTCCGCCTG 
TTCGACGAGT TCGAAGCTCT CACCCCCCGG 
TATGAGCTGA ACAGAAGCGA AGGCGTGATC 
CTGGATCCCA TCATOGACGT CAAGCCGACG 

10 ATAGCACGCT GCATCGAAAA GAAAGAGCGC 
GCAGAGGAGC TTAGCGAATA CCTGCTACGC 
GATGTGGACA CGCTGGAGCG TGTGCGTATC 
GCACT CATCG GGGTGAATCT GCTCCGCGAA 
GCTATTCTGG ATGCGGATAA GGAAGGATTC 

15 GCAGGACGTG CCGCCCGGCA CATTCATGGG 
GACAGTATGC AGCTCACCAT GGACGAGACT 
AACGAAGCGC ACGGCATCAC CCCCCAACAG 
GAAGGCGATG TGTCGGCCTT GCAATCCGAT 
AG CAT GG T G G CTGCCGATCC TTTGGCCGAC 

20 ATTGCTTCGA CCAAGAAGCA AATGCTGGCA 
GCACGACTTC GGGACGAAGC CGCACGATTG 



GCACAGGTCG ATTTTCTGAA AGAAATCGGC 8-10 

CGGGTCACGT ATGACTTGGA AATGATCCGT 900 

TATTCGCGCT ACTTCGACGG CCGTGACGCG 960 

TTCCCGGAG3 ATTTCCTGTT GGTCATAGAC 1020 

GCCATGTACG GAGGCGATCG TTCGCGCAAG 1080 

CCTGCCGCTC TCGACAATCG GCCGCTTCGC 114 0 

ACCCTTTATA TCAGTGCCAC GCCTGCCGAC 1200 

GTCGAGCAGC TGATCCGTCC GACCGGACTG 1260 

UCAAACCAAG TGGACGATCT GATGGAGGAG 1J20 

GTACTGGTAA CGACCCTGAC CAAACGTATG 1380 

CACGGTATCA GCACCGGCTA CATACACAGC 14 4 0 

ATGGAAGACC TGCGCAAGGG GGTCTACGAT 1500 

GG ATTGGA'JT TGCCGGAAGT TTCGCTTGTG 1SGG 

CTGCGCTCGC ATCGTTCGCT CACGCAGACT 1620 

CGTGTCATCT TCTACGCGGA CAAGATCACC 1680 

GCACGCCGAC GCGCAAAGCA ACTGGCCTAC 17 4 0 

ATAGTGAAGA ACAGTGCTGC CATTTGGGGA 1800 

ACAGAATCCG GTGCGTACAT AG AAGAGAG C I860 

TATCTGAGCA AACCCAAGCT GG AAGCACT C 1?20 

GCAGCCAAAG AGCTGGACTT TCTGGAAGCG 1980 

GAAAAGAAAC TGGAGCAACT CACAGCC 2037 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORIlATIOif FOR SEQ ID HO: 530 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2034 bass pairs 
(D) TYPE: nucleic acid 

(C) STRA1IDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) HOLECULE TYPE : DNA (genomic) 

Uil) HYPOTHETICAL: HO 

<iv) All VI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qinqivalts 

(ix) FEATURE: 

(A) MAI IE /KEY : miscjaature 

(B) LOCATION 1 2034 

0:1) SEQUENCE DESCRIPTION: SEQ ID 110:530 



ATGGACTACA 
CAACTCGTAC 
TCGGGCAAAA 
CTGAGTCACA 
GAGAATGCGG 
CCCGTCACAG 
CGATTGAGGG 
GTATCCTGTC 
CACACGGGAC 
ACGAACAATA 
AT ATT CCCCG 
GAGGTCGAGC 
GAGCTGAAGA 
GTAGGGAAAA 
CCATATGAAG 
TTGGGTTATT 
GAACGTCCTT 
AGCCATGTAA 
AATCT GGTCG 
CACGAGTTCG 
GAGCTGAAGA 
GATCCCATCA 
GCACGCTGCA 
GACGAGCTTA 
GTGGACACGG 
CTCATCGGGG 
ATTCTGGATG 
GGACGTGCCG 
AGTATGCAGC 



AACTCACTTC 
AGGGCATCAA 
CTTTTACGGT 
ACAAGACCTT 
TGGAGTATTT 
ACACCTATAT 
CCACGGCTTC 
TCTACGGTAT 
AAAGGGCAGA 
AAGTAGAGTT 
CCGTAGAAGG 
GGCTGAGTAC 
TATATCCGGC 
TCGATGTGGA 
CCAAACGCTT 
GTTCCGGTAT 
TCTGTCTGTT 
CGATACCGCA 
AATACGGATT 
AAGCT CTC AC 
GAAGCGAAGG 
TCGACCTCAA 
TCGAAAAGAA 
GCGAATACCT 
TGGAGCGTGT 
TGAATCTGCT 
CGGATAAGGA 
CCCGGCACAT 
TCACCATGGA 



TCGATTCAAG 
CGAAGGGATG 
GGCTAACGTG 
GGCAGCACAG 
CGTCAGCTAC 
CGAAAAGGAC 
GCTCCT GTCA 
GGCCAATCCT 
CAGGGATCAT 
CGAGAGCGGC 
TTATGACGGC 
CTTCGATCCG 
CAATCTCTTC 
TCTGGGCGCA 
GTATGAGCGG 
AGAGAACTAT 
GGATTATTTC 
GATACGTGCC 
CCGCCTCCCT 
CCCCCGGACC 
CGTGATCGTC 
GCCGACGGCA 
AGAGCGCGTA 
GCTACGCCAC 
GCGTATCATG 
CCGCGAAGGA 
AGGATTCCTG 
TCATGGGCGT 
CGAGACTGCA 



CCCACGGGC Z 
CCGGCTCAGA 
GTGGCG3CGG 
CTATACGGAG 
TACGACTACT 
ATGGCCATCA 
GGGCGGAAAG 
GAAGCTTTTT 
TTTATCCGCC 
AACTTCCGTG 
GTGGCATACA 
CGAACGGGAC 
GTGACGACTA 
CAGGTCGATT 
GTCACGTATG 
TCGCGCTACT 
CCGGAGGATT 
ATGTACGGAG 
GCCGCTCTCG 
CTTTATATCA 
GAGCAGCTGA 
AACCAAGTGG 
CTGGTAACGA 
GGTATCAGCA 
GAAGACCTGC 
TTGGACTTGC 
CGCTCGCATC 
GT GAT CTT "T 
CGCCGACGZG 



ACCAGCCGGA 
CGGTGCTCGG 
TCAATCGTCC 
AGTTCAAAGC 
ATCAGCCCGA 
ACGCGGAGAT 
ATGTGCTTGT 
CCGAAAAGGT 
TGCTGGTAGA 
TCAAAGGCGA 
GGGTGGAGTT 
GGGAATACGG 
AGGAGCAGGT 
TTCTGAAAGA 
ACTTGGAAAT 
TCGACGGCCG 
TCCTGTTGGT 
GCGATCGTTC 
ACAATCGGCC 
GTGCCACGCC 
TCCGTCCGAC 
ACGATCTGAT 
CCCTGACCAA 
CCGGCTACAT 
GCAAGGGGGT 
CGGAAGTTTC 
GTTCGCTCAC 
ACGCGGACAA 
CAAAGCAACT 



AGCCATTCGC 
CGTAACGGGT 
GACCCTTGTC 
CTTCTTCCCC 
GGCCTACCTC 
CGAAAAACTG 
GGTCAGCTCC 
GATCAGCCTG 
GAGCTACTAC 
CAGCGTGGAC 
TTGGGATGGA 
CCTGCTGTCG 
GGATCGGGCA 
AATCGGCAAA 
CATCCGTGAG 
TGACGCGGGC 
CATAGACGAA 
GCGCAAGGAG 
GCTTCGCTTC 
TGCCGACTAT 
CGGACTGCTG 
GGAGGAGATA 
ACGTATGGCA 
ACACAGCGAT 
CTACGATGCA 
GCTTGTGGCT 
GCAGACTGCA 
GATCACCGAC 
GGCCTACAAC 



60 
12 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10L*0 
1140 
1200 
1260 
1320 
1380 
1410 
1500 
1560 
1S20 
1660 
1740 
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GAAGCGCACG GCATCACCCC CCAACAGATA GTGAAGAACA GTGCTGCCAT TTGGGGAGAA 1800 

GGCGATGTGT CGGCCTTGCA ATCCGATACA GAATCCGGTG CGTACATAGA AGAGAGCAGC I860 

ATGGTGGCTG CCGATCCTTT GGCCGACTAT CTGAGCAAAC CCAAGCTGGA AGCACTCATT 1920 

GCTTCGACCA AGAAGCAAAT GCTGGCAGCA GCCAAAGAGC T GG ACT TTCT GGAAGCGGCA 198C 

CGACTTCGGG ACGAAGCCGC ACGATTGGAA AAGAAACTGG AGCAACTCAC AGCC 2034 



(2) INFORMATION FOR SEQ ID HO: 531 

10 <i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 67 9 amino acids 
( 0) TYPE: amino acid 
(I?) TOPOLOGY: linear 

15 Hi) MOLECULE TYPE: protein 

{ill} HYPOTHETICAL: YES 



20 



( Vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Porphyromonae gingiva lie 



25 



t ix) FEATURE: 

(A) NAI IE/KF.Y: mi?cj eature 

(B) LOCATION 1. . .67 9 



/ 



i;<i) SEQUENCE DESCRIPTION: SEQ. ID NO: 531 



30 



35 



40 



45 



55 



60 



65 



70 



75 



Val 


Met 


Asp 


Tyr 


Lys 




Thr 


Ser 


Arg 


Phe 


Lys 


Pro 


Thr 


Gly 


Asp 


Gin 


1 








5 










1 0 










15 




Pro 


Glu 


Ala 


lie 


Arg 


Gin 


Leu 


Val 


Gin 


Gly 


He 


Asn 


Glu 


Gly 


Met 


Pro 








20 










25 










30 






Ala 


Gin 


Thr 


Leu 


Leu 


Gly 


Val 


Thr 


Gly 


Ser 


Gly 


Lys 


Thr 


Phe 


Thr 


Val 






35 










40 










45 








Ala 


Asn 


Val 


Val 


Ala 


Ala 


Val 


Asn 


Arg 


Pro 


Thr 


Lou 


Val 


Leu 


Ser 


His 




50 










55 










SO 










Asn 


Lys 


Thr 


Leu 


Ala 


Ala 


Gin 


Leu 


Tyr 


Gly 


Glu 


Phe 


Lys 


Ala 


Phe 


Phe 


65 










70 










-75 










go 


Pro 


GlU 


Asa 


Ala 


Val 


Glu 


Tyr 


Phe 


Val 


Ser 


Tyr 


Tyr 


Asp 


Tyx 


Tyr 


Gin 










85 










90 










95 




PLO 


Glu 


Ala 


Tyr 


Leu 


Pro 


Val 


Thr 


Asp 


Thr 


Tyr 


He 


Glu 


Lys 


Asp 


Met 








100 










105 










110 






Ala 


lie 


Asn. 


Ala 


Glu 


He 


Glu 


Lys 


Leu 


Arg 


Leu 


Arg 


Ala 


Thr 


Aid 


Ser 






US 










120 










125 








Leu 


Leu 


Scr 


Gly 


Arg 


Lys 


Asp 


Val 


Leu 


Val 


Val 


Ser 


Ser 


Val 


Ser 


Cys 




130 










135 










140 










Lsu 


Tyr 


Gly 


Met 


Ala 


Asn 


Pro 


Glu 


Ala 


Put 


Sen 


Glu 


Lys 


Val 


He 


Ser 


] 45 










150 










155 










160 


Leu 


Hxs 


Thr 


Gly 


Gin 


Arg 


Ala 


Asp 


Arq 


Asp 


His 


Phe 


He 


Arg 


Leu 


Leu 










165 










170 










175 




Val 


Glu 


Ser 


Tyx 


Tyr 


Thr 


Asn 


Asn 


Lys 


Val 


Glu 


Phe 


Glu 


Ser 


Gly 


Asn 








180 










1Q5 










190 






Phe 


Arg 


Val 


Lys 


Gly 


Asp 


Ser 


Val 


Asp 


He 


Phe 


Pro 


Ala 


Val 


Glu 


Gly 






195 










200 










205 








Tyr 


Asp 


Gly 


Val 


Ala 


Tyr 


Arg 


Val 


Glu 


Phe 


Trp 


Asp 


Gly 


Gill 


Val 


Glu 




210 










215 










220 










Arg 


Lsu 


Ser 


Thr 


Phe 


Asp 


Pro 


Arg 


Thr 


Gly 


Arg 


Glu 


Tyr 


Gly 


Leu 


Leu 


225 










230 










235 










240 


Ser 


Glu 


Lsu 


Lys 


He 


Tyr 


Pro 


Ala 


Asn 


Lsu 


Phe 


Val 


Thr 


Thr 


Lys 


Glu 










245 










250 










255 




Gin 


Val 


Asp 


Arg 


Ala 


Val 


Gly 


Lys 


He 


Asp 


Val 


Asp 


Leu 


Gly 


Ala 


Gin 








260 










265 










270 






Val 


Asp 


Phe 


Leu 


Lys 


Glu 


He 


Gly 


Lys 


Pro 


Tyr 


Glu 


Ala 


Lys 


Arg 


Leu 






275 










280 










285 








Tyr 


Glu 


Arg 


Val 


Thr 


Tyr 


Asp 


Leu 


Glu 


Het 


lie 


Arg 


Gl u 


Leu 


Gly 


Tyr 




290 










295 










300 








Cys 


Ser 


Gly 


lis 


Glu 


Asn 


Tyr 


Ser 


Arq 


Tyr 


Phe 


Asp 


Gly 


Arg 


Asp 


Ala 


305 










310 










315 










320 


Gly 


Glu 


Arg 


Pro 


Phe 


Cys 


Leu 


Leu 


Asp 


Tyr 


Phe 


Pro 


Glu 


Asp 


Phe 


Leu 










325 










3J0 








335 




Leu 


Val 


lie 


Asp 


Glu 


Scr 


His 


Val 


Thr 


He 


Pro 


Gin 


He 


Arg 


Ala 


Het 








340 










34 5 










350 






Tyr 


Gly 


Gly 


Asp 


Arg 


Ser 


Arg 


Lys 


Glu 


Atn 


Leu 


Val 


Glu 


Tyr 


Gly 


Phe 






355 










360 










365 








Arg 


Leu 




Ala 


Ala 


Leu 


Asp 


Asn 


Arg 


Pio 


Leu 


Arg 


Phe 


Asp 


Glu 


Phe 




370 










37 5 










380 
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Glu Ala Leu Thr Pro Arg Thr Leu Tyr lie Ser Ala Thr Pro Ala Asp 
385 390 395 40 n 

Tyr Glu Leu Asn Arg Ser Glu Gly Val lie Val Glu Gin Leu He Arg 

405 410 415 

Pr? Thr Gly Leu Leu Asp Pro He He Atp Val L/C Pro Thr Ala Asn 

425 430 
Gin Val Asp Asp Leu Met Glu Glu He Ala Arg Cys He Glu Lys Lys 

4 35 44Q 445 

Glu Arg Val Lsu Vnl Thr Thr Leu Thr Lys Arg Met Ala Glu Glu Leu 

•150 4 55 4 60 

Ser Glu Tyr Leu Leu Arg His Gly Hp Ser Thr Gly Tyr He His Ser 
465 470 475 4QQ 

Asp Val Asp Thr Leu Glu Arg Val Arg He Met Glu Asp Leu Arg Lys 

485 400 4 oc 

Gly V,-l Tyr Asp Ala Leu He Gly Val Am Leu Leu Arg Glu Gi" L~u 

500 505 510 

Asp Leu Pro Glu Val Ser t.eu Val Ala He Leu Asp Ala Asp Lys Glu 

515 520 525 

Gly Phe Leu Arg Ser His Arg Ser Leu Thr Gin Thr Ala Gly Arq Ala 

530 535 540 

Ala Arg His He Hie Gly Arg Val He Phe Tyr Ala Asp Lys He Thr 
545 550 555 560 

Asp Sc-r Met Gin Leu Thr Met Asp Glu Thr Ala Arg Arg Arg Aia Lie 

565 570 575 

Gin Leu Ala Tyr Asn Glu Ala His Gly He Thr Pro Gin Gin He Val 

580 £05 590 

Lys Asn Ser Aia Ala He Trp Gly Glu Gly Asp Val Ser Ala Leu Gin 

595 600 605 

Ser Asp Thr G3 u Ser Gly Ala Tyr He Glu Glu Ser Scr Met Val Ala 

610 615 620 

Ala Asp Pro Leu Ala Asp Tyr Leu Ser Lys Pro Lys Leu Glu Ala L°u 
625 630 635 £40 

He Ala Ser Thr Lys Lys Gin Met Leu Ala Ala Ala T.ys Glu Leu Asp 

645 650 655 

Phe Leu Giu Ala Ala Arg Leu Arg Asp Glu Ala Ala Arg Leu Glu L * = 

660 665 670 

Lys Leu Glu Gin Leu Thr Ala 
67 5 



{2> INFORMATION FOR SEQ ID HO: 532 

U> SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH : 67Q amino acids 
IB) TYPE: ammo acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE; protein 
(Hi) HYPOTHETICAL: YES 



tvi) ORIGINAL SOURCE: 

iA> ORGANISM: Porphyromonas gingival is 

[ ix ) FEATURE : 

<A) MAMF./KEY: miscfeature 
IB) LOCATION 1. . -67 8 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 



Met 


Asp 


Tyr 


Lys 


Leu 


Thr 


Ser 


Arg 


Phe 


Lys 


Pro 


Thr 


Gly Asp Gin 


Pro 




Ala 






5 










10 










15 




Glu 


He 


Arg 


Gin 


Leu 


Val 


Gin 


Gly 


Tie 


Asn 


Glu 


Gly Met 


Pro 


Ala 


Gin 






20 










25 










30 




Thr 


Leu 


Leu 


Gly 


Val 


Thr 


Gly 


Ser 


Gly 


Lys 


Thr 


Phe 


Thr 


Val 


Ala 






35 










40 










4 Lr 






Asn 


Val 


Val 


Ala 


Ala 


Val 


Asn 


Arg 


Pro 


Thr 


Leu 


Val 


Leu 


Ser 


His 






50 










55 










60 








Lys 


Thr 


Leu 


Ala 


Ala 


Gin 


Leu 


Tyr 


Gly 


Glu 


Phe 


Lys 


Ala 


Phe 


Phe 


Pro 


65 










7 0 










75 












Glu 


Asn 


Ala 


Val 


Glu 


Tyr 


Phe 


Val 


Ser 


Tyr 


Tyr 


Asp 


Tyr 


Tyr 


Gin 


Pro 


Glu 








85 










90 








95 




Ala 


Tyr 


Lou 


Pro 


Val 


Thr 


Asp 


Thr 


Tyr 


He 


Glu 


Lys 


Asp 


Met 


Ala 








100 










105 








110 






He 


Asr 


Al» 
115 


Glu 


He 


Glu 


Ly* 


Leu 
120 


Arg 


Leu 


Arg 


Ala 


Thr 

125 


Ala 


S-r 


Leu 
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Leu 


Ser 


Gly Arg 


Lys 


Asp Val 


Leu 


Val 


Val 


Ser 


Ser 


Val 


Ser 


Cys 


Leu 




130 










135 










140 










Tyr 


Gly 


Met 


Ala 


Asn 


Pro 


Glu 


Ala 


Phe 


Ser 


Glu 


Lys 


Val 


He 


Ser 


Leu 


145 










150 










155 










160 


His 


Thr 


Gly 


Gin 


Arg 


Ala 


Asp Arg 


Asp 


Hir 


Phe 


He 


Arg 


Leu 


Leu 


Val 










165 










1 10 










175 




Glu 


Ser 


Tyr 


Tyr 


Thr 


Asn 


Asn 


Lys 


Val 


Glu 


Phe 


Glu 


Ser 


Gly 


Asn 


Phe 








180 










185 










190 






Arg 


Val 


Lys 


Gly 


Asp 


Ser 


Val 


Asp 


He 


Phe 


Pro 


Ala 


Val 


Glu 


Gly 


Tyr 






195 










200 










205 








Asp 


Gly 


Val 


Ala 


Tyr Arg 


Val 


Glu 


Phe 


Trp 


Asp 


Gly 


Glu 


Val 


Glu 


Arg 




210 










215 










220 










r.pu 


Sor 


Thr 


Phe 


Asp 


Pro 


Arg 


Thr 


Gly Arg 


Glu 


Tyr 


Gly 


Leu 


Leu 


Scr 


225 










230 










235 










240 


Glu 


Leu 


Lye 


He 


Tyr 


Pro 


Ala 


Asn 


Leu 


Phe 


Val 


Thr 


Thr 


Lys 


Glu 


Gin 










245 










250 










255 




Val 


Asp 


Arg 


Ala 


Val 


Gly 


Lys 


He 


Asp 


Val 


Asp 


Leu 


Gly 


Ala 


Gin 


Val 








260 










265 










270 






Acp 


Phe 


Leu 


Lys 


Glu 


He 


Gly 


Lys 


Pro 


Tyr 


Glu 


Ala 


Lys 


Arg 


Leu 


Tyr 






275 










2fl0 










295 








Glu 


Arq 


Val 


Thr 


Tyr Asp Leu 


Glu 


Met 


He 


Arg 


Glu 


Leu 


Gly 


Tyr 


Cys 




2 90 










2 95 










300 










Ser 


Gly 


lie 


Glu 


Asn 


Tyr 


Ser 


Arg 


Tyr 


Phe 


Asp 


Gly Arg 


Asp 


Ala 


Gly 


30b 










310 










215 










320 


Glu 


Arg 


Pro 


Phe 


Cys 


Leu 


Leu 


Asp 


Tyr 


Phe 


Pro 


Glu 


A.*p 


Phe 


Leu 


Leu 










325 










330 










335 




Val 


lie 


Acp 


Glu 


Ser 


His 


Val 


Thr 


He 


Pro 


Gin 


lie 


Arg 


Ala 


Met 


Tyr 








340 










345 










3S0 






Gly 


Gly 


Asp 


Arg 


Ser 


Arg 


Lys 


Glu 


Asn 


Leu 


Val 


Glu 


Tyr 


Gly 


Phe 


Arg 






355 










360 










365 








Lou 


Pro 


Ala 


Ala 


Leu 


Asp Asn 


Arg 


Pro 


Leu 


Arg 


Phe 


Asp 


Glu 


Phe 


Glu 




370 










375 










380 










Ala 


Leu 


Thr 


Pro 


Arg Thr 


Leu 


Tyr 


He 


Ser 


Ala 


Thr 


Pro 


Ala 


Asp 


Tyr 


385 










390 










395 










400 


Glu 


Leu 


Asn 


Arq 


Ser 


Glu 


Gly 


Val 


He 


Val 


Glu 


Gin 


Leu 


He 


Arg 


Pro 










405 










410 










415 




Thr 


Gly 


Leu 


Leu 


Asp 


Pro 


Tie 


He 


Asp 


Val 


Lys 


Pro 


Thr 


Ala 


Asn 


Gin 








420 










425 










4 30 






Val 


Asp 


Asp 


Leu 


Met 


Glu 


Glu 


He 


Ala 


Arg 


Cys 


He 


Glu 


Lys 


Lye 


Glu 






4 35 










440 










445 








Arq 


Val 


Leu 


Val 


Thr 


Thr 


Leu 


Thr 


Lys 


Arg 


Met 


Ala 


Glu 


Glu 


Leu 


Ser 




450 










455 










460 










Glu 


Tyr 


Leu 


Leu 


Arg 


His 


Gly 


lie 


Ser 


Thr 


siy 


Tyr 


He 


His 


Ser 


Asp 


465 










470 










475 










400 


val 


Asp 


Thr 


Leu 


Glu 


Arg 


Val 


Arg 


He 


Met 


Glu 


Asp 


Leu 


Arg 


Lys 


Gly 










405 










4?0 










4 95 




Val 


Tyr 


Asp 


Ala 


Leu 


He 


Gly Val 


Asn 


Leu 


Leu 


Arg 


Glu 


Gly 


Leu 


Asp 








500 










505 










S10 






Leu 


Pro 


Glu 


Val 


Ser 


Leu 


Val 


Ala 


He 


Leu 


Asp 


Ala 


Asp 


Lys 


Glu 


Gly 






515 










520 










525 








Phe 


L-u 


Arq 


Ser 


His 


Arg 


Ser 


Leu 


Thr 


Gin 


Thr 


Ala 


Gly Arg 


Ala 


Ala 




530 










535 










5-10 










Arg 


His 


He 


Hie 


Gly Arg 


Val 


He 


Phe 


Tyr 


Ala 


Asp 


Lys 


He 


Thr 


Asp 












550 










555 










560 


Ser 


Met 


Gin 


Leu 


Thr 


Met 


Asp 


Glu 


Thr 


Ala 


Arg Arg Arg Ala 


Lys 


Gin 










565 










570 










575 




Leu 


Ala 


Tyr 


Asn 


Glu 


Ala 


His 


Gly 


He 


Thr 


Pro 


Gin 


Gin 


He 


Val 


Lys 








S80 










585 










S9C 




Asn 


Ser 


Ala 


Ala 


He 


Trp Gly Glu Gly A- a p 


Val 


Ser 


Ala 


Leu 


Gin 


Ser 






595 










600 










605 








Asp 


Thr 


Glu 


Ser 


Gly 


Ala 


Tyr 


lie 


Glu 


Glu 


Ser 


Ser 


Met 


Val 


Ala 


Ala 




610 










615 










620 










Asp 


Pro 


Leu 


Aia 


Asp Tyr 


Leu 


Ser 


Lys 


Pro 


Lys 


Leu 


Glu 


Ala 


Leu 


He 


625 










630 










635 










64 0 


Ala 


Ser 


Thr 


Lys 


Lys 


Gin 


Met 


Leu 


Al« 


Al-A 


Ala 


Lys 


Glu 


Leu 


Asp 


Phe 










645 










650 










6bb 




Leu 


Glu 


Ala 


Ala 


Arg 


Leu 


Axg 


Asp 


Glu 


Ala 


Ala 


Arg 


Leu 


Glu 


Lys 


Lys 








660 










665 










67 0 






Leu 


Glu 


Gin 


Leu 


Thr 


Ala 























675 
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